Raunaq Naidu

I'm a systems engineer building at the intersection of real-time audio, on-device ML, and AI infrastructure. At Meta, I lead the Voice AI Platform -- an 8-stage streaming audio pipeline that powers always-on ambient intelligence on Ray-Ban Meta smart glasses, from dual-microphone capture at 48 kHz through privacy-preserving transcription in trusted execution environments to multi-modal retrieval over weeks of conversational memory.

Outside Meta, I build products. I'm the founder of three startups spanning AI-native documents, private social sharing, and enterprise data infrastructure. I care about shipping things that work, building systems that scale, and the craft of turning research into products people use every day.

Experience

Meta Platforms Nov 2016 -- Present

Senior Staff Software Engineer

Overall Technical Lead, Voice AI Platform (2025 -- present) -- Architected an 8-stage streaming audio pipeline for always-on ambient audio on Ray-Ban Meta smart glasses. Own architecture, quality, and delivery across 8 workstreams and 20+ engineers. Shipped 200M RNN-T ASR and a 5M-parameter voice-print speaker-diarization model in a Trusted Execution Environment.
Technical Lead, Autonomous Crash Investigation Agent (2025) -- Production multi-agent system that autonomously diagnoses device crashes across 4+ wearable product lines. Reduced median crash resolution from 13 to 3 days.
TL & EM, Systems Performance (2023 -- 2024) -- Built and managed a team of 10 engineers driving performance optimization for AI-powered wrist wearables.
TL & EM, Display & Graphics (2019 -- 2023) -- Shipped the display software stack on Quest 2, Quest Pro, Quest 3, and Ray-Ban Meta smart glasses. Tens of millions of users.

NVIDIA Jun 2014 -- Nov 2016

Senior Software Engineer

Built Direct Mode rendering for VR (adopted broadly across the VR ecosystem) and HDR display drivers (DisplayPort 1.4) for consumer GPUs used by millions.

Startups & Projects

HTML Docs 1,000+ users

AI-native document platform for agents and humans

A document platform built for the agent era. Any AI agent can publish a polished, hosted web document with a single API call -- no auth required, instant URL, full editing and collaboration from the moment it lands. As AI agents become primary content creators, they need a native output layer that goes beyond markdown dumps and terminal logs.

Agents get a clean POST-HTML-get-URL API, MCP server integration, and a forward proxy. Humans get real-time collaboration, AI-powered Docsmith chat, native PDF viewing with annotation, version history, and Google Docs export.

Founder · 2025 -- present · html-docs.com

Amika Consumer app

Nurture your friendships -- plan memories, events & trips

A private social app designed for the people who actually matter -- your close friends. A digital rolodex meets shared journal: collect and revisit memories, plan events and trips, and build a living archive of your friendships, away from the noise of public social media. No algorithms, no strangers, no performative posting.

Founder · amika.vercel.app

Skybright $400K ARR

Customized ETL for enterprise marketing data

Custom data pipelines for enterprise marketing teams that have outgrown off-the-shelf ETL tools. Bespoke pipelines tailored to each customer's stack and data model -- extraction from all major marketing platforms, schema normalization, cross-touchpoint identity resolution, and clean delivery into the customer's warehouse. A single source of truth for spend, performance, and attribution.

Founder · $400K annual recurring revenue

Education

North Carolina State University

M.S. Computer Engineering, GPA 4.0 · 2014

Thesis: Embedded systems & real-time compute

VJTI, Mumbai University

B.Tech Electronics Engineering, GPA 8.2/10 · 2012

Patent

US20230360566A1 -- Dynamic display brightness and refresh-rate modulation with multi-view image fusion. An adaptive system-control technique for resource-aware scheduling in on-device AI inference.

Technical Areas

Voice & Audio

Streaming pipeline design, Opus/OGG codecs, VAD, echo cancellation, speaker diarization, TEE-based model serving

ML & Retrieval

LLM orchestration (Llama), RAG pipelines, FAISS/DRAGON+/BM25, RNN-T ASR, eval frameworks

Systems

Embedded RTOS, Android/iOS internals, real-time rendering, GPU driver stacks, latency instrumentation

Languages

Python, C, C++, Java, TypeScript · Next.js, FastAPI, PyTorch, LangChain