Raunaq Naidu

Senior Staff Engineer at Meta · Voice AI & Audio Infrastructure · Founder

I'm a systems engineer building at the intersection of real-time audio, on-device ML, and AI infrastructure. At Meta, I lead the Voice AI Platform -- an 8-stage streaming audio pipeline that powers always-on ambient intelligence on Ray-Ban Meta smart glasses, from dual-microphone capture at 48 kHz through privacy-preserving transcription in trusted execution environments to multi-modal retrieval over weeks of conversational memory.

Outside Meta, I build products. I'm the founder of three startups spanning AI-native documents, private social sharing, and enterprise data infrastructure. I care about shipping things that work, building systems that scale, and the craft of turning research into products people use every day.

Experience

Meta Platforms Nov 2016 -- Present
Senior Staff Software Engineer
  • Overall Technical Lead, Voice AI Platform (2025 -- present) -- Architected an 8-stage streaming audio pipeline for always-on ambient audio on Ray-Ban Meta smart glasses. Own architecture, quality, and delivery across 8 workstreams and 20+ engineers. Shipped 200M RNN-T ASR and a 5M-parameter voice-print speaker-diarization model running inside a Trusted Execution Environment.
  • Technical Lead, Autonomous Crash Investigation Agent (2025) -- Built a production multi-agent system that autonomously diagnoses device crashes across 4+ wearable product lines. Reduced median crash resolution time from 13 to 3 days.
  • Technical Lead & Engineering Manager, Systems Performance (2023 -- 2024) -- Built and managed a team of 10 engineers driving performance optimization for AI-powered wrist wearables.
  • Technical Lead & Engineering Manager, Display & Graphics (2019 -- 2023) -- Shipped the display software stack on Quest 2, Quest Pro, Quest 3, and Ray-Ban Meta smart glasses. Products reaching tens of millions of users.
NVIDIA Jun 2014 -- Nov 2016
Senior Software Engineer
Built Direct Mode rendering for VR (adopted broadly across the VR ecosystem) and HDR display drivers (DisplayPort 1.4) for consumer GPUs used by millions.

Startups & Projects

HTML Docs 1,000+ users

AI-native document platform for agents and humans

HTML Docs is a document platform built for the agent era. Any AI agent can publish a polished, hosted web document with a single API call -- no auth required, instant URL, full editing and collaboration from the moment it lands. The thesis: as AI agents become primary content creators, they need a native output layer that goes beyond markdown dumps and terminal logs. HTML is the new markdown.

The platform serves both sides: agents get a clean POST-HTML-get-URL API, MCP server integration, and a forward proxy for building rich documents. Humans get a full editing environment with real-time collaboration, AI-powered Docsmith chat for in-document editing, native PDF viewing with annotation, version history, and Google Docs export.

Founder · 2025 – present · html-docs.com

Amika Consumer app

Nurture your friendships -- plan memories, events & trips with friends

Amika is a private social app designed for the people who actually matter -- your close friends. A digital rolodex meets shared journal: collect and revisit memories, plan events and trips, and build a living archive of your friendships, away from the noise of public social media.

No algorithms, no strangers, no performative posting. Just the people you care about and the moments you want to remember.

Founder · amika.vercel.app

Skybright $400K ARR

Customized ETL for enterprise marketing data

Skybright builds custom data pipelines for enterprise marketing teams that have outgrown off-the-shelf ETL tools. Marketing data is fragmented across dozens of platforms, each with its own schema, API quirks, and rate limits. Skybright solves this with bespoke pipelines tailored to each customer's specific stack and data model -- extraction, schema normalization, cross-touchpoint identity resolution, and clean delivery into the customer's warehouse.

Founder · $400K annual recurring revenue

Education

North Carolina State University
M.S. Computer Engineering, GPA 4.0 · 2014
Thesis: Embedded systems & real-time compute. Coursework: machine learning, computer vision, parallel computing.
VJTI, Mumbai University
B.Tech Electronics Engineering, GPA 8.2/10 · 2012

Patent

US20230360566A1 -- Dynamic display brightness and refresh-rate modulation with multi-view image fusion. An adaptive system-control technique for resource-aware scheduling in on-device AI inference.

Technical Areas

Voice & Audio Streaming pipeline design, Opus/OGG codecs, VAD, echo cancellation, speaker diarization, TEE-based model serving ML & Retrieval LLM orchestration (Llama), RAG pipelines, FAISS/DRAGON+/BM25 vector retrieval, RNN-T ASR, eval frameworks Systems Embedded RTOS, Android/iOS internals, real-time rendering, GPU driver stacks, latency instrumentation Languages Python, C, C++, Java, TypeScript · Next.js, FastAPI, PyTorch, LangChain