Building a Data Layer for AI Personalization on Robots
Robots that understand people need more than sensors and actuators. They need a memory
architecture — a structured data layer that captures who you are, what you prefer,
and how you change over time. This document lays out the design.
Raunaq Naidu·May 2026·Working draft — open for review
The robotics stack has historically been organized around two problems: perception
(what is in the world?) and control (how do I move through it?). Decades of
research have produced increasingly capable solutions for both. A modern mobile manipulator can
navigate a cluttered kitchen, identify objects on a counter, and execute grasps with sub-centimeter
precision.
And yet, most robots deployed in homes and workplaces today are, frankly, not very useful. Not
because they can't see or move — but because they don't know you.
The core gap
A robot that can fetch a coffee mug is a demo. A robot that knows which mug you want,
that you take oat milk on weekdays and whole milk on weekends, that you don't like being
interrupted before 9 AM, and that your partner's preferences are different — that's a product.
The gap between those two robots is not perception or control. It's a data layer.
This data layer — what we might call the personalization substrate — sits between
the foundation-model reasoning layer and the physical execution layer. It provides the memory,
preferences, and contextual knowledge that make a robot's actions feel intelligent rather than
merely competent.
What "personalization" actually means for robots
Personalization on robots is not a recommendation engine. It spans at least four dimensions:
Preference Memory
Persistent knowledge of a user's likes, dislikes, habits, and routines. Goes beyond explicit
settings — inferred from interaction history over weeks and months.
Contextual Adaptation
Adjusting behavior based on time of day, who else is present, what happened
recently, ambient conditions. The same request at 7 AM and 11 PM may need different responses.
Behavioral Calibration
Speed, chattiness, proactivity, personal space, noise tolerance — the meta-parameters
of how a robot behaves, not just what it does.
Environmental Memory
Persistent spatial knowledge: where things belong, what's changed since last time,
the layout of a space, and named locations ("the good shelf," "Mom's chair").
None of these fit neatly into a SLAM map or an object detector. They require a dedicated data
architecture — one designed for the unique constraints of physical agents operating in human
environments.
The Personalization Gap
2. Architecture: What the Data Layer Looks Like
The personalization data layer is not a single database. It is a composition of stores,
each optimized for a different kind of knowledge, connected through a unified query interface that
the robot's reasoning layer can access in real time.
Personalization Stack
Core data stores
The data layer decomposes into five primary stores:
Timestamped log of every interaction: commands, corrections, feedback, implicit signals
Append-only, time-range queries, aggregation
Event store / time-series DB
Preference Models
Learned preference vectors, Bayesian priors, multi-armed bandit state for ongoing optimization
Read per-decision, batch update on schedule
Model artifact store + KV cache
Context Graph
Relationships between users, objects, locations, routines, and activities
Graph traversal (who → uses → what → when)
Property graph DB
Spatial / Environmental Memory
Persistent map annotations, object placements, named regions, change history
Spatial queries (nearest, within region), temporal diffs
Spatial index + document store
The unified query interface
The reasoning layer should not need to know which store holds what. Instead, it queries a
personalization context API that assembles a coherent context object for
any given situation:
// Pseudocode: what the reasoning layer sees
const ctx = await personalization.getContext({
user: "alice",
location: "kitchen",
time: now(),
task: "prepare_morning_beverage",
history_window: "7d"
});
// ctx contains:
// - alice.preferences.beverages → { weekday: "oat latte", weekend: "cortado" }
// - alice.behavior.morning_tolerance → "low" (don't be chatty)
// - kitchen.objects.mug_locations → [{ name: "alice_favorite", shelf: 2, slot: 3 }]
// - kitchen.recent_changes → [{ item: "oat_milk", status: "low", detected: "2h ago" }]
// - interaction_history.beverage_requests → last 30 days of what was asked/served
This context object is the primary input to the LLM or policy network that decides what to do.
The personalization data layer's job is to make assembling this context fast
(sub-100ms), consistent (no stale preference contradicting a recent correction),
and privacy-preserving (only surface data the current task is authorized to access).
3. Key Design Decisions
On-device vs. cloud
This is the defining architectural tension. On-device keeps data private and latency low, but
limits storage, compute, and fleet-wide learning. Cloud enables richer models and cross-device
continuity, but introduces latency and trust concerns.
The On-Device vs Cloud Tradeoff
On-Device First
Keep by default: user profiles, preference cache, recent interaction history,
hot spatial memory, behavioral calibration state.
Why: Sub-10ms access. Works offline. No data leaves the home. Critical for
user trust — especially in healthcare and domestic settings.
Cloud When Needed
Sync selectively: anonymized interaction patterns for fleet learning,
model updates, cross-device preference sync (if user opts in), long-term archival.
Why: Training data aggregation, backup, multi-robot households, and
enrichment from external knowledge.
Design principle
The robot should be fully functional on its own device data, with cloud
as an enhancement layer, never a dependency. If the network goes down, the
robot still knows your name and how you like your coffee.
Privacy by design
Personalization data is inherently sensitive. The architecture must encode privacy as a structural property, not a policy overlay:
Data minimization: Store derived signals (preference vectors, behavioral summaries) rather than raw sensor feeds. A robot doesn't need to keep video of you making breakfast — it needs to know you make breakfast at 7:15 AM.
Tiered retention: Raw interaction logs age out aggressively (7–30 days). Derived preferences persist longer. Explicit user settings persist indefinitely.
Per-user encryption at rest: Each user's personalization data encrypted with a key derived from their identity. Household-shared data uses a shared key. Robot manufacturer cannot read personalization data without user consent.
Access scoping: Each task execution gets a scoped view. The "make coffee" task can read beverage preferences but not health data. Enforced at the query interface level, not application logic.
Audit log: Every data access is logged. Users can inspect what the robot knows about them and delete any subset.
Real-time inference vs. batch learning
The data layer supports two temporal modes, and getting the boundary right matters enormously:
Dual-Loop Learning Architecture
Fast loop (real-time): Immediate corrections ("no, the blue mug") update a preference cache in-place. Uses simple rules or lightweight Bayesian updates. Must complete in <50ms to not break interaction flow.
Slow loop (batch): Periodically (nightly, or on-device idle), re-trains preference models from the full interaction history. Discovers longer-term patterns: seasonal shifts, evolving routines, preference drift. Produces updated embeddings and model weights that replace the previous versions.
The fast loop is authoritative for recent corrections. The slow loop provides the baseline.
Conflicts are resolved with a recency-weighted merge — explicit recent corrections
always win over statistical patterns.
Schema design for heterogeneous form factors
A home assistant, a warehouse logistics bot, and a healthcare companion share no physical
morphology. But they can share a personalization schema — because personalization is about
the human, not the robot.
The key insight: the core schema is about the person and their context, with
form-factor-specific capabilities layered as extensions. When a household adds a new
robot, it inherits the family's personalization data immediately. Day-one personalization without
a cold-start problem.
4. The AI Stack: Foundation Models Meet Personalization
Foundation models — LLMs, vision-language models, multimodal transformers — are increasingly the
reasoning backbone of robot systems. The personalization data layer connects to these models in
three primary ways:
Embeddings for user intent
Every user interaction is embedded into a shared vector space — not just the words, but the
intent in context. "Make it warmer" means different things when said to a thermostat
robot vs. a beverage-making robot vs. a robot adjusting a blanket.
The personalization layer maintains per-user intent embeddings that capture how
this specific user uses language. Over time, the system learns that when Alice says
"the usual," she means a specific sequence of actions that's different from what Bob means.
Intent Resolution Pipeline
Retrieval-augmented personalization (RAP)
Just as RAG (retrieval-augmented generation) grounds LLMs in external documents, RAP
grounds robot reasoning in personal context. Before the LLM plans an action:
The query is embedded and used to retrieve the top-k relevant personalization records from the vector store.
Retrieved context is injected into the LLM prompt alongside the task specification.
The LLM generates a personalized plan that accounts for the user's specific preferences and history.
This pattern is powerful because it separates the personalization data from the reasoning
model. You can upgrade the LLM without losing personalization. You can transfer
personalization to a new robot without retraining. The data layer is the durable asset; the
model is a replaceable reasoning engine.
Few-shot adaptation
For novel preferences that don't have enough history for statistical learning, the system uses
few-shot adaptation: a small number of examples (often just one explicit
correction) are stored and used as in-context demonstrations for the LLM.
Example: Learning a new preference in one interaction
Day 1: Robot serves Alice tea in a random mug. Alice says, "Use the green mug next time." Stored:{ user: "alice", domain: "tea", key: "mug_preference", value: "green_mug", source: "explicit_correction", confidence: 0.95 } Day 2: Robot serves tea in the green mug without asking. Alice says nothing → implicit confirmation → confidence → 0.98. Day 14: Slow loop detects consistent pattern across 12 interactions. Promotes from "few-shot correction" to "stable preference."
Multimodal grounding
Personalization doesn't only come from language. The data layer ingests signals from multiple
modalities:
Visual: Facial expression → mood estimation → behavioral adjustment (gentler interaction when user appears stressed)
Spatial: Where the user spends time, movement patterns → routine inference
Haptic: Force feedback during handovers → grip preference calibration
Ambient: Time of day, lighting, temperature, noise level → contextual defaults
All signals are fused into the context graph as weighted edges, creating a rich, multimodal
representation of the user's current state and long-term patterns.
5. Concrete Examples Across Domains
Three Domains, One Data Layer
Home assistant robots
Scenario: Multi-person household
A family of four shares a home assistant robot. The personalization layer maintains:
Per-person profiles: Alice (parent, early riser, vegetarian), Bob (parent, night owl, coffee enthusiast), Charlie (teen, gaming schedule), Dana (child, needs supervision for certain tasks).
Household context graph: Shared preferences (thermostat range everyone tolerates), individual overrides (Alice's office is 2° cooler), pet care schedule, shared calendar integration.
Interaction routing: Robot identifies who it's interacting with (voice ID + face recognition → user_id lookup) and loads that person's context. When context is ambiguous (two people in the room), it asks rather than guessing.
Routine learning: After 3 weeks, the robot has learned the family's weekday morning sequence. It pre-positions items, adjusts its proactivity level based on who's awake, and avoids vacuuming during Charlie's sleep-in hours.
Warehouse & logistics robots
Scenario: Adapting to worker patterns
A fleet of mobile robots in a fulfillment center personalizes to individual workers:
Pace matching: Experienced picker Maria works 40% faster than average. The robots assigned to her lane pre-stage items more aggressively and use tighter delivery windows.
Ergonomic adaptation: Worker James has a shoulder injury (flagged in his profile with consent). Robots present items at waist height rather than above-shoulder, even if that's less efficient for packing.
Shift-pattern memory: The fleet knows Monday morning after a holiday weekend will be slower. It adjusts throughput expectations and reduces pressure on new workers.
Handoff preferences: Some workers prefer the robot to wait; others prefer a "drop and go" pattern. Learned from interaction history, not configured manually.
Healthcare companion robots
Scenario: Elderly patient support
A companion robot in an assisted living facility, where personalization is literally life-quality:
Communication calibration: Mrs. Chen speaks softly and prefers Mandarin for emotional topics but English for daily logistics. The robot switches languages based on conversational context, not a static setting.
Medication adherence: The robot knows Mrs. Chen's medication schedule, which pills she consistently forgets (the evening one), and the gentlest effective reminder strategy (visual cue on the table lamp, not a verbal nag).
Mood-aware interaction: On days when Mrs. Chen's activity level drops (detected via interaction frequency), the robot increases proactive social engagement — showing family photos, suggesting a call with her daughter, playing favorite music.
Privacy hierarchy: Medical data is strictly scoped. The robot's general-purpose assistant mode cannot access health records. Only the medication-management task, which runs under a separate permission scope, can read that data.
6. Infrastructure Patterns
Event sourcing for robot interactions
Every interaction is an event. Rather than maintaining mutable state ("Alice's mug preference is green"),
the data layer stores the full event stream ("Alice corrected mug choice to green at timestamp T").
This pattern provides:
Complete audit trail: Every preference can be traced to the interaction that established it.
Temporal queries: "What did Alice prefer three months ago?" is a first-class query, not a reconstruction.
Replayability: If preference models are updated, they can be retrained from the full event history.
Conflict resolution: When a real-time correction conflicts with a batch-learned preference, the event log provides the ground truth.
The interaction history, once embedded, lives in a vector store that supports
semantic retrieval — "find interactions similar to this situation" rather than
"find interactions matching these exact fields."
This powers several key capabilities:
Analogical Reasoning
"Alice liked X in situation Y. This situation Z is similar to Y. She'll probably want
something like X."
Anomaly Detection
"This request doesn't match any prior pattern for this user. Ask for confirmation rather
than assuming."
Preference Transfer
"Alice hasn't used this feature before, but her general preference patterns are similar to
other users who preferred option B."
The vector store runs on-device with a compact index (HNSW with product quantization, typically
under 500MB for a year of household interactions). Cloud sync pushes anonymized embeddings
(not raw events) for fleet-wide model improvement.
Federated learning for fleet-wide personalization
The hardest infrastructure problem: how do you improve personalization across a fleet of thousands
of robots without centralizing raw user data?
Federated Personalization Pipeline
The federated approach works in three stages:
Local training: Each robot trains (or fine-tunes) a personalization model on its local data. This happens during idle periods — overnight, when the house is empty, during charging.
Gradient sharing: Instead of sending raw data, each robot sends model gradients (or parameter deltas) to the aggregation server. These gradients contain statistical patterns but not individual user data. Differential privacy noise is added before upload.
Federated averaging: The server averages gradients across the fleet, producing an improved global model that benefits from the collective experience of all robots. This model is pushed back to each robot as a better starting point for local adaptation.
Privacy guarantee
With differential privacy (ε ≤ 2), it is mathematically provable that the aggregation server
cannot reconstruct any individual user's preferences from the gradient updates. The fleet
gets smarter without any robot sharing its family's data.
Sync engine for multi-robot households
When a household has more than one robot (or when a user interacts with robots in
multiple locations — home, office, car), the personalization data layer needs a
conflict-free sync protocol:
CRDTs for preference state: Last-writer-wins registers for simple preferences. Observed-remove sets for lists (e.g., favorite playlists). Counters for usage statistics.
Causal ordering for corrections: If Alice corrects a preference on Robot A, Robot B must apply that correction before any subsequent interaction — even if B hasn't synced the full event log yet. Priority: explicit corrections > learned preferences > defaults.
Selective sync: Not all data syncs everywhere. The kitchen robot's detailed spatial memory of cabinet contents doesn't need to sync to the car. Sync is scoped by domain and relevance.
7. Open Problems & Research Directions
This architecture is a starting point. Several hard problems remain unsolved or under-explored:
Preference drift detection
People change. A preference learned six months ago may be wrong today. How do you detect
drift (gradual change) vs. noise (one-off exception) vs. context switch
(different behavior in different situations)? Current approaches use sliding-window statistics,
but we lack good benchmarks for evaluating drift detection in human-robot interaction.
Cold start for new household members
When a guest visits or a new family member joins, the robot has zero personalization data. How
much should it infer from household patterns? From demographic priors? From the first few
interactions? The tension between "be helpful immediately" and "don't make assumptions" is
unresolved. Transfer learning from similar users helps but raises its own privacy concerns.
Explainability of personalized decisions
When a robot makes a personalized choice, the user should be able to ask why — and get
a truthful, understandable answer. "I used the green mug because you told me to on March 14"
is good. "My preference vector's cosine similarity was 0.94" is useless. Generating natural-language
explanations from the personalization data layer, traceable to specific events, is an active
research problem.
Multi-agent preference negotiation
In a shared space, preferences conflict. Alice wants the room at 68°F, Bob wants 72°F. The
robot mediates. Current approaches are primitive (average, take turns, defer to whoever asked
last). A principled framework for multi-stakeholder preference resolution — that feels fair
and transparent — is wide open.
Adversarial robustness
What happens when someone deliberately tries to corrupt a robot's personalization model? A
child repeatedly giving incorrect feedback, a malicious visitor injecting false preferences,
or a coordinated attack on the federated learning pipeline. Robustness to adversarial
personalization inputs is largely unstudied in the robotics context.
Cross-cultural personalization
Personalization defaults and interaction norms vary dramatically across cultures. Eye contact,
personal space, silence tolerance, directness of communication, household roles — all are
culturally loaded. The data layer needs cultural context models, but encoding culture as a
set of parameters risks oversimplification and stereotyping. Research needed on respectful,
adaptive cultural calibration.
Benchmarks we need
The field lacks standard benchmarks for evaluating personalization quality in embodied agents.
We need:
PersonaBench: Standardized user profiles with known preferences, routines, and correction patterns. Evaluate how quickly a system learns and how accurately it personalizes.
DriftSim: Simulated preference evolution over time — seasonal changes, life events, gradual shifts. Evaluate adaptation and staleness detection.
MultiUserSim: Household simulation with conflicting preferences, shared resources, and varying interaction frequencies. Evaluate fairness and negotiation.
PrivacyAudit: Automated tools that verify a personalization system doesn't leak data across user boundaries, across permission scopes, or to external services.
8. The Agent Infrastructure Thesis
Here's the bigger picture: robots are physical agents. And the personalization
data layer they need is structurally identical to what software agents need.
Consider what a good AI coding agent maintains about its user: coding style preferences, project
context, past decisions, preferred tools, communication style, review feedback history. Now
consider what a home robot maintains: interaction style preferences, environmental context,
past decisions, preferred routines, communication calibration, correction history.
These are the same primitives.
Shared Agent Personalization Primitives
The implication is that the personalization data layer should not be built as a robotics-specific
system. It should be built as a general agent personalization infrastructure
that physical agents consume alongside software agents.
This convergence is already happening in practice. Systems like LangMem, Mem0, and Zep provide
memory and personalization layers for LLM-based agents. The missing piece is extending these
to handle the additional modalities and constraints of embodied agents: spatial data, real-time
sensor streams, physical safety constraints, and multi-user environments.
What a unified agent personalization platform looks like
The company that builds this unified layer — agent personalization infrastructure that works
for both software and physical agents — is building one of the most important pieces of the
AI stack. Every agent, whether it lives in a terminal or a robot body, needs to know its user.
The data layer that enables that knowledge is the foundation everything else is built on.
Summary
Robots don't fail because they can't perceive or act. They fail because they don't remember
and don't adapt. The personalization data layer — user profiles, interaction history,
preference models, context graphs, and environmental memory — is the missing
infrastructure that turns competent machines into useful ones.
And the best part: this infrastructure isn't specific to robots. It's the same memory and
personalization substrate that every AI agent needs. Build it once, and you've built the
foundation for both the software agents of today and the physical agents of tomorrow.
Published on HTML Docs·Open for review and comments