Skip to main content
datadoo
Blog

Notes on physical AI & synthetic data

Notes on physical AI and synthetic data from datadoo experts: engineers, data scientists, and our research team.

Robots Are Shipping. Training Data Is Not.
Research
ResearchMay 5, 20266 min read

Robots Are Shipping. Training Data Is Not.

Humanoids are deploying, $6B flowed into physical AI in Q1 2026, and the bottleneck has shifted from hardware to training data. Physics-accurate synthetic data is the binding constraint.

Read article
The EU AI Act's Article 10 Is an Argument for Synthetic Data
Research
ResearchApr 20, 20268 min read

The EU AI Act's Article 10 Is an Argument for Synthetic Data

Article 10 of the EU AI Act demands training data that is representative, complete, and free of errors. Real-world data rarely meets that bar. Synthetic data does.

Read article
Digital Twins for Training: How We Build Simulation Environments in Omniverse
Engineering
EngineeringApr 8, 20264 min read

Digital Twins for Training: How We Build Simulation Environments in Omniverse

A technical walkthrough of how datadoo builds Digital Twins in NVIDIA Omniverse and uses them to generate physics-accurate training data at scale.

Read article
What We Took Away from GTC 2026
Company
CompanyMar 23, 20264 min read

What We Took Away from GTC 2026

We presented our research on synthetic windshield damage detection at GTC 2026. Here is what we learned, what we heard on the floor, and why Physical AI is moving faster than anyone expected.

Read article
Why Physical AI Starts with Synthetic Data
Research
ResearchMar 12, 20263 min read

Why Physical AI Starts with Synthetic Data

Physical AI systems need training data that obeys the laws of physics. Real-world capture cannot provide it at the scale, speed, or safety required. Synthetic data can.

Read article
Grounded Intelligence: How World Models Can Bridge Today's AI and Physical AI
Research
ResearchFeb 17, 20263 min read

Grounded Intelligence: How World Models Can Bridge Today's AI and Physical AI

World models learn to simulate physical dynamics. They may be the missing link between today's language-centric AI and the embodied systems that need to act in the real world.

Read article
The Synthetic Data Trap: When More Data Makes Your Model Worse
Engineering
EngineeringDec 16, 20253 min read

The Synthetic Data Trap: When More Data Makes Your Model Worse

Generating millions of synthetic images is easy. Generating the right ones is hard. We break down the distribution mismatch problem and how to avoid it.

Read article
Physical AI Needs Physical Truth: Synthetic Data That Obeys the World
Research
ResearchSep 29, 20253 min read

Physical AI Needs Physical Truth: Synthetic Data That Obeys the World

Most synthetic data pipelines optimize for visual fidelity. For physical AI, that is necessary but not sufficient. The underlying physics must be accurate too.

Read article
How Data Shapes AI Behavior: A Synthetic Perspective
Research
ResearchJul 8, 20253 min read

How Data Shapes AI Behavior: A Synthetic Perspective

Training data is not a passive input to model training. It is the primary lever for controlling what a model learns, how it fails, and who it works for.

Read article