About

Boutique data ops for the teams actually building robots.

Train Them AI collects egocentric video, runs teleoperation sessions, and delivers policy-ready VLA datasets for Seed–Series B robotics teams that can't afford Scale AI's $500K minimums — but need the same quality.

Why we exist

The bottleneck for most robotics teams isn't model architecture. It's data: collecting demonstrations, segmenting subtasks, normalizing action vectors, and shipping in the exact format the training loop expects. Large vendors lock that work behind enterprise contracts. Academic labs can't move at startup speed.

We sit in the middle: a small, format-native team that can ship a 300-demo annotated pilot in two weeks, on your hardware, in LeRobot, RLDS, HDF5, or robomimic. Every delivery includes a policy smoke test before we hand over the data — no other vendor does this.

Who's behind it

Diego Pousa

Founder · Laboulaye, Argentina

Diego develops automated action labeling AI at a venture-backed AI company by day. Train Them AI applies the same techniques — fine-grained temporal segmentation, automated QC scoring, and action vector normalization — to the data ops layer most mid-market robotics teams have to build internally.

TTAI runs as a distributed operation: a vetted operator network across Latin America and the Philippines, on-site teleoperation with portable Quest 3 + GELLO setups, and a QC pipeline that validates every minute of capture before it ships.

LinkedIn ↗ diego.pousa@trainthemai.com

What we've shipped

60+

Operators onboarded

300+

Weekly demo capacity

2 wk

Brief to delivery

Continents covered

How we work

No hardware minimums. No 6-month sales cycles. Pilots start at 100 demos. We slot into your pipeline wherever you need us — collection, annotation, or both. We deliver in your target format with a full data card and a smoke-test report. If the policy doesn't converge on 10% of the data, we fix the data before you ever see it.

Start a Pilot → Become a Collaborator