Open Source Self-Driving with Comma AI
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “Open Source Self-Driving with Comma AI” inside PodZeus.
In this episode of Practical AI, host Daniel Whitenack and co-host Chris Benson welcome Harold Schaefer, CTO of Comma AI (formerly Kama AI), to discuss the company's open-source self-driving technology, OpenPilot. Schaefer traces the evolution of OpenPilot from a niche hobbyist project to a widely adopted autonomy stack, now used in over 50% of miles driven by users on highways. The conversation dives into the technical architecture of OpenPilot, emphasizing its end-to-end machine learning approach—where raw video input directly produces steering and acceleration commands—unlike traditional systems that rely on intermediate steps like lane detection or traffic light recognition. A key innovation highlighted is the use of a learned simulation (a 'world model') trained via diffusion models to generate photorealistic, physically accurate video sequences that enable the system to learn recovery from mistakes, such as drifting off the lane. The system runs entirely on-device, with training and simulation handled in a centralized data center. Schaefer also discusses the challenges of working with legacy car hardware, the importance of open source for community-driven car support, and the company’s vision for expanding into indoor robotics and general-purpose AI agents. The episode closes with reflections on the future of accessible, open-source robotics that empower users rather than lock them into corporate ecosystems. Key takeaways include: 1) End-to-end autonomy via learned simulation enables robust, scalable training without manual labeling; 2) Open source is essential for community-driven hardware support and user trust; 3) On-device inference ensures privacy and reliability; 4) Future advancements will come from solving low-level control, reinforcement learning, and continual learning; 5) The long-term goal is to create simple, open-source AI tools that make daily life easier—like smarter vacuums and dishwashers—without corporate lock-in. The episode maintains a positive, forward-looking tone, celebrating open innovation and practical AI progress.
OpenPilot uses end-to-end machine learning trained in a learned simulation (world model) to enable autonomous driving without manual labeling.
The system runs entirely on-device, with training and simulation handled in a centralized data center, ensuring privacy and real-time performance.
Open source is critical for enabling community support across diverse car models and fostering user trust and ownership.
Future advancements depend on solving challenges in low-level control, reinforcement learning, and continual learning for real-world adaptability.
The long-term vision is for open-source AI tools that simplify daily tasks—like driving, cleaning, and navigation—without corporate control.
Introduction to Practical AI and Comma AI
The hosts introduce the Practical AI podcast and welcome Harold Schaefer, CTO of Comma AI, to discuss open-source self-driving technology and the evolution of OpenPilot.
The Origins and Evolution of OpenPilot
Harold recounts the early days of OpenPilot as a DIY project, the transition to a productized solution, and how the landscape of autonomous driving has evolved since 2017.
Architecture of OpenPilot: From Sensors to Actions
A detailed breakdown of OpenPilot’s system architecture, including on-device compute, camera input, CAN bus communication, and the end-to-end neural network policy.
End-to-End Learning and the World Model Revolution
“We train our models in a diffusion simulator. The videos are all generated. Waymo and Tesla are exploring these things, but they haven't been as focused on it.”
Open Source as a Strategic and Ethical Imperative
“If you buy a device and you don't get to control what runs on it, and you don't get to know what runs on it, I think you should question whether you really own the device.”
“If you buy a device and you don't get to control what runs on it, and you don't get to know what runs on it, I think you should question whether you really own the device.”
“We train our models in a diffusion simulator. The videos are all generated. Waymo and Tesla are exploring these things, but they haven't been as focused on it.”
“Imitation learning doesn't really work at all for tight feedback loop stuff. You need RL. And by and large, RL just doesn't work.”
Hosts
Guest
Harold Schaefer
person
OpenPilot
product
Comma AI
organization
Daniel Whitenack
person
Chris Benson
person
Waymo
organization
Python
other
Tesla FSD
product
CAN bus
other
C++
other
Get the full intelligence
Search transcripts, export clips, track mentions, and explore all topics from “Open Source Self-Driving with Comma AI” inside PodZeus.
Start discovering podcast insights today
Start with a 7-day trial and explore a growing catalog of popular podcasts. No credit card required.
No credit card required • 7-day trial • Cancel anytime
