Autonomous Driving Technology
Architectural approaches used in self-driving vehicle systems, spanning robotaxi platforms, consumer ADAS, and the full-stack suppliers powering both. Each approach reflects different first-principles tradeoffs between safety, scalability, sensor cost, and certification path.
The Core Architectural Spectrum
No single blueprint has emerged as the winner. The industry is running parallel bets on fundamentally different architectures — from HD map-dependent robotaxi systems at one extreme, to camera-only end-to-end neural networks at the other, with multi-modal sensor fusion occupying the middle ground.
Approach 1 — HD Map-Dependent (Waymo)
The map-first approach pre-builds centimeter-accurate HD maps of the operating environment. Onboard sensors answer primarily one question: where am I within this known map? This offloads world-knowledge to an offline, validated map and shrinks the real-time inference problem to a localization task. LiDAR is the primary sensor for 3D spatial reconstruction.
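To make the "localization within a known map" framing concrete, here is a minimal sketch of one classical technique for it: a particle filter that re-weights pose hypotheses by how well simulated LiDAR returns match a prior occupancy grid. Everything here (grid size, beam model, noise parameters) is an illustrative toy, not any vendor's implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Prior "HD map": a 100x100 occupancy grid (1 = obstacle), built offline.
grid = np.zeros((100, 100))
grid[40:60, 70] = 1.0  # a wall segment at x = 70

def raycast(x, y, theta, max_range=50):
    """March along a ray until the prior map reports an obstacle."""
    for r in range(1, max_range):
        cx, cy = int(x + r * np.cos(theta)), int(y + r * np.sin(theta))
        if not (0 <= cx < 100 and 0 <= cy < 100) or grid[cy, cx] > 0:
            return r
    return max_range

def measurement_update(particles, measured, beam_angles, sigma=2.0):
    """Re-weight pose hypotheses by agreement with the scan, then resample."""
    weights = np.empty(len(particles))
    for i, (x, y, th) in enumerate(particles):
        expected = np.array([raycast(x, y, th + a) for a in beam_angles])
        weights[i] = np.exp(-np.sum((expected - measured) ** 2) / (2 * sigma**2))
    weights /= weights.sum()
    idx = rng.choice(len(particles), size=len(particles), p=weights)
    return particles[idx]

# The true pose is unknown to the filter; the offline map does the heavy
# lifting, reducing real-time inference to "which hypothesis fits the map?"
true_pose = (30.0, 50.0, 0.0)
beams = np.linspace(-0.5, 0.5, 5)
scan = np.array([raycast(true_pose[0], true_pose[1], true_pose[2] + a)
                 for a in beams])

particles = np.column_stack([rng.uniform(10, 90, 200),      # x hypotheses
                             rng.uniform(10, 90, 200),      # y hypotheses
                             rng.uniform(-0.2, 0.2, 200)])  # heading
for _ in range(3):
    particles = measurement_update(particles, scan, beams)
print("pose estimate (x, y, theta):", particles.mean(axis=0).round(1))
```

The design point the sketch illustrates: the expensive world knowledge lives in `grid`, validated offline, while the onboard computation is a comparatively cheap hypothesis-scoring loop.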
- World Knowledge Source: External HD map (pre-built, continuously updated)
- Primary Sensors: LiDAR, cameras, radar
- Map Dependency: High — map must exist before deployment
- Scalability Ceiling: Constrained by map coverage and maintenance cost
- Certification Path: Clearest — formal Operational Design Domain (ODD) boundaries
- Novel Environment Handling: Degrades at map edges; fails outside mapped area
- Commercial Status: Most mature; robotaxi operations in select cities
- Key Strength: Highest proven safety within geofenced ODD
- Key Constraint: Map creation, freshness, and geographic coverage are hard limits on scale
Primary Practitioners
- Waymo (Alphabet): Canonical example; commercially deployed in San Francisco, Phoenix, and Austin; LiDAR + camera + radar + HD map; Uber and Moove fleet partnerships
- Mobileye REM: Road Experience Management — crowdsourced HD map updated by fleet cameras; powers SuperVision and Drive systems
- Baidu Apollo: China's leading map-first robotaxi platform; commercially deployed in Wuhan, Beijing, Chongqing
- AutoX: Map-dependent robotaxi platform; operating in Shenzhen
Approach 2 — Vision-Only/End-to-End Neural (Tesla)
No LiDAR, no HD maps. Camera-only (or camera-primary) perception feeds a large end-to-end neural network trained on massive fleet mileage. World knowledge is encoded in model weights rather than a static map. The vehicle reasons about novel situations from pixels to action, much as a human driver does. Scalability is bounded by training data volume and compute, not geographic map coverage.
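As a concrete (and deliberately tiny) illustration of the pixels-to-action idea, the sketch below maps a batch of camera frames straight to steering and acceleration and trains by imitation on logged human controls. The architecture, layer sizes, and loss are placeholders, not Tesla's actual stack.

```python
import torch
import torch.nn as nn

class TinyDrivingPolicy(nn.Module):
    def __init__(self):
        super().__init__()
        # Perception backbone: compress camera frames into a feature vector.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=5, stride=2), nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=5, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        # Policy head: features straight to [steering, acceleration].
        self.head = nn.Sequential(
            nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 2),
        )

    def forward(self, frames):  # frames: (batch, 3, H, W)
        return self.head(self.encoder(frames))

policy = TinyDrivingPolicy()
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-4)

# One imitation-learning step on (camera frame, human control) pairs drawn
# from fleet logs -- random tensors stand in for real data here.
frames = torch.randn(8, 3, 128, 256)   # batch of camera images
human_controls = torch.randn(8, 2)     # logged [steer, accel]
optimizer.zero_grad()
loss = nn.functional.mse_loss(policy(frames), human_controls)
loss.backward()
optimizer.step()
print("imitation loss:", loss.item())
```

Note that there is no map lookup anywhere in the loop: all "knowledge" about roads, signs, and other agents would have to be absorbed into the weights from fleet data, which is exactly why training data distribution becomes the binding constraint.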
- World Knowledge Source: Trained neural network weights (fleet data)
- Primary Sensors: Cameras only (Tesla), or cameras + radar (XPeng)
- Map Dependency: None
- Scalability Ceiling: Highest — scales globally with fleet size, not map coverage
- Certification Path: Hardest — black-box neural networks resist formal verification
- Novel Environment Handling: Generalizes in principle; dependent on training data distribution
- Commercial Status: Supervised consumer deployment; unsupervised robotaxi emerging
- Key Strength: No geographic dependency; largest possible training fleet; lowest hardware cost per vehicle
- Key Constraint: Edge cases outside training distribution; regulatory scrutiny increasing
Primary Practitioners
- Tesla FSD: Camera-only; end-to-end neural network; v14.3 (in testing March 2026, wide release ~April 2026) adds a reinforcement-learning reasoning layer on top of the fast-reaction neural stack — neural network reportedly 10× larger than prior versions; unsupervised robotaxi operating in Austin since January 2026; HW4 (AI4) vehicles primary platform
- comma.ai (openpilot): Open-source vision-primary ADAS; runs on third-party hardware across many OEM vehicles; community-trained models
- Wayve (UK): End-to-end neural approach; GAIA world model; operating in London; backed by SoftBank and Microsoft
Approach 3 — Multi-Modal Sensor Fusion (Rivian)
Combines rich sensor stacks (LiDAR + cameras + radar) with end-to-end neural training — taking the redundancy and 3D spatial accuracy of the map-heavy camp but replacing the static HD map with a trained large driving model. LiDAR point clouds feed a data flywheel for continuous model improvement rather than a pre-built map. Positioned as the safety-optimized path to scalable autonomy.
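One common fusion pattern, sketched below with made-up calibration values: project LiDAR points into the camera image and append a sparse metric-depth channel, so the driving model sees appearance and 3D structure together. This illustrates the general technique, not Rivian's implementation.

```python
import numpy as np

H, W = 64, 96
K = np.array([[80.0, 0.0, W / 2],    # toy pinhole intrinsics
              [0.0, 80.0, H / 2],    # (real stacks use per-vehicle
              [0.0, 0.0, 1.0]])      #  factory calibration)

def lidar_to_depth_channel(points_cam):
    """points_cam: (N, 3) LiDAR points already in the camera frame."""
    depth = np.zeros((H, W))
    pts = points_cam[points_cam[:, 2] > 0.1]  # keep points ahead of the lens
    uvw = (K @ pts.T).T                       # pinhole projection
    u = (uvw[:, 0] / uvw[:, 2]).astype(int)
    v = (uvw[:, 1] / uvw[:, 2]).astype(int)
    ok = (0 <= u) & (u < W) & (0 <= v) & (v < H)
    depth[v[ok], u[ok]] = pts[ok, 2]          # sparse metric depth, meters
    return depth

# Fuse: RGB from the camera plus one LiDAR depth channel -> (4, H, W),
# which then feeds the driving model instead of a pre-built map.
rgb = np.random.rand(3, H, W)                 # stand-in camera frame
points = np.random.uniform([-5, -2, 1], [5, 2, 30], size=(500, 3))
fused = np.concatenate([rgb, lidar_to_depth_channel(points)[None]], axis=0)
print("fused tensor shape:", fused.shape)
```

The same projection also explains the "data flywheel" claim in the paragraph above: LiDAR depth gives every fleet-collected camera frame a metric ground-truth signal for free, improving training labels rather than feeding a static map.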
- World Knowledge Source: Large Driving Model trained on multi-modal fleet data
- Primary Sensors: LiDAR + multiple cameras + radar (no ultrasonic in latest gen)
- Map Dependency: Low to none; LiDAR provides real-time 3D spatial grounding
- Scalability Ceiling: Moderate-high — no map dependency, but higher per-vehicle hardware cost
- Certification Path: Moderate — richer sensor redundancy aids safety case
- Novel Environment Handling: Strong — LiDAR provides ground truth for edge cases
- Commercial Status: Consumer ADAS deployed; robotaxi 2027–2028 target
- Key Strength: Richest perception stack; LiDAR grounds training data quality
- Key Constraint: Higher per-vehicle cost; smaller fleet data flywheel than Tesla at present
Primary Practitioners
- Rivian (R2 Gen 3 platform): ACM3 compute (5B pixels/sec); RAP1 chip (1,600 TOPS, dual); 11 cameras (65MP total), 5 radars, 1 LiDAR (front roof, flush-mounted); Large Driving Model trained with GRPO (see the sketch after this list); Gen 3 hardware arriving in R2 late 2026; Uber partnership announced March 2026 — up to 50,000 autonomous R2 robotaxis, $1.25B investment, first deployments San Francisco and Miami 2028
- Mercedes Drive Pilot: L3 certified on German Autobahn; LiDAR + cameras + radar + HD map; geofenced highway use; first L3 system legally certified in the US (Nevada)
- GM Ultra Cruise: Hands-free on 400,000+ miles of roads; LiDAR + cameras + radar; map-assisted but broader than Waymo ODD
- Lucid / Nuro: Gravity SUV platform with Nuro autonomy stack; Uber partnership for robotaxi deployment
- Mobileye SuperVision: Camera + radar; proprietary RSS safety model; supplied to Zeekr, SAIC, and others
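For readers unfamiliar with GRPO (Group Relative Policy Optimization), referenced in the Rivian entry above: the core idea is to score a group of candidate rollouts per scenario and compute each rollout's advantage relative to the group's own mean, avoiding a separately learned value network. A minimal sketch with made-up reward numbers:

```python
import numpy as np

def grpo_advantages(group_rewards):
    """Advantage of each rollout relative to its own group's baseline."""
    r = np.asarray(group_rewards, dtype=float)
    return (r - r.mean()) / (r.std() + 1e-8)

# Six candidate trajectories for one driving scenario, scored by a reward
# model (safety margin, comfort, progress -- all hypothetical numbers).
rewards = [0.9, 0.4, 0.7, 0.2, 0.8, 0.5]
print(grpo_advantages(rewards).round(2))
# Above-average trajectories get positive advantage; the policy gradient
# then up-weights their actions, typically via a PPO-style clipped ratio.
```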
Approach 4 — Chinese Vision-Primary/Large Driving Model (XPENG)
Chinese OEMs have largely converged on a vision-primary, map-free, end-to-end neural architecture — closely analogous to Tesla FSD but with faster iteration cycles enabled by government data policy, large domestic fleets, and aggressive AI infrastructure investment. Several companies have moved beyond pure imitation-learning toward hybrid reasoning architectures incorporating large language models.
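The fast/slow split behind these hybrid architectures can be sketched in a few lines: a reactive policy runs every control tick, while a slower reasoning layer (an LLM in XBrain-style systems, stubbed here with string matching) updates high-level intent at a lower rate. All components below are illustrative stand-ins.

```python
def fast_policy(observation, intent):
    """Fast loop (~10-50 Hz): perception features + current intent -> controls."""
    steer = 0.5 if intent == "change_lane_left" else 0.0
    accel = -1.0 if observation["obstacle_ahead"] else 0.3
    return {"steer": steer, "accel": accel}

def reasoning_layer(scene_summary):
    """Slow loop (~1 Hz): deliberates over a text-like scene summary.
    A real system would query a driving-tuned LLM here."""
    if "stalled vehicle" in scene_summary:
        return "change_lane_left"
    return "keep_lane"

intent = "keep_lane"
for tick in range(10):
    obs = {"obstacle_ahead": tick >= 5}  # hazard appears mid-run
    summary = "stalled vehicle in lane" if obs["obstacle_ahead"] else "clear road"
    if tick % 5 == 0:                    # reasoning runs every 5th tick
        intent = reasoning_layer(summary)
    print(tick, intent, fast_policy(obs, intent))
```

The point of the split is latency budgeting: the vehicle never waits on slow deliberation to brake, but deliberation can still redirect behavior (here, a lane change) within a second or so.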
- World Knowledge Source: End-to-end large driving models; LLM reasoning layer
- Primary Sensors: Vision-primary; radar; some retain LiDAR for premium trim
- Map Dependency: None (post-2024 generation; explicit map-free architecture)
- Iteration Speed: Very high — model updates every 1–2 days in some cases
- Commercial Status: Consumer L2+/L3 deployed at scale; L4 robotaxi trials 2026
- Key Strength: Government-backed data access; rapid iteration; cost advantage
- Key Constraint: Geopolitical data restrictions outside China; US/EU tariff exposure; limited global fleet
Primary Practitioners
- XPeng (XNGP): Three-model architecture — XNet (pure-vision 2K occupancy network), XPlanner (neural planning/control), XBrain (LLM reasoning); trained on 1B+ km video data; model iterations every 2 days; Turing AI chip (in-house); VLA 2.0 open-sourced to global partners including Volkswagen (first customer); XNGP licensed to VW for China models from 2026; L4 robotaxi trials in China targeted 2026; expanding to 60+ global markets by end 2026
- Li Auto (AD Max): Vision + LiDAR on premium trims; Navigate on Autopilot citywide; large in-house model team; high-frequency OTA
- NIO (Aquila / Adam): Aquila sensor platform (LiDAR + cameras + radar); Adam supercomputing cluster for training; NIO NAD subscription model
- Xiaomi HyperOS Drive: Camera + radar; in-house Xiaomi autonomous driving team; tightly integrated with HyperOS vehicle stack
- DeepRoute.ai: Third-party E2E system supplier; powers BYD Denza N7, GWM Wey Lanshan, and others
Approach 5 — Full-Stack Supplier (Huawei, Nvidia)
Rather than a vehicle OEM building its own stack, some players supply a complete autonomous driving system — chip, software, and cockpit OS — to multiple OEM partners. This mirrors the "Android" model for automotive AI: OEMs get proven technology without the full R&D cost; the platform supplier gains scale across many vehicle lines and accumulates cross-fleet data.
Huawei Qiankun ADS
- Model: Full-stack supplier — chip (Ascend/HiSilicon), ADS software, HarmonyOS Cockpit; Huawei does not manufacture vehicles
- Current Version: ADS 4.1 (January 2026); ADS 5.0 + HarmonyOS 6 / HarmonySpace 6 targeted April 2026
- Sensor Suite: LiDAR + 11 HD cameras + 5 radars (ADS 4 generation)
- World Model: ADS 4 introduced world model technology for edge-case reasoning; garage-to-garage navigation without pre-mapped routes since ADS 3.3
- Fleet Data: 7.28 billion km of accumulated assisted driving mileage as of January 2026
- OEM Partners (HIMA Alliance): Seres (AITO), Chery (Luxeed), BAIC BluePark (Stelato), JAC (Maextro), SAIC
- Additional Partners: BYD (Fangchengbao / premium brands), Dongfeng, Changan, GAC, Audi China
- Scale: 80+ models to ship with Qiankun by end 2026; ~3 million cumulative units targeted
- Roadmap: Large-scale L3 highway deployment 2026; L4 urban pilot programs 2026–2027; L3 broad adoption 2027
- Key Constraint: US sanctions limit chip supply chain; currently China-only deployment
Nvidia DRIVE Platform
- Model: Compute platform supplier — architecture-agnostic; supplies across all camps
- Current Silicon: DRIVE Orin (254 TOPS); DRIVE Thor (2,000 TOPS, in production ramp)
- World Model: COSMOS — generative world model for synthetic training data generation; reduces dependence on real-world edge-case collection (see the sketch after this list)
- Simulation: Omniverse platform for closed-loop AV testing and synthetic scenario generation
- Key Customers: Mercedes-Benz, Volvo/Polestar, BYD, Li Auto, Zoox, Lucid, and dozens of Tier 1 suppliers
- Robotaxi: Nvidia's own robotaxi network announced for 2027 deployment; Uber partnership
- Key Strength: Supplies all architectural camps; dominant in training infrastructure
- Key Constraint: Export restrictions limit China supply; custom silicon from Tesla, Rivian, XPeng, Huawei reduces addressable market
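The synthetic-data idea behind generative world models such as COSMOS can be sketched as a data-mixing loop: rather than waiting for the fleet to encounter rare events, sample them from a generator and blend them into training batches. The generator below is a trivial stub standing in for a real world model; all names and ratios are illustrative.

```python
import random

random.seed(0)

RARE_EVENTS = ["jaywalking pedestrian at night", "debris on highway",
               "oncoming vehicle in wrong lane", "sudden whiteout fog"]

def world_model_generate(prompt):
    """Stand-in for a generative world model: returns a synthetic training
    scenario (in reality, rendered video plus labels) for a description."""
    return {"scenario": prompt, "frames": f"<synthetic clip: {prompt}>"}

def build_training_batch(real_logs, synthetic_ratio=0.25):
    """Mix fleet-collected logs with generated edge cases."""
    n_synth = int(len(real_logs) * synthetic_ratio)
    synth = [world_model_generate(random.choice(RARE_EVENTS))
             for _ in range(n_synth)]
    return real_logs + synth

batch = build_training_batch(
    [{"scenario": f"routine drive {i}"} for i in range(12)])
print(len(batch), "examples,",
      sum("synthetic" in str(s) for s in batch), "synthetic")
```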
Architecture Comparison Table
| Dimension | HD Map-First (Waymo) | Vision-Only Neural (Tesla) | Sensor Fusion (Rivian) | Chinese Vision-LDM (XPeng) | Full-Stack Supplier (Huawei) |
|---|---|---|---|---|---|
| Primary sensing | LiDAR + Camera + Radar | Camera-only | LiDAR + 11 Cam + 5 Radar | Vision-primary + Radar | LiDAR + Camera + Radar |
| HD map dependency | High — core requirement | None | None (data flywheel) | None — explicit map-free | Low (ADS 3.3+) |
| World knowledge source | External HD map | Trained model weights | Large Driving Model | XNet + XBrain LLM | World Model (ADS 4) |
| Custom silicon | Google TPU (cloud training) | FSD Chip / AI5 (Dojo) | RAP1 (1,600 TOPS) | Turing AI Chip | Ascend / HiSilicon |
| Reasoning layer | Structured rule + ML hybrid | RL + reasoning (v14.3) | GRPO-trained LDM | XBrain LLM / VLA 2.0 | World Model (ADS 4) |
| Training data scale | Waymo fleet + simulation | ~6M-vehicle fleet (global) | R1 / R2 consumer fleet | 1B+ km video data | 7.28B km HIMA fleet |
| Robotaxi status | Commercial (SF, Phoenix, Austin) | Unsupervised, Austin (Jan 2026) | 2028 target (Uber deal) | China trials 2026 | L4 urban pilots 2026–27 |
| Geographic scalability | Constrained by map coverage | Global — no map dependency | Global — no map dependency | Global (map-free) | China-focused currently |
| Certification path | Clearest — formal ODD | Hardest — black-box neural | Moderate | China L4 regulatory path | China L3→L4 roadmap |
| Key partnership | Uber, Moove | Self-operated; VW FSD | Uber ($1.25B, 50K vehicles) | Volkswagen (ADAS + chip) | BYD, Seres, BAIC, SAIC |