Enabling trustworthy AI adoption for transportation.
Developed with transportation professionals, for transportation work.
Overall Leaderboard
| Model | ELO Rating | TB-Safety | TB-Planning Soon | TB-Operations Soon | TB-Aviation Soon |
|---|---|---|---|---|---|
| 1Claude Opus 4.6 | 1231 | 73.3% | - | - | - |
| 2Gemini 3 Flash Preview | 1216 | 56.4% | - | - | - |
| 3GPT-5 mini | 1200 | 58.1% | - | - | - |
| Gemini 3 Pro Preview | 1185 | 60.4% | - | - | - |
| Claude Haiku 4.5 | 1169 | 71.4% | - | - | - |
| GPT-5.2 | 1105 | 67.6% | - | - | - |
| Claude Sonnet 4.6 | - | 65.7% | - | - | - |
How It Works
Help Improve TransportationBench
Contribute real-world use cases from your transportation work. Get early access to new features and publication credit.
Contribute a Use Case