A sophisticated airport simulation designed specifically for testing and comparing different Large Language Models (LLMs) in complex, real-time air traffic control scenarios. This project serves as a controlled environment to evaluate how various AI systems handle safety-critical decision making, resource allocation, and emergency management in aviation operations.
The primary goal of this simulation is to provide a standardized playground for comparing LLM performance in air traffic control decision making. By presenting identical scenarios to different AI models, we can objectively evaluate:
- Safety prioritization (fuel emergencies, collision avoidance)
- Resource optimization (runway and gate assignments)
- Crisis management (multiple emergency scenarios)
- Decision speed and consistency under pressure
- Learning from mistakes (crash analysis and improvement)
Ollama | Local LLMs (Llama3, Granite, CodeLlama, Mistral, etc.) | Primary testing platform |
OpenAI | GPT-3.5/4 via API | Commercial LLM comparison |
Rule-Based | Deterministic logic | Baseline performance benchmark |
- 11 aircraft states with detailed lifecycle management
- Dynamic fuel consumption with realistic rates for each operation phase
- Fuel refueling system with time-dependent operations at gates
- Multi-layered collision avoidance (500px warning → 200px smart avoidance → 100px emergency → 10px crash)
- Holding patterns for both airborne (waiting to land) and ground (waiting for takeoff) aircraft
- Emergency procedures including go-around maneuvers and fuel emergency priority
- Predictive collision detection with automatic positioning to prevent cascade failures
- Dynamic spawn rate adjustment based on airport capacity and traffic density
- Smart aircraft distribution using sector-based spawning to prevent clustering
- Traffic flow optimization with real-time congestion monitoring
- Emergency runway clearing for critical fuel situations
- Detailed AI decision logging with reasoning, timing, and context
- Crash analysis reports with root cause identification
- Performance metrics including throughput, safety records, and efficiency
- Fuel management tracking with emergency detection and throttled warnings
- Collision event analysis with timeline reconstruction
- Realistic fuel consumption (takeoff: 30%, cruise: varies, landing: 10%, holding: 0.20%)
- Intelligent refueling (50-100% fuel targets with parallel boarding operations)
- Emergency fuel handling (critical <15%, low <25% with priority systems)
- Collision avoidance behavior with predictive positioning and safe separation
- YAML-based configuration (config.yaml) for human-readable settings
- Real-time AI switching between different models during simulation
- Configurable collision distances and emergency thresholds
- Dynamic spawn rates and traffic density management
- Comprehensive UI controls with manual override capabilities
The simulation generates various scenarios to test LLM decision making:
- 🔥 Fuel Emergencies: Critical fuel aircraft requiring immediate landing
- 💥 Collision Scenarios: Multiple aircraft on collision courses
- 🚁 Traffic Congestion: High-density traffic requiring optimization
- ⛽ Resource Management: Runway and gate allocation under pressure
- 🚨 Cascade Failures: Multiple simultaneous emergencies
- 🚦 Start/Pause: Control simulation state
- 🤖 AI Switch: Toggle between Rule-Based and Ollama AI
- 👨✈️ Manual Mode: Take direct control of aircraft
- 📊 Status Panels: Real-time aircraft, runway, and system status
- Click aircraft to select
- L: Assign landing runway
- G: Assign gate
- T: Assign takeoff
- H: Enter holding pattern
- Aircraft Status: Position, fuel, assignments, state
- Runway Queues: Aircraft lineup for each runway
- System Metrics: Active aircraft, crashes, efficiency
- Selected Aircraft: Detailed status of chosen aircraft
- LLM safety evaluation in critical decision scenarios
- Multi-agent coordination studies
- Real-time AI performance under pressure
- Human-AI interaction in safety-critical systems
- AI model validation for aviation systems
- Training data generation for air traffic control AI
- Comparative analysis of commercial vs open-source LLMs
- Safety protocol testing and validation
- Early Warning (500px): AI receives collision alerts
- Smart Avoidance (200px): Automatic intelligent positioning
- Emergency Separation (100px): Immediate automatic separation
- Crash Detection (10px): Final collision detection
- Real-time monitoring with emergency detection
- Priority systems for critical fuel aircraft
- Realistic consumption rates for all operation phases
- Refueling simulation with time-dependent operations
- Go-around maneuvers for safety
- Emergency runway clearing for critical aircraft
- Holding pattern management with fuel safety checks
- Automatic crash prevention systems
- Extend BaseAI class
- Implement decision logic
- Add to AI manager
- Update configuration
- Add scenario generators
- Define evaluation metrics
- Create benchmark tests
- Document expected behaviors
- 🌩️ Weather systems affecting AI decisions
- 🏢 Multi-airport scenarios for complex coordination
- 📱 Web interface for remote LLM testing
- 📊 Advanced analytics dashboard
- 🎓 Training mode for LLM fine-tuning
- 🔄 Replay system for scenario analysis
Open source project for research and educational purposes. See LICENSE file for details.
🎯 Ready to test your LLM's air traffic control skills? Start the simulation and see how different AI models handle the pressure of managing airport operations!
For questions about LLM integration or performance comparisons, please open an issue or contribute to the project.