About CEO Bench
CEO Bench is an open research benchmark for evaluating large language models on executive leadership tasks. It generates realistic management questions, collects model answers and scores them automatically to build the leaderboard below.
For months, CEOs have been asking, "Can I replace all my workers with AI?" Thanks to CEO Bench, we can now turn the question around: AI can replace the CEO.
As frontier LLMs saturate the benchmark, the next challenge is figuring out how small a model can be and still run the company.
The Python scripts powering this site are included in the repository so you can run your own evaluations or extend the question set. All data and code are released under the MIT License and contributions are welcome.
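To give a feel for the generate, answer, score, and rank loop described above, here is a minimal sketch in Python. It is not the repository's actual code: the `Question` class, the `keyword_score` scorer, the `build_leaderboard` function, and the two toy "models" are all hypothetical names invented for illustration, and the keyword-matching scorer is a stand-in for whatever automatic scoring the real scripts use.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable


@dataclass
class Question:
    """A management question plus keywords a good answer should mention (hypothetical schema)."""
    prompt: str
    keywords: tuple[str, ...]


def keyword_score(question: Question, answer: str) -> float:
    """Toy scorer: fraction of expected keywords found in the answer (assumed, not the real metric)."""
    text = answer.lower()
    hits = sum(1 for keyword in question.keywords if keyword in text)
    return hits / len(question.keywords)


def build_leaderboard(
    questions: list[Question],
    models: dict[str, Callable[[str], str]],
) -> list[tuple[str, float]]:
    """Ask every model every question, average the scores, and sort best first."""
    rows = []
    for name, ask in models.items():
        scores = [keyword_score(q, ask(q.prompt)) for q in questions]
        rows.append((name, mean(scores)))
    return sorted(rows, key=lambda row: row[1], reverse=True)


# Stand-in question set and models; a real run would call an LLM API here.
questions = [
    Question("Should we cut costs or invest?", ("cash", "runway")),
    Question("How do we handle a PR crisis?", ("apologize", "customers")),
]
models = {
    "tiny-model": lambda prompt: "We should synergize.",
    "big-model": lambda prompt: "Protect cash and runway, apologize to customers.",
}

leaderboard = build_leaderboard(questions, models)
for name, score in leaderboard:
    print(f"{name}: {score:.2f}")
```

Swapping the lambdas for real model calls and the keyword scorer for a stronger grader would keep the same overall shape; consult the scripts in the repository for how the site actually does it.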
