Loading
Loading
This Benchmark tests AI’s ability to handle sequential decision-making in business contexts, focusing on game theory and strategic reasoning failures. The benchmark tests understanding of sequential-move (Stackelberg-type) games in microeconomics and the correct application of backward induction to find subgame perfect equilibria. Typical AI failure patterns include: 1.Incorrect order of reasoning (e.g., treating the follower as the leader), 2.Ignoring subgame perfect equilibrium logic, 3.Introducing irrelevant “reputation” or “collusion” effects not in the prompt, 4.Jumping directly to intuitive but non-equilibrium answers.
99
Total Prompts
1407
Scored Responses
10
Contributors
25%
Average Overall Score
| Rank | Model | Avg. Score | Prompts Tested | Avg. Response Time |
|---|---|---|---|---|
No leaderboard data available for this Benchmark yet.
Model evaluations will appear here once tests are run.