A simulation benchmark where humans and AI models make real marketing decisions, and their causal reasoning is put to the test.
New: 29 LLMs walk into a marketing sim, here's what they did →
Top scores across humans and AI models
Want a deeper look at how models behave on this task? Read the analysis →