Assessing LLMs in GameAssessing LLMs in GameEnvironmentsand teamwork.New benchmark for evaluating LLM skillsComputation and LanguageEvaluating Large Language Models in Multi-Agent EnvironmentsNew benchmark assesses LLMs' skills in interacting with multiple agents.2025-09-04T00:58:30+00:00 ― 12 min read