LLMs Tested with ClassicLLMs Tested with ClassicGamesweaknesses of language models.New benchmark reveals strengths andArtificial IntelligenceBenchmarking Language Models Through Classic GamesAssessing LLM capabilities using grid-based games like Tic-Tac-Toe and Connect Four.2025-07-15T22:27:48+00:00 ― 7 min read