Jiangjie Chen

A new dataset tests AI’s ability to reason in real-life situations.

2025-10-21T15:11:00+00:00 ― 5 min read

This paper presents a method to improve how language models interact with tools.

2025-09-17T19:03:48+00:00 ― 6 min read

New watermarking methods improve text variety and detection in machine-generated content.

2025-09-05T23:19:18+00:00 ― 7 min read

A framework to improve how language agents make decisions during complex tasks.

2025-07-31T19:06:42+00:00 ― 5 min read

DetectBench evaluates LLMs on their ability to detect hidden evidence in reasoning tasks.

2025-07-27T05:02:18+00:00 ― 5 min read

This study examines how AI can help find historical analogies for current events.

2025-06-07T07:31:48+00:00 ― 5 min read