Zhuosheng Zhang

Examining vulnerabilities and safety strategies for LLM-powered scientific agents.

2025-09-10T13:23:42+00:00 ― 6 min read

A new approach enhances reasoning accuracy in language models using selective filtering.

2025-08-24T13:21:36+00:00 ― 6 min read

A new method enhances out-of-distribution detection for AI in math tasks.

2025-08-09T04:33:42+00:00 ― 5 min read

DocBench benchmarks LLM-based systems for reading and responding to various document formats.

2025-07-13T04:45:42+00:00 ― 4 min read