Wenting Zhao

This research focuses on improving question reformulation for better user interactions.

2025-07-07T18:16:48+00:00 ― 8 min read

A new benchmark evaluates LLMs for factual accuracy.

2025-07-07T18:08:54+00:00 ― 6 min read

Explore the need for an open feedback system to improve AI responses.

2025-06-27T17:51:24+00:00 ― 5 min read

Language models excel at memory tasks but struggle with reasoning challenges.

2025-06-24T14:08:54+00:00 ― 5 min read

A tool to analyze chat logs quickly and effectively for researchers.

2025-06-16T17:36:06+00:00 ― 5 min read

Research focuses on improving language models' ability to understand longer texts.

2025-06-10T03:36:06+00:00 ― 8 min read

Large Language Models enhance code summarization assessments with creative evaluations.

2025-04-23T14:57:45+00:00 ― 6 min read

Examining issues in community-driven chatbot evaluations and ways to improve them.

2025-04-11T18:18:00+00:00 ― 5 min read