Zilong Zheng

MindDial enhances AI conversations by considering individual beliefs and perspectives.

2025-10-26T19:04:54+00:00 ― 5 min read

MathBench assesses LLMs' math capabilities across various educational stages.

2025-08-09T21:32:48+00:00 ― 5 min read

DiveR-CT improves automated red teaming for better safety assessments.

2025-08-05T02:44:00+00:00 ― 7 min read

A novel approach improves how Transformer models process long texts.

2025-07-24T22:15:54+00:00 ― 6 min read

A new benchmark assesses how effectively video-language models handle inaccuracies.

2025-07-24T17:47:18+00:00 ― 6 min read

A new method helps robots navigate and orient themselves correctly for tasks.

2025-07-14T07:05:42+00:00 ― 7 min read

This method enhances visual reasoning by implementing verification at each reasoning step.

2025-07-02T15:49:48+00:00 ― 7 min read

A framework using memory tokens improves video understanding and interaction.

2025-06-18T08:10:48+00:00 ― 7 min read