Ming Yan

A new approach improves efficiency in Vision-Language Pre-training tasks.

2025-10-11T17:07:48+00:00 ― 6 min read

A new method enhances stance detection for smaller language models using external knowledge.

2025-10-02T00:28:30+00:00 ― 5 min read

A new model improves the recovery of sparse signals in noisy environments.

2025-09-27T17:32:57+00:00 ― 7 min read

TRIPS enhances efficiency in vision-language tasks by selecting relevant image patches.

2025-09-17T20:38:36+00:00 ― 7 min read

A new approach using multi-agent systems to enhance smaller language models.

2025-09-17T04:26:54+00:00 ― 6 min read

This article discusses a new framework for assessing hallucinations in LVLMs.

2025-09-04T12:02:06+00:00 ― 6 min read

A new benchmark evaluates how role-playing agents interact socially.

2025-08-27T12:43:24+00:00 ― 6 min read

A new framework improves how language agents learn and perform tasks.

2025-08-27T05:28:54+00:00 ― 6 min read

A new framework improves efficiency and accuracy in solving complex physical problems.

2025-08-01T22:06:12+00:00 ― 6 min read

MIBench tests multimodal models' performance on multiple images.

2025-07-09T14:23:18+00:00 ― 6 min read

mPLUG-Owl3 improves understanding of images and videos for better responses.

2025-06-30T17:13:12+00:00 ― 6 min read

A new method to combine language models more effectively.

2025-06-29T22:23:30+00:00 ― 6 min read

MaVEn enhances AI's ability to process multiple images for better reasoning.

2025-06-23T15:38:00+00:00 ― 5 min read