Dingjie Song

AceGPT enhances Arabic language processing tailored for local culture and values.

2025-09-23T18:42:42+00:00 ― 5 min read

New benchmarks reveal challenges for MLLMs in real-world tasks with long contexts.

2025-08-15T10:16:00+00:00 ― 7 min read

LongLLaVA improves multi-image understanding for various applications.

2025-06-17T07:57:12+00:00 ― 5 min read

TRIM method reduces image tokens in multi-modal language models while maintaining performance.

2025-06-10T11:06:24+00:00 ― 5 min read

A new framework identifies when multimodal models use inappropriate training data.

2025-05-29T07:11:33+00:00 ― 5 min read