Zequn Jie

LLaVA-MoLE enhances multimodal models by using expert routing for better performance.

2025-09-13T14:51:54+00:00 ― 6 min read

Lumen enhances visual task learning through a two-stage process for better AI understanding.

2025-08-29T23:34:42+00:00 ― 7 min read

MindBench improves model evaluation for understanding complex mind maps.

2025-07-20T01:44:24+00:00 ― 5 min read

OV-DINO improves object detection by recognizing object categories not seen in training.

2025-07-15T23:15:12+00:00 ― 6 min read

A new approach improves 3D segmentation using language together with less detailed annotations.

2025-07-14T01:26:00+00:00 ― 5 min read

New framework enhances understanding of images, text, and 3D objects.

2025-01-24T04:30:00+00:00 ― 7 min read