Rongrong Ji

A new method enhances the efficiency of VLP models for real-world tasks.

2025-09-30T18:27:18+00:00 ― 5 min read

FocSAM enhances interactive segmentation with improved stability and accuracy.

2025-08-06T01:22:48+00:00 ― 4 min read

A new method to enhance language models' performance with long texts.

2025-07-23T20:51:12+00:00 ― 5 min read

HRSAM improves image segmentation efficiency and accuracy for high-resolution inputs.

2025-07-20T15:41:48+00:00 ― 5 min read

RoE, a new method, improves the efficiency of multimodal large language models through dynamic routing.

2025-07-10T02:38:00+00:00 ― 7 min read

This method simplifies adding objects to images with text prompts, ensuring natural results.

2025-07-08T14:33:24+00:00 ― 6 min read

This approach enhances multimodal models without extensive retraining.

2025-07-04T06:24:30+00:00 ― 6 min read

A new method enhances the efficiency and performance of multimodal large language models.

2025-06-30T21:33:54+00:00 ― 5 min read

Learn the essential steps for formatting your paper for submission.

2025-06-25T13:11:24+00:00 ― 5 min read

PartFormer enhances object recognition across varying conditions using Vision Transformers.

2025-06-20T07:26:54+00:00 ― 6 min read

A new method improves image matching across different camera spectra.

2025-06-01T07:52:54+00:00 ― 5 min read

Video-RAG simplifies how computers analyze long videos by drawing on extra information.

2025-05-15T21:30:40+00:00 ― 5 min read

A new approach makes multimodal models faster and more efficient.

2025-04-30T19:40:00+00:00 ― 5 min read