Gangshan Wu

A novel method improves the ability of Vision-Language Models to adapt to new tasks.

2025-10-06T20:44:12+00:00 ― 5 min read

JointFormer enhances VOS by integrating feature extraction, matching, and memory management.

2025-10-04T14:21:30+00:00 ― 5 min read

SportsHHI focuses on human interactions in basketball and volleyball videos for improved analysis.

2025-08-21T20:58:30+00:00 ― 5 min read

A new framework enhances the adaptability of vision-language models through smart data processing.

2025-07-18T17:05:12+00:00 ― 6 min read

A new method improves voice separation in noisy settings with multiple speakers.

2025-07-09T16:53:50+00:00 ― 5 min read

Self-TPT simplifies prompt tuning for vision-language models, improving speed and efficiency.

2025-06-29T10:40:24+00:00 ― 7 min read