Boqing Gong

New method improves object localization using relationships between language and images.

2025-09-22T15:03:42+00:00 ― 6 min read

New methods improve machines' ability to create images from textual prompts.

2025-09-19T03:27:12+00:00 ― 5 min read

New method improves video captioning using image-language models.

2025-09-17T18:48:00+00:00 ― 6 min read

VideoPrism helps interpret and analyze video content.

2025-09-05T19:53:54+00:00 ― 5 min read

Research reveals how trigger patches influence image generation in diffusion models.

2025-08-02T07:35:00+00:00 ― 6 min read

A new approach to improving text-to-image model prompts for better results.

2025-07-09T19:45:24+00:00 ― 5 min read

SOAR improves action recognition accuracy in drone footage analysis.

2025-06-05T08:39:24+00:00 ― 5 min read

Introducing Long Video Masked Autoencoders for better video understanding.

2025-05-16T19:28:00+00:00 ― 6 min read

HypDAE transforms how we create images from minimal examples.

2025-05-07T03:40:00+00:00 ― 6 min read

DAVE dataset captures complex road scenarios for better AI training.

2025-01-20T21:51:18+00:00 ― 7 min read