Le Zhuo

DiffDance creates detailed dance sequences that match music effectively.

2025-09-29T16:31:20+00:00 ― 5 min read

Transform text into images, videos, and audio seamlessly with Lumina-T2X.

2025-08-12T05:14:30+00:00 ― 6 min read

A new model generates images from text descriptions, with potential uses across various industries.

2025-07-02T04:22:30+00:00 ― 5 min read

LLaVA-MoD creates smaller multimodal models using knowledge from larger counterparts.

2025-06-20T22:35:24+00:00 ― 5 min read

A new dataset enhances video understanding and AI reasoning.

2025-05-12T04:00:00+00:00 ― 6 min read