Rao Muhammad Anwer

Examining foundational models that combine vision and language for diverse applications.

2025-10-16T00:53:00+00:00 ― 5 min read

A new method allows models to recognize both known and unknown objects.

2025-09-22T03:12:42+00:00 ― 7 min read

New methods reduce human labeling while improving object detection accuracy.

2025-09-04T08:36:42+00:00 ― 7 min read

New framework improves video searches by combining visuals and detailed language descriptions.

2025-08-26T00:23:00+00:00 ― 6 min read

Open-YOLO 3D enhances 3D instance segmentation with speed and accuracy.

2025-08-02T14:25:48+00:00 ― 7 min read

New methods expose vulnerabilities in medical models through backdoor attacks.

2025-06-27T20:37:18+00:00 ― 5 min read

New tools improve how we describe changes in satellite images over time.

2025-06-06T22:03:00+00:00 ― 5 min read