Heng Ji

This work assesses how well VLMs reason based on visual content.

2025-09-29T06:14:48+00:00 ― 6 min read

Examining the trade-off between fine-tuning and preserving general abilities in AI models.

2025-09-28T00:29:24+00:00 ― 5 min read

A framework improves LLM performance by integrating tailored toolsets for various tasks.

2025-09-20T09:28:24+00:00 ― 5 min read

New approach enhances LLMs by integrating executable Python code for better action handling.

2025-09-12T09:22:18+00:00 ― 4 min read

Examining limitations of large vision-language models in detailed image understanding.

2025-09-03T23:07:54+00:00 ― 6 min read

A look at how machines analyze and interpret visual data.

2025-08-28T08:12:36+00:00 ― 7 min read

This article discusses a flexible ranking method using multi-vector embeddings for better search results.

2025-08-08T13:25:12+00:00 ― 6 min read

Enhancing user engagement in large vision-language models through proactive communication.

2025-07-26T03:29:42+00:00 ― 6 min read

This article discusses a new model combining visual and language processing.

2025-07-17T16:59:30+00:00 ― 5 min read

A new method streamlines chatbot conversations, keeping them focused and relevant.

2025-07-03T10:55:18+00:00 ― 6 min read

Geo2Seq transforms 3D molecular structures into manageable sequences for efficient generation.

2025-06-25T20:49:36+00:00 ― 11 min read

ARMADA improves image-text pairing through attribute-focused data creation.

2025-06-25T19:54:18+00:00 ― 9 min read

A framework using advanced models to improve research literature analysis.

2025-06-25T12:16:06+00:00 ― 5 min read

A system that learns and adapts through continuous interaction with its environment.

2025-06-08T14:59:54+00:00 ― 7 min read

CoRNStack streamlines code retrieval, making development more efficient and less chaotic.

2025-04-28T15:08:00+00:00 ― 6 min read

Discover how software engineering agents are transforming coding efficiency.

2025-01-19T00:54:54+00:00 ― 5 min read