Latest Articles for Reasoning

Machine Learning Advancements in Many-Shot Learning for Language Models

This paper reviews the benefits of many-shot learning in language models.

2025-08-19T08:00:48+00:00 ― 5 min read

Computation and Language Self-Explore: A New Method for Language Models

Introducing a self-guided approach for better reasoning in language models.

2025-08-19T01:49:30+00:00 ― 8 min read

Artificial Intelligence Improving Question-Answering Systems in Finance

Enhancing QA systems through fine-tuning and reasoning for better finance insights.

2025-08-18T22:55:42+00:00 ― 6 min read

Computation and Language Evaluating Paraphrastic Consistency in Language Models

This study examines how language models handle different expressions of the same reasoning problems.

2025-08-18T21:28:48+00:00 ― 4 min read

Artificial Intelligence The Rise of AI Agents in Modern Problem Solving

AI agents are shaping how we tackle tasks and challenges efficiently.

2025-08-18T20:25:36+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancements in Text-Centric Visual Question Answering

New dataset Square-10M significantly boosts open-source visual question answering capabilities.

2025-08-18T02:31:12+00:00 ― 6 min read

Computation and Language Advancing Reasoning in Language Models

Research improves reasoning clarity in language models for better accuracy.

2025-08-15T15:16:12+00:00 ― 5 min read

Computation and Language Evaluating Language Models: New Benchmark Insights

A new benchmark assesses language models' understanding of linguistic competence.

2025-08-15T14:20:54+00:00 ― 7 min read

Computation and Language Evaluating the True Abilities of Language Models in Math

Research uncovers concerns about large language models' math reasoning skills.

2025-08-14T17:56:24+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancements in Multi-Image Model Training

New dataset improves model performance on multi-image tasks.

2025-08-14T11:45:06+00:00 ― 5 min read

Machine Learning Assessing Large Language Models: Size and Precision Matters

This study evaluates how model size and quantization impact language model performance.

2025-08-13T18:22:18+00:00 ― 7 min read

Computer Vision and Pattern Recognition Evaluating the Future of Video-Large Multi-modal Models

Assessing the capabilities and challenges of advanced video understanding models.

2025-08-13T12:42:36+00:00 ― 5 min read

Artificial Intelligence Improving User Interface Agents with Latent State Estimation

Learn how enhancing UI agents can create better user experiences.

2025-08-10T10:34:54+00:00 ― 7 min read

Artificial Intelligence Examining Error Recovery in Large Language Models

This study analyzes how language models recover from reasoning errors during tasks.

2025-08-08T14:12:36+00:00 ― 8 min read

Software Engineering Advancements in Automated Vulnerability Repair

Exploring the role of AI in fixing software vulnerabilities.

2025-08-07T18:03:54+00:00 ― 6 min read

Machine Learning Enhancing Reasoning in Language Models with MindStar

MindStar framework improves reasoning skills in language models efficiently.

2025-08-07T01:12:42+00:00 ― 6 min read

Computation and Language Navigating Knowledge Privacy in Language Models

A new method tackles ethical concerns in language models.

2025-08-06T23:06:18+00:00 ― 5 min read

Computation and Language Introducing MMLU-Pro: A Tougher Benchmark for Language Models

MMLU-Pro challenges language models with harder questions and more answer options.

2025-08-03T04:54:48+00:00 ― 7 min read

Computation and Language Advancing Reasoning in Language Models

New methods aim to enhance reasoning capabilities in language models.

2025-08-02T09:25:36+00:00 ― 6 min read

Machine Learning LLMs Struggle with Basic Reasoning Tasks

Recent tests reveal LLMs' weaknesses in simple reasoning despite high benchmark scores.

2025-08-02T09:01:54+00:00 ― 5 min read

Computation and Language Assessing Out-of-Context Knowledge Reasoning in LLMs

Study evaluates how well LLMs reason beyond immediate context.

2025-07-30T11:54:24+00:00 ― 5 min read

Computer Vision and Pattern Recognition Evaluating Video Comprehension in Multimodal Language Models

A new benchmark aims to assess MLLMs in video understanding across multiple topics.

2025-07-29T22:20:42+00:00 ― 6 min read

Computer Vision and Pattern Recognition New Benchmark for Long Video Understanding

A benchmark created to improve comprehension of long video content.

2025-07-29T18:15:48+00:00 ― 7 min read

Computer Vision and Pattern Recognition Integrating Visual Sketching into Language Models

A new framework enhances reasoning in language models through visual sketches.

2025-07-29T11:40:48+00:00 ― 3 min read

Computation and Language Evaluating Reasoning Skills in Large Language Models

A study highlights gaps in reasoning abilities of LLMs for math problem solving.

2025-07-28T03:56:54+00:00 ― 6 min read

Computer Vision and Pattern Recognition Introducing VideoVista: A New Benchmark for Video QA

VideoVista offers a comprehensive evaluation for video question-answering models.

2025-07-27T13:35:48+00:00 ― 5 min read

Computation and Language DetectBench: A New Standard for Evidence Detection in Language Models

DetectBench evaluates LLMs on their ability to detect hidden evidence in reasoning tasks.

2025-07-27T05:02:18+00:00 ― 5 min read

Artificial Intelligence Neuron Activation and Arithmetic Reasoning in LLMs

Examining how neuron activation enhances arithmetic reasoning in large language models.

2025-07-27T00:17:54+00:00 ― 9 min read

Computation and Language Assessing Reasoning in Language Models

A new benchmark evaluates reasoning skills in language models.

2025-07-26T22:11:30+00:00 ― 7 min read

Artificial Intelligence AIPS: A Step Forward in Automated Math Problem Solving

AIPS showcases potential in solving complex algebraic inequalities independently.

2025-07-26T04:56:36+00:00 ― 6 min read

Information Retrieval Improving Text Generation with RAG Systems

This article discusses how RAG systems enhance text generation using external information.

2025-07-26T00:12:12+00:00 ― 7 min read

Computation and Language Evaluating Graph Reasoning in Language Models

A study examines how well LLMs reason with graph data.

2025-07-25T13:56:00+00:00 ― 5 min read

Machine Learning Improving Reasoning in Language Models with Preference Optimization

New methods refine reasoning skills in language models for better task performance.

2025-07-25T06:33:36+00:00 ― 7 min read

Machine Learning Improving Reasoning in Black-Box LLMs

A new method enhances accuracy in question-answering for black-box language models.

2025-07-24T02:15:06+00:00 ― 5 min read

Computation and Language Advancements in Synthetic Data for AI Training

A new synthetic dataset enhances training for multimodal AI models.

2025-07-23T15:35:12+00:00 ― 5 min read

Computation and Language Advancing Machine Reasoning with Visual Data

Improving how machines answer visual questions through structured reasoning.

2025-07-22T20:21:48+00:00 ― 6 min read

Computation and Language Evaluating Belief Revision in Language Models

A new method measures how language models adapt their beliefs with new evidence.

2025-07-22T18:07:30+00:00 ― 9 min read

Robotics Assessing Language Models for Robotics Tasks

A new benchmark evaluates language models' effectiveness in robotic applications.

2025-07-22T16:56:24+00:00 ― 6 min read

Computation and Language Improving Language Models with Step-Controlled DPO

A new approach enhances reasoning in language models by generating controlled errors.

2025-07-22T05:13:18+00:00 ― 6 min read

Computer Vision and Pattern Recognition Enhancing 3D Visual Grounding with ReGround3D

ReGround3D improves understanding of human instructions in 3D environments.

2025-07-21T19:05:00+00:00 ― 4 min read