A comprehensive look at methods for improving language model responses.
― 6 min read
A new approach streamlines training language models for both safety and helpfulness.
― 9 min read
Examining the link between truthfulness and political bias in language models.
― 6 min read
PF-PPO enhances language models by filtering out unreliable rewards for better code responses.
― 5 min read
This article examines key factors in preference dataset quality for better reward model training.
― 6 min read
A new approach makes language model training more reliable through robust feedback systems.
― 5 min read
A fresh approach to training reward models enhances AI alignment with human preferences.
― 6 min read
Learn how preference tuning aligns models with human feedback.
― 4 min read
Robots can now learn tasks better through automated reward labeling.
― 7 min read
Discover how reward models are changing the way machines learn and perform.
― 7 min read
A new method uses human feedback to improve AI's ability to solve complex physics problems.
― 4 min read
Learn how human feedback shapes AI language model responses.
― 8 min read
Video Curious Agent simplifies finding key moments in lengthy videos.
― 6 min read
A look into how DTR tackles reward bias in learning.
― 7 min read
Researchers enhance language models for complex mathematical reasoning.
― 7 min read
A new framework helps language models express uncertainty and improve their honesty.
― 8 min read
A new tool improves AI responses to better match human preferences.
― 4 min read