Matthias Gallé

New methods promise better AI model performance through simplified reinforcement learning.

2025-09-05T04:29:36+00:00 ― 5 min read

A new method uses synthetic critiques to improve reward models for better alignment.

2025-08-03T23:12:54+00:00 ― 11 min read

Examining the impact of data contamination on code generation evaluations.

2025-07-15T17:43:24+00:00 ― 6 min read

Transform discarded models into powerful new solutions through model merging.

2025-04-10T18:13:30+00:00 ― 7 min read