Lichang Chen

Exploring the challenges and solutions of reward hacking in AI model training.

2025-09-09T06:58:48+00:00 ― 7 min read

A fresh approach to training reward models enhances AI alignment with human preferences.

2025-06-09T16:00:54+00:00 ― 6 min read