Bill Byrne

A new method enhances training speed and reduces memory use for language models.

2025-08-29T03:41:48+00:00 ― 6 min read

We enhance Direct Preference Optimization to better handle ties in decision-making.

2025-06-06T05:51:18+00:00 ― 6 min read