New methods enhance how language models forget unwanted knowledge.
Hongbang Yuan, Zhuoran Jin, Pengfei Cao
― 6 min read
Cutting edge science explained simply
New methods enhance how language models forget unwanted knowledge.
Hongbang Yuan, Zhuoran Jin, Pengfei Cao
― 6 min read
A new tool improves AI responses to better match human preferences.
Zhuoran Jin, Hongbang Yuan, Tianyi Men
― 4 min read