New method improves language models' ability to avoid unwanted topics.
― 6 min read
Cutting edge science explained simply
New method improves language models' ability to avoid unwanted topics.
― 6 min read
A new method to assess diverse user values in language models.
― 7 min read