A novel approach to reward over-optimization in language models using uncertainty estimation.
― 6 min read
Cutting edge science explained simply
A novel approach to reward over-optimization in language models using uncertainty estimation.
― 6 min read
A method to approximate fairness-accuracy trade-offs for machine learning models.
― 10 min read
A look at how LLMs process language through reasoning techniques.
― 5 min read