Investigating how filler tokens impact performance in language models.
― 6 min read
Cutting edge science explained simply
Investigating how filler tokens impact performance in language models.
― 6 min read
A look at controlling language model behavior with the KL-then-steer technique.
― 5 min read