Rishabh Agarwal

Research on how Transformers generalize to longer sequences in addition tasks.

2025-09-08T05:49:54+00:00 ― 7 min read

This article discusses using classification for value functions in deep reinforcement learning.

2025-08-23T12:51:08+00:00 ― 5 min read

This paper reviews the benefits of many-shot learning in language models.

2025-08-19T08:00:48+00:00 ― 5 min read

SiT uses symmetry and attention to help reinforcement learning agents generalize.

2025-07-25T16:49:48+00:00 ― 6 min read

Research shows how MBR decoding enhances translation quality in smaller models.

2025-07-13T00:09:12+00:00 ― 5 min read

Gemma 2 offers high performance in a compact size for language tasks.

2025-07-04T12:59:30+00:00 ― 6 min read

A study finds that cheaper models may produce better training data for reasoning tasks.

2025-06-20T08:30:06+00:00 ― 5 min read

This method helps AI models learn by creating and solving challenges.

2025-05-26T00:12:48+00:00 ― 7 min read