Akshay Krishnamurthy

Introducing an efficient algorithm for reinforcement learning with deterministic dynamics.

2025-07-29T06:32:10+00:00 ― 6 min read

Discover how language models improve their outputs through self-evaluation techniques.

2025-04-02T07:29:43+00:00 ― 7 min read