Csaba Szepesvári

This paper examines methods for improving value estimation in reinforcement learning.

2025-10-08T16:09:36+00:00 ― 6 min read

A new method improves fitted Q-iteration (FQI) by using log-loss for more efficient learning.

2025-08-31T05:35:54+00:00 ― 6 min read

Addressing hallucinations to enhance the reliability of language models.

2025-08-22T17:15:06+00:00 ― 6 min read

A look at uncertainty types and their importance in language models.

2025-08-02T14:17:54+00:00 ― 5 min read

Constrained Markov decision processes (CMDPs) combine reward maximization with safety constraints in AI applications.

2025-07-24T01:04:00+00:00 ― 5 min read