Jeff Schneider

A novel method combines reinforcement learning and predictive models for trading in the Malaysian stock market.

2025-10-18T14:06:30+00:00 ― 5 min read

A new method enhances offline RL by using latent diffusion for better data utilization.

2025-09-27T20:24:30+00:00 ― 8 min read

Exploring the Diffusion-ES technique for improved self-driving car navigation.

2025-09-09T13:57:30+00:00 ― 5 min read

This study evaluates methods to enhance large language models using user preference data.

2025-08-17T07:09:54+00:00 ― 5 min read

Examining the importance of data valuation for language models and its implications.

2025-08-09T02:43:06+00:00 ― 7 min read

Soft-QMIX combines QMIX with maximum-entropy reinforcement learning to improve agent cooperation.

2025-07-26T21:16:12+00:00 ― 6 min read

A new method improves how agents learn from one another's actions in teamwork settings.

2025-06-30T06:57:00+00:00 ― 9 min read