Peiyi Wang

PCA-Bench tests large language models in complex decision-making scenarios.

2025-09-05T18:58:36+00:00 ― 6 min read

A new dataset aims to enhance AI's grasp of scientific images and reasoning.

2025-09-02T17:06:42+00:00 ― 5 min read

Exploring how preference learning improves language model alignment with human expectations.

2025-06-17T05:58:42+00:00 ― 8 min read