Hongning Wang

Examining the accuracy of health snippets in search engine results.

2025-09-13T21:50:36+00:00 ― 5 min read

This article presents a method for clients with diverse objectives in federated bandit learning.

2025-09-03T00:29:06+00:00 ― 6 min read

A novel approach to reward over-optimization in language models using uncertainty estimation.

2025-08-31T04:16:54+00:00 ― 6 min read

ChatGLM-RLHF improves AI interactions through human feedback and advanced training methods.

2025-08-23T14:27:00+00:00 ― 5 min read

GLM-4 models show improved capabilities in language understanding and generation.

2025-07-27T06:52:54+00:00 ― 8 min read

A new method to assess how well LLMs understand and apply rules.

2025-06-20T19:41:36+00:00 ― 5 min read

Learn how human feedback shapes AI language model responses.

2025-04-02T03:58:57+00:00 ― 8 min read

A fresh approach to enhance instruction-following in language models.

2025-02-28T18:21:36+00:00 ― 6 min read