Zhijiang Guo

Adversarial questions pose challenges for generative search engines and reduce their accuracy.

2025-09-04T11:22:36+00:00 ― 5 min read

A new approach enhances the performance of code generated by large language models.

2025-08-08T16:11:06+00:00 ― 7 min read

A new method enhances reasoning in language models by automating step labeling.

2025-08-07T00:33:12+00:00 ― 6 min read

New methods and benchmarks aim to simplify formalizing mathematics in Lean 4.

2025-08-03T08:59:42+00:00 ― 6 min read

A new benchmark evaluates reasoning skills in language models.

2025-07-26T22:11:30+00:00 ― 7 min read