A look into how transformers use attention layers for better language processing.
― 4 min read
A closer look at self-attention mechanisms in language processing models.
― 7 min read
Study reveals insights into in-context learning performance across various model architectures.
― 5 min read