Utkarsh Saxena

Combining SmoothQuant and GPTQ improves the efficiency and performance of large language models.

2025-08-11T22:23:42+00:00 ― 6 min read

Eigen Attention improves memory efficiency for large language models processing long texts.

2025-06-29T16:43:48+00:00 ― 6 min read

ResQ optimizes large language models, enhancing performance and reducing costs.

2025-02-20T08:07:48+00:00 ― 6 min read