How low-bit quantization affects large language models during training.
Xu Ouyang, Tao Ge, Thomas Hartvigsen
― 6 min read
Cutting edge science explained simply
How low-bit quantization affects large language models during training.
Xu Ouyang, Tao Ge, Thomas Hartvigsen
― 6 min read
New methods improve large language models' handling of context for better performance.
Zhisong Zhang, Yan Wang, Xinting Huang
― 6 min read