Jianfei Chen

A new approach to reducing memory use in neural networks through 4-bit optimizers.

2025-09-30T18:51:00+00:00 ― 6 min read

VCAS improves neural network training efficiency without losing accuracy.

2025-09-03T13:23:18+00:00 ― 6 min read

A new method enhances sparse language model training while minimizing performance loss.

2025-07-04T17:36:00+00:00 ― 7 min read

Introducing S-STE, a novel approach to improving sparse neural network training efficiency.

2025-06-12T14:59:00+00:00 ― 5 min read

A new method speeds up AI processing without losing accuracy.

2025-05-21T20:37:30+00:00 ― 5 min read

ReMoE brings flexibility and efficiency to language models with dynamic expert selection.

2025-02-11T16:16:57+00:00 ― 7 min read