Sungyoon Kim

Methods for efficiently training two-layer ReLU neural networks.

2025-08-24T17:20:13+00:00 ― 6 min read

Introducing MoEfier for efficient transformation of language models with minimal training.

2025-06-30T06:41:12+00:00 ― 5 min read

Explore the loss landscape and the role of regularization in neural networks.

2025-05-24T22:21:27+00:00 ― 4 min read