Jee-weon Jung

VoxtLM combines speech recognition, synthesis, text generation, and continuation in one model.

2025-09-13T11:02:45+00:00 ― 4 min read

Exploring advancements in automated audio captioning and its impact on accessibility.

2025-09-02T01:21:35+00:00 ― 5 min read

An overview of advancements in speaker recognition through the VoxCeleb Challenge.

2025-06-23T13:02:25+00:00 ― 4 min read

A study shows i-vectors can compete with complex models in speaker recognition.

2025-06-10T06:49:10+00:00 ― 5 min read

ESPnet-Codec enhances training and evaluation of neural codecs for audio and speech.

2025-06-03T03:09:30+00:00 ― 7 min read