Haibin Wu

AV-SUPERB evaluates audio and visual models across a range of tasks.

2025-09-08T22:32:35+00:00 ― 5 min read

The EMO-SUPERB project enhances speech emotion recognition through improved techniques and community collaboration.

2025-08-23T00:52:20+00:00 ― 6 min read

A new system to evaluate audio codec performance across various applications.

2025-08-22T23:15:10+00:00 ― 6 min read

A new framework for assessing foundation models in speech tasks.

2025-08-11T09:31:05+00:00 ― 8 min read

Examining how codecs retain emotional tones in voice data.

2025-07-12T06:26:10+00:00 ― 5 min read

This article discusses efficient training methods for speech models using self-supervised learning.

2025-06-16T15:02:50+00:00 ― 4 min read

The MCMamba model improves speech quality in noisy environments using spatial and spectral information.

2025-06-09T21:54:45+00:00 ― 4 min read

This study evaluates low-latency methods for improving speech quality in noisy conditions.

2025-06-09T20:17:35+00:00 ― 6 min read

A look at the Codec-SUPERB challenge results and codec performance metrics.

2025-06-05T06:58:50+00:00 ― 5 min read

ESPnet-Codec enhances training and evaluation of neural codecs for audio and speech.

2025-06-03T03:09:30+00:00 ― 7 min read

A new method improves the efficiency of attention workloads in AI systems.

2025-06-01T21:34:30+00:00 ― 7 min read

VERSA evaluates speech, audio, and music quality.

2025-01-28T09:33:18+00:00 ― 9 min read