Joon Son Chung

FlexiAST allows models to adapt to various audio patch sizes efficiently.

2025-10-07T09:56:55+00:00 ― 6 min read

Improving the way we identify sound sources using audio-visual data.

2025-09-08T12:49:35+00:00 ― 6 min read

A new method improves speaker verification by managing session variability effectively.

2025-09-03T08:56:20+00:00 ― 6 min read

This article discusses an automated method for generating movie trailers efficiently.

2025-08-22T11:59:06+00:00 ― 7 min read

New methods improve video summarization using large datasets and advanced models.

2025-08-22T11:11:42+00:00 ― 7 min read

ElasticAST allows efficient processing of variable-length audio without losing important details.

2025-07-18T02:31:05+00:00 ― 5 min read

A study on improving sound source localization by making better use of audio and visual information.

2025-07-14T06:12:35+00:00 ― 7 min read

An overview of advancements in speaker recognition through the VoxCeleb Challenge.

2025-06-23T13:02:25+00:00 ― 4 min read