Sriram Ganapathy

Integrating metadata enhances performance in speech tasks like language identification.

2025-10-06T12:05:10+00:00 ― 6 min read

Combining foundational and specialized models boosts AI capabilities efficiently.

2025-09-18T23:14:24+00:00 ― 5 min read

New methods combine audio and metadata for better language recognition.

2025-09-08T07:09:30+00:00 ― 5 min read

Learn how dereverberation boosts speech recognition in noisy environments.

2025-09-05T12:45:40+00:00 ― 4 min read

E-SHARC improves speaker identification in various audio environments.

2025-08-28T06:22:45+00:00 ― 6 min read

This article presents a dual encoder system for effective speech representation learning.

2025-07-24T01:50:20+00:00 ― 6 min read

New method improves ASR systems' handling of various accents through specialized codebooks.

2025-07-22T04:29:40+00:00 ― 5 min read

A new benchmark aids in assessing speech tokenizers for better performance.

2025-06-20T00:01:10+00:00 ― 6 min read

A novel method combines meaning and sound for improved emotion detection in speech.

2025-06-16T16:40:00+00:00 ― 6 min read

New methods improve understanding of AI model predictions.

2025-06-08T13:31:25+00:00 ― 6 min read

Examining how our brains process sound and speech in different situations.

2025-05-10T20:35:30+00:00 ― 5 min read