Introducing fresh metrics to assess speaker diarization accuracy in conversational AI.
― 6 min read
Cutting-edge science explained simply
New methods enhance accuracy and speed in speech recognition systems.
― 5 min read
A new method enhances ASR performance through text data integration.
― 6 min read
Text injection helps recognize personal information while maintaining privacy.
― 5 min read
Radio2Text uses mmWave signals for real-time speech recognition in noisy environments.
― 6 min read
This study enhances G2P models by focusing on error-prone areas during training.
― 5 min read
Discover methods that improve accuracy in formant tracking for speech analysis.
― 6 min read
New methods improve speech processing and generation in language models.
― 5 min read
New techniques improve audio clarity in noisy environments.
― 6 min read
New methods improve keyword spotting using available reading speech data.
― 4 min read
A new approach enhances confidence estimation in ASR systems for better accuracy.
― 4 min read
This study explores issues with using convnets for audio filterbank creation.
― 5 min read
This article explores advancements in speaker diarization using language models for better accuracy.
― 5 min read
New system enhances speech recognition using context-aware prompts.
― 4 min read
EnCodecMAE combines self-supervised learning and audio codecs for improved audio task performance.
― 5 min read
Introducing a flexible method for recognizing keywords in speech across languages.
― 5 min read
PIAVE helps machines extract voices clearly, even when speakers turn their heads.
― 6 min read
Introducing a flexible framework to enhance voice privacy research.
― 7 min read
A new method simplifies understanding of speech classification models.
― 6 min read
M-AUDIODEC compresses multi-channel audio while retaining speaker position and quality.
― 6 min read
Research reveals new models to enhance voice clarity in smart earbuds.
― 5 min read
A new method enhances robots' ability to follow spoken directions accurately.
― 5 min read
New methods improve the detection of fake speech.
― 6 min read
A new method enhances ASR models for individual users using quantisation and adaptation.
― 6 min read
New models adapt to improve speech recognition efficiency and responsiveness.
― 5 min read
Enhancing Whisper's speech recognition for Vietnamese and other low-resource languages.
― 4 min read
This study examines how hearing ability affects speech understanding in noisy settings.
― 6 min read
Using k-means clustering to optimize audio data for better model training.
― 5 min read
A method to choose the best ASR model based on audio features.
― 5 min read
MyST aims to improve children's science learning through virtual tutoring.
― 5 min read
A look at M2MeT 2.0 and its impact on meeting transcription.
― 5 min read
This study examines how model compression impacts speech recognition in noisy environments.
― 5 min read
A new model improves understanding of speech and sounds simultaneously.
― 6 min read
Introducing new models for better speech extraction in noisy environments.
― 5 min read
Research focuses on improving ASR systems for unsegmented audio.
― 4 min read
Examining performance gaps in speech recognition across different genders.
― 5 min read
LLMs enhance accuracy and error correction in speech recognition systems.
― 5 min read
PP-MeT aims to enhance accuracy in transcribing multi-speaker meetings.
― 5 min read
This research presents a model for improving speech clarity across different conditions.
― 5 min read
This project aims to improve recognition of Gujarati-English mixed speech.
― 6 min read