Latest Articles for Speech Recognition

Computation and Language New Methods for Evaluating Speaker Diarization

Introducing fresh metrics to assess speaker diarization accuracy in conversational AI.

2025-09-26T18:04:30+00:00 ― 6 min read

Computation and Language Advancements in Speech Recognition Technology

New methods enhance accuracy and speed in speech recognition systems.

2025-09-26T11:35:55+00:00 ― 5 min read

Computation and Language Improving Automatic Speech Recognition with Text Injection

A new method enhances ASR performance through text data integration.

2025-09-26T07:33:00+00:00 ― 6 min read

Computation and Language Improving Speech Recognition with Text Injection

Text injection helps recognize personal information while maintaining privacy.

2025-09-26T06:44:25+00:00 ― 5 min read

Sound Advancements in Speech Recognition with mmWave Technology

Radio2Text uses mmWave signals for real-time speech recognition in noisy environments.

2025-09-25T22:38:35+00:00 ― 6 min read

Computation and Language Improving Grapheme-to-Phoneme Conversion with New Sampling Method

This study enhances G2P models by focusing on error-prone areas during training.

2025-09-25T05:38:20+00:00 ― 5 min read

Audio and Speech Processing Advancements in Formant Tracking Techniques

Discover methods that improve accuracy in formant tracking for speech analysis.

2025-09-24T22:21:05+00:00 ― 6 min read

Computation and Language Advancements in Speech Language Modeling

New methods improve speech processing and generation in language models.

2025-09-19T16:02:05+00:00 ― 5 min read

Sound Advancements in Noise Suppression Technology

New techniques improve audio clarity in noisy environments.

2025-09-19T15:13:30+00:00 ― 6 min read

Audio and Speech Processing Advancing Few-Shot Keyword Spotting with Reading Speech Data

New methods improve keyword spotting using available reading speech data.

2025-09-19T13:36:20+00:00 ― 4 min read

Audio and Speech Processing Advancing Confidence Estimation in Automatic Speech Recognition

A new approach enhances confidence estimation in ASR systems for better accuracy.

2025-09-15T03:14:28+00:00 ― 4 min read

Machine Learning Challenges in Using Convnets for Audio Filterbank Design

This study explores issues with using convnets for audio filterbank creation.

2025-09-14T14:34:35+00:00 ― 5 min read

Audio and Speech Processing Improving Speaker Diarization with Language Models

This article explores advancements in speaker diarization using language models for better accuracy.

2025-09-14T03:14:25+00:00 ― 5 min read

Audio and Speech Processing PromptASR: Next-Level Speech Recognition Technology

New system enhances speech recognition using context-aware prompts.

2025-09-13T10:14:10+00:00 ― 4 min read

Sound Advancements in Universal Audio Models

EnCodecMAE combines self-supervised learning and audio codecs for improved audio task performance.

2025-09-13T09:25:35+00:00 ― 5 min read

Audio and Speech Processing A New Approach to Keyword Spotting

Introducing a flexible method for recognizing keywords in speech across languages.

2025-09-13T06:11:15+00:00 ― 5 min read

Sound New System Improves Voice Extraction from Unstable Head Positions

PIAVE helps machines extract voices clearly, even when speakers turn their heads.

2025-09-12T19:39:40+00:00 ― 6 min read

Sound A New Framework for Speaker Anonymization

Introducing a flexible framework to enhance voice privacy research.

2025-09-12T05:05:10+00:00 ― 7 min read

Computation and Language Improving Explanations for Speech Models

A new method simplifies understanding of speech classification models.

2025-09-12T02:39:25+00:00 ― 6 min read

Sound M-AUDIODEC: A New Way to Compress Audio

M-AUDIODEC compresses multi-channel audio while retaining speaker position and quality.

2025-09-11T16:56:25+00:00 ― 6 min read

Audio and Speech Processing Improving Sound Quality in Hearables

Research reveals new models to enhance voice clarity in smart earbuds.

2025-09-11T12:04:55+00:00 ― 5 min read

Artificial Intelligence Improving Robot Understanding of Human Instructions

A new method enhances robots' ability to follow spoken directions accurately.

2025-09-11T08:21:18+00:00 ― 5 min read

Audio and Speech Processing Advancing Fake Speech Detection Techniques

New methods are improving our ability to detect fake speech effectively.

2025-09-11T02:21:55+00:00 ― 6 min read

Sound Improving Speech Recognition with Personalisation Techniques

A new method enhances ASR models for individual users using quantisation and adaptation.

2025-09-10T13:24:35+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Recognition through Early-Exit Models

New models adapt to improve speech recognition efficiency and responsiveness.

2025-09-09T21:12:55+00:00 ― 5 min read

Audio and Speech Processing Improving Whisper for Low-Resource Languages

Enhancing Whisper's speech recognition for Vietnamese and other low-resource languages.

2025-09-08T03:55:10+00:00 ― 4 min read

Neuroscience Understanding Speech Processing in Challenging Environments

This study examines how hearing ability affects speech understanding in noisy settings.

2025-09-07T04:34:28+00:00 ― 6 min read

Audio and Speech Processing Improving Audio Datasets with K-Means Clustering

Using k-means clustering to optimize audio data for better model training.

2025-09-06T15:28:55+00:00 ― 5 min read

Audio and Speech Processing Efficient Model Selection for Speech Recognition

A method to choose the best ASR model based on audio features.

2025-09-05T23:17:15+00:00 ― 5 min read

Computation and Language My Science Tutor Project: A New Way to Learn

MyST aims to improve children's science learning through virtual tutoring.

2025-09-05T09:31:20+00:00 ― 5 min read

Sound Advancements in Meeting Transcription Technology

A look at M2MeT 2.0 and its impact on meeting transcription.

2025-09-05T03:51:15+00:00 ― 5 min read

Audio and Speech Processing Advances and Challenges in Speech Recognition Models

This study examines how model compression impacts speech recognition in noisy environments.

2025-09-04T19:45:25+00:00 ― 5 min read

Sound Advancements in Audio and Speech Recognition Model

A new model improves understanding of speech and sounds simultaneously.

2025-09-04T18:08:15+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Extraction Technology

Introducing new models for better speech extraction in noisy environments.

2025-09-04T02:45:10+00:00 ― 5 min read

Computation and Language Addressing Challenges in Long-Form Automatic Speech Recognition

Research focuses on improving ASR systems for unsegmented audio.

2025-09-03T13:47:50+00:00 ― 4 min read

Computation and Language Addressing Gender Bias in Speech Recognition Technology

Examining performance gaps in speech recognition across different genders.

2025-09-03T12:51:42+00:00 ― 5 min read

Computation and Language Improving Speech Recognition with Large Language Models

LLMs enhance accuracy and error correction in speech recognition systems.

2025-09-03T06:30:35+00:00 ― 5 min read

Audio and Speech Processing Improving Meeting Transcriptions with PP-MeT System

PP-MeT aims to enhance accuracy in transcribing multi-speaker meetings.

2025-09-02T04:35:55+00:00 ― 5 min read

Audio and Speech Processing A Universal Approach to Speech Enhancement

This research presents a model for improving speech clarity across different conditions.

2025-09-02T02:10:10+00:00 ― 5 min read

Computation and Language Advancements in Code-Switching Speech Recognition

This project aims to improve recognition of Gujarati-English mixed speech.

2025-08-30T05:46:00+00:00 ― 6 min read