Latest Articles for Speech Recognition

Audio and Speech Processing Advancing Fake Speech Detection Techniques

New methods are improving our ability to detect fake speech effectively.

2025-09-11T02:21:55+00:00 ― 6 min read

Sound Improving Speech Recognition with Personalisation Techniques

A new method enhances ASR models for individual users using quantisation and adaptation.

2025-09-10T13:24:35+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Recognition through Early-Exit Models

New models adapt to improve speech recognition efficiency and responsiveness.

2025-09-09T21:12:55+00:00 ― 5 min read

Audio and Speech Processing Improving Whisper for Low-Resource Languages

Enhancing Whisper's speech recognition for Vietnamese and other low-resource languages.

2025-09-08T03:55:10+00:00 ― 4 min read

Neuroscience Understanding Speech Processing in Challenging Environments

This study examines how hearing ability affects speech understanding in noisy settings.

2025-09-07T04:34:28+00:00 ― 6 min read

Audio and Speech Processing Improving Audio Datasets with K-Means Clustering

Using k-means clustering to optimize audio data for better model training.

2025-09-06T15:28:55+00:00 ― 5 min read

Audio and Speech Processing Efficient Model Selection for Speech Recognition

A method to choose the best ASR model based on audio features.

2025-09-05T23:17:15+00:00 ― 5 min read

Computation and Language My Science Tutor Project: A New Way to Learn

MyST aims to improve children's science learning through virtual tutoring.

2025-09-05T09:31:20+00:00 ― 5 min read

Sound Advancements in Meeting Transcription Technology

A look at M2MeT 2.0 and its impact on meeting transcription.

2025-09-05T03:51:15+00:00 ― 5 min read

Audio and Speech Processing Advances and Challenges in Speech Recognition Models

This study examines how model compression impacts speech recognition in noisy environments.

2025-09-04T19:45:25+00:00 ― 5 min read

Sound Advancements in Audio and Speech Recognition Model

A new model improves understanding of speech and sounds simultaneously.

2025-09-04T18:08:15+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Extraction Technology

Introducing new models for better speech extraction in noisy environments.

2025-09-04T02:45:10+00:00 ― 5 min read

Computation and Language Addressing Challenges in Long-Form Automatic Speech Recognition

Research focuses on improving ASR systems for unsegmented audio.

2025-09-03T13:47:50+00:00 ― 4 min read

Computation and Language Addressing Gender Bias in Speech Recognition Technology

Examining performance gaps in speech recognition across different genders.

2025-09-03T12:51:42+00:00 ― 5 min read

Computation and Language Improving Speech Recognition with Large Language Models

LLMs enhance accuracy and error correction in speech recognition systems.

2025-09-03T06:30:35+00:00 ― 5 min read

Audio and Speech Processing Improving Meeting Transcriptions with PP-MeT System

PP-MeT aims to enhance accuracy in transcribing multi-speaker meetings.

2025-09-02T04:35:55+00:00 ― 5 min read

Audio and Speech Processing A Universal Approach to Speech Enhancement

This research presents a model for improving speech clarity across different conditions.

2025-09-02T02:10:10+00:00 ― 5 min read

Computation and Language Advancements in Code-Switching Speech Recognition

This project aims to improve recognition of Gujarati-English mixed speech.

2025-08-30T05:46:00+00:00 ― 6 min read

Computation and Language Advancing Speech Classification with Multimodal Data

A new model integrates audio and text for better speech classification.

2025-08-29T18:49:00+00:00 ― 6 min read

Sound NOTSOFAR-1 Challenge: Advancing Meeting Transcription Technology

A new initiative to improve transcription technology for meetings in large rooms.

2025-08-29T16:23:15+00:00 ― 7 min read

Computation and Language Advancements in Speech Recognition Error Correction

New methods enhance accuracy in noisy speech recognition using large language models.

2025-08-29T01:48:45+00:00 ― 6 min read

Audio and Speech Processing Addressing Speech Technology Challenges for Under-Resourced Languages

This article discusses solutions for speech applications in languages with limited transcribed data.

2025-08-28T18:31:30+00:00 ― 6 min read

Computation and Language Documenting Endangered Languages with IGT

A new method supports the preservation of at-risk languages through detailed documentation.

2025-08-27T17:35:42+00:00 ― 8 min read

Audio and Speech Processing New Method to Clear Echoed Speech

A method enhances speech clarity in noisy environments without clear training data.

2025-08-26T17:56:30+00:00 ― 6 min read

Audio and Speech Processing Improving Speech Recognition for Low-Resource Languages

New methods enhance ASR for underrepresented languages using data from similar languages.

2025-08-26T10:39:15+00:00 ― 5 min read

Audio and Speech Processing Reborn: A New Era in Unsupervised ASR

Reborn offers innovative solutions for automatic speech recognition without labeled data.

2025-08-25T19:16:10+00:00 ― 6 min read

Computation and Language Advancements in Spoken Dialog Technology

A look at new models for natural spoken responses.

2025-08-25T03:04:30+00:00 ― 6 min read

Audio and Speech Processing Improving Speaker Diarization with Multi-Microphone Approaches

New methods enhance voice activity and overlap detection in speaker diarization.

2025-08-24T13:18:35+00:00 ― 6 min read

Signal Processing Chirp MFCC: A New Approach in Audio Processing

Chirp MFCC enhances audio signal representation for better classification and recognition.

2025-08-23T08:58:10+00:00 ― 5 min read

Computation and Language Kallaama Project: Bridging Language and Technology in Agriculture

Kallaama creates a speech dataset in local languages to aid Senegalese farmers.

2025-08-23T02:43:54+00:00 ― 4 min read

Computation and Language Advancing Language Models Through Speech Styles

A new framework enhances language models by recognizing and responding to different speech styles.

2025-08-23T00:03:45+00:00 ― 7 min read

Audio and Speech Processing Improving Speaker Verification for Children

Enhancing ASV systems to recognize children's voices accurately.

2025-08-22T09:29:15+00:00 ― 8 min read

Audio and Speech Processing Advancements in Estimating Room Acoustic Properties

Research highlights new models for better audio quality in various environments.

2025-08-22T03:00:35+00:00 ― 6 min read

Sound Advancements in Automatic Speaker Diarization Techniques

Research highlights the importance of timing over specific speaker features in diarization models.

2025-08-21T00:17:20+00:00 ― 6 min read

Human-Computer Interaction Advancements in Silent Speech Interfaces

A look at MONA, a system enhancing silent speech communication.

2025-08-20T16:11:30+00:00 ― 5 min read

Robotics Improving Robot Voice Recognition in Noisy Settings

Research focuses on helping robots better understand speech amidst background noise.

2025-08-19T22:22:40+00:00 ― 5 min read

Audio and Speech Processing Evaluating Voice Recognition in Noisy Environments

A new benchmark assesses voice recognition systems' performance amidst various disturbances.

2025-08-19T14:16:50+00:00 ― 5 min read

Audio and Speech Processing Advancements in Cochlear Implants with AI Technologies

AI is improving cochlear implants for better hearing and communication in challenging environments.

2025-08-17T13:41:50+00:00 ― 6 min read

Sound New Approach to Audio Separation Using Language

This method improves audio separation by combining language descriptions with sound analysis.

2025-08-13T14:57:35+00:00 ― 6 min read

Sound Innovative Voice Analysis for Early Parkinson's Detection

Research shows promise in using speech analysis for identifying Parkinson's disease early.

2025-08-09T16:24:42+00:00 ― 5 min read