Latest Articles for Speech Recognition

Audio and Speech Processing Improving Speech Clarity with Dereverberation Techniques

Learn how dereverberation boosts speech recognition in noisy environments.

2025-09-05T12:45:40+00:00 ― 4 min read

Sound Advancements in Audio and Speech Recognition Model

A new model improves understanding of speech and sounds simultaneously.

2025-09-04T18:08:15+00:00 ― 6 min read

Audio and Speech Processing Improving Speech Recognition for Diverse Accents

Enhancing speech models to better recognize and adapt to different accents.

2025-09-04T08:25:15+00:00 ― 4 min read

Computation and Language Building Speech Recognition for Indian Languages

A project to enhance speech recognition across diverse Indian languages.

2025-09-01T15:10:24+00:00 ― 4 min read

Computation and Language Kallaama Project: Bridging Language and Technology in Agriculture

Kallaama creates a speech dataset in local languages to aid Senegalese farmers.

2025-08-23T02:43:54+00:00 ― 4 min read

Computation and Language Challenges and Opportunities for Indigenous Languages in NLP

Indigenous languages face challenges in technology while offering rich cultural insights.

2025-08-21T07:40:36+00:00 ― 5 min read

Robotics Evaluating a Social Robot in Healthcare

A study on the use of ARI in a gerontological day-care facility.

2025-08-20T09:17:36+00:00 ― 6 min read

Computation and Language Classifying Sorani Kurdish Subdialects Through Audio Data

Research identifies and classifies Sorani Kurdish dialects using extensive audio recordings.

2025-08-14T07:57:50+00:00 ― 6 min read

Computation and Language Generative Fusion Decoding: Advancing Text Recognition

A new method enhances text recognition accuracy across various applications.

2025-08-07T22:00:54+00:00 ― 6 min read

Human-Computer Interaction Advancing Robot Communication: Overlapping Speech Solution

A new system improves robot interactions by filtering overlapping speech.

2025-08-04T13:57:15+00:00 ― 6 min read

Audio and Speech Processing Advancements in Automatic Speech Recognition with Dynamic TTA

New methods enhance speech recognition in noisy environments using adaptive techniques.

2025-07-29T13:49:25+00:00 ― 6 min read

Computation and Language Advancements in Code-Switching Speech Translation

A new method improves translating mixed-language speech into English.

2025-07-29T09:46:30+00:00 ― 5 min read

Audio and Speech Processing GigaSpeech 2: A New Dataset for Speech Recognition

GigaSpeech 2 offers a vast dataset for low-resource languages to improve speech recognition.

2025-07-29T02:29:15+00:00 ― 5 min read

Computer Vision and Pattern Recognition The BabyView Dataset: A New Look at Child Learning

A unique dataset captures children's daily lives to enhance machine learning and understanding of human learning.

2025-07-29T01:16:42+00:00 ― 7 min read

Computation and Language Generative AI Systems: Shaping the Future of Content Creation

Discover how Generative AI is changing the way we create content.

2025-07-24T05:01:00+00:00 ― 6 min read

Computation and Language Advancements in Automatic Speech Recognition Technology

New methods improve accuracy and efficiency in speech recognition systems.

2025-07-22T03:41:05+00:00 ― 6 min read

Sound Advancing Communication: Speech Recognition Meets Morse Code

A new model enhances communication for individuals with disabilities using speech recognition and Morse code.

2025-07-18T02:52:00+00:00 ― 5 min read

Audio and Speech Processing Qwen2-Audio: A New Voice for Technology

A voice-driven model transforming audio interaction with technology.

2025-07-16T00:18:55+00:00 ― 5 min read

Audio and Speech Processing Vibravox: Advancing Speech Recognition Technology

A new dataset aims to improve speech capture using body-conduction sensors.

2025-07-15T14:35:55+00:00 ― 6 min read

Audio and Speech Processing Improving Speech Recognition for Polish Language

Research presents new methods for evaluating speech recognition systems in Polish.

2025-07-14T16:44:10+00:00 ― 6 min read

Neurons and Cognition Advancements in EEG Technology for Speech Recovery

Researchers improve speech decoding using EEG to help those with speech impairments.

2025-07-12T16:20:33+00:00 ― 7 min read

Computation and Language Evaluating Speech Recognition in Multilingual Oral Histories

This study assesses speech recognition systems on oral histories recorded in different languages.

2025-07-07T12:29:12+00:00 ― 5 min read

Human-Computer Interaction OpenOmni: Advancing Multimodal Conversation Agents

OpenOmni builds flexible tools for creating and testing conversation agents.

2025-07-01T09:40:42+00:00 ― 8 min read

Computation and Language Improving Cross-Lingual Speech Summarization Techniques

Research focuses on better summarization of spoken conversations across languages.

2025-06-29T05:24:24+00:00 ― 6 min read

Sound Introducing NEST: A New Model for Speech Processing

NEST offers a faster, more efficient approach to self-supervised speech tasks.

2025-06-25T20:06:05+00:00 ― 5 min read

Artificial Intelligence Improving Speech Recognition Through Error Prediction

Research focuses on predicting errors in speech recognition for better accuracy.

2025-06-25T10:09:42+00:00 ― 5 min read

Computation and Language New Benchmark for Hindi Speech Recognition

Research improves speech recognition for Hindi with diverse accents.

2025-06-24T05:11:42+00:00 ― 4 min read

Sound Advancements in Speaker Verification Using Whisper

A novel method improves voice recognition accuracy across multiple languages.

2025-06-22T18:25:00+00:00 ― 5 min read

Sound New Dataset Enhances Speech Recognition Technology

Researchers create LibriheavyMix to improve speech recognition in noisy environments.

2025-06-20T22:41:30+00:00 ― 5 min read

Audio and Speech Processing Evaluating Mamba Model in Speech Processing Tasks

This research analyzes Mamba's performance in speech tasks, emphasizing sound reconstruction and recognition.

2025-06-14T23:22:15+00:00 ― 5 min read

Audio and Speech Processing Acoustic Landmarks: A New Dataset for Speech Processing

Researchers develop a dataset to improve speech recognition and analysis techniques.

2025-06-13T19:50:25+00:00 ― 6 min read

Computation and Language Advancing Speech Recognition for Faetar Language

Efforts to improve speech technology for the under-resourced Faetar language.

2025-06-13T09:18:50+00:00 ― 5 min read

Computation and Language Improving Speech Recognition Accuracy with Language Models

A study on using language models for correcting errors in speech recognition systems.

2025-06-12T22:47:15+00:00 ― 5 min read

Quantum Physics Advancing Speech Recognition with Quantum Computing

A new method improves speech recognition while ensuring data privacy.

2025-06-11T07:18:42+00:00 ― 5 min read

Sound Challenges in Transcribing Police Radio Communications

Research reveals the difficulties in speech recognition of police radio transmissions.

2025-06-10T09:14:55+00:00 ― 7 min read

Robotics Introducing WeHelp: A Robotic Assistant for Wheelchair Users

WeHelp offers robotic support to enhance daily activities for wheelchair users.

2025-06-10T03:04:30+00:00 ― 5 min read

Computation and Language Improving Audio Language Models for Thai and English

This study addresses challenges in audio language models for low-resource languages.

2025-06-08T08:39:55+00:00 ― 5 min read

Audio and Speech Processing EVA: A New Era in Audiovisual Speech Recognition

EVA combines audio and visual signals for better speech recognition accuracy.

2025-06-07T22:08:20+00:00 ― 4 min read

Computation and Language Combining Speech and Language Models for Better Performance

Research evaluates connections between speech and language models for improved recognition and translation.

2025-06-05T22:13:06+00:00 ― 5 min read

Computation and Language Improving ASR Systems with Keyword Lists and Language Models

A method to boost automatic speech recognition by blending keyword lists with language models.

2025-06-05T20:44:45+00:00 ― 4 min read