Latest Articles for Speech Recognition

Computation and Language New Methods in Spoken Language Processing

Researchers explore textless approaches for better understanding of spoken language.

2025-07-13T18:11:30+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Enhancement Techniques

A new model improves speech clarity by targeting noise and echoes.

2025-07-12T15:20:35+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancements in Speech-Based Medical Image Analysis

A new dataset empowers healthcare with speech-based question systems for medical images.

2025-07-12T04:00:30+00:00 ― 6 min read

Computation and Language Optimizing ASR Error Correction with Language Models

A study on enhancing transcription accuracy through improved prompt design.

2025-07-11T15:03:05+00:00 ― 5 min read

Sound Improving Speech Emotion Recognition in Noisy Environments

A new approach enhances SER systems by using noise environment descriptions.

2025-07-11T06:08:40+00:00 ― 6 min read

Sound Innovative Approach to Voice Assistant Training

Combining TTS and real data enhances voice recognition systems effectively.

2025-07-10T00:59:40+00:00 ― 4 min read

Sound Advancements in Silent Speech Interfaces

New method improves converting silent speech to understandable audio.

2025-07-09T22:33:55+00:00 ― 5 min read

Sound Advancements in Audio-Visual Speech Separation Techniques

A new method improves voice separation in noisy settings with multiple speakers.

2025-07-09T16:53:50+00:00 ― 5 min read

Audio and Speech Processing A New Method for Measuring Sound Meaningfulness

This study presents a method to evaluate the meaningfulness of sound signals.

2025-07-09T16:05:15+00:00 ― 6 min read

Audio and Speech Processing Improving Whispered Speech Recognition Technologies

New methods aim to enhance recognition of whispered speech in automatic systems.

2025-07-08T08:30:30+00:00 ― 6 min read

Audio and Speech Processing Improving Speech Recognition with AI Collaboration

AI models enhance accuracy of speech-to-text conversions.

2025-07-07T09:50:10+00:00 ― 5 min read

Audio and Speech Processing Balancing Privacy and Utility in Conversation Analysis

Examining techniques to protect privacy while analyzing recorded conversations.

2025-07-07T04:10:05+00:00 ― 5 min read

Audio and Speech Processing SynesLM: Advancing Audio-Visual Speech Technology

A new model integrates audio and visual data for speech recognition and translation.

2025-07-06T20:04:15+00:00 ― 6 min read

Sound Addressing Accent Recognition Challenges in Speech Technology

New methods improve speech recognition accuracy for diverse accents.

2025-07-05T05:12:15+00:00 ― 4 min read

Computation and Language New Framework Transforms Speech Into Knowledge Graphs

Wav2graph creates knowledge graphs from spoken language for improved AI understanding.

2025-07-04T04:06:10+00:00 ― 7 min read

Sound Introducing MulliVC: Next-Gen Voice Conversion System

MulliVC transforms voices across languages with impressive accuracy and clarity.

2025-07-03T11:54:30+00:00 ― 5 min read

Robotics Robots Learn to Read Human Emotions

New robot navigation system understands spoken commands through emotions.

2025-07-02T20:42:06+00:00 ― 6 min read

Computation and Language New Model TOGGL Enhances Speech Transcription

TOGGL model improves transcription accuracy for overlapping speech situations.

2025-07-02T03:31:10+00:00 ― 5 min read

Computation and Language Improving Speech Recognition with Context Noise Representation Learning

A method to enhance speech recognition quality in noisy environments.

2025-07-01T23:28:15+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Technology with SaSLaW

Researchers develop SaSLaW to enhance machine speech adaptation in various environments.

2025-07-01T16:11:00+00:00 ― 5 min read

Computation and Language Evaluating Bias in Speech Language Models

A new dataset highlights biases in speech models based on gender and age.

2025-06-30T19:07:50+00:00 ― 7 min read

Computation and Language Advancements in Speech Models Through Pruning Techniques

Research reveals how to make speech models smaller and more efficient.

2025-06-29T16:24:35+00:00 ― 5 min read

Sound Improving Keyword Spotting with Adversarial Training

Adversarial training enhances keyword spotting accuracy in synthetic and real speech.

2025-06-28T13:41:20+00:00 ― 5 min read

Computation and Language Evaluating Speech Emotion Recognition Models with New Benchmark

A new benchmark improves evaluation of speech emotion recognition systems across languages and emotions.

2025-06-28T04:15:30+00:00 ― 6 min read

Computation and Language Improving Multilingual Speech Recognition Without Original Data

New methods enhance ASR models for multiple languages, preserving past knowledge.

2025-06-27T15:01:00+00:00 ― 5 min read

Computation and Language Improving Bilingual Speech Recognition with XCB

A new approach enhances recognition of code-switched phrases in bilingual speech.

2025-06-27T11:46:40+00:00 ― 5 min read

Machine Learning Advancements in Sequence Processing with MRConv

A new method for better handling of long data sequences.

2025-06-26T07:21:36+00:00 ― 4 min read

Computation and Language The Role of Prosody and Pragmatics in Speech Technology

Examining how voice patterns affect meaning and technology performance.

2025-06-25T21:43:15+00:00 ― 4 min read

Sound Challenges in Detecting Partially Fake Speech Signals

A look into the complexities of identifying mixed audio tracks.

2025-06-25T06:20:10+00:00 ― 6 min read

Computation and Language O-HuBERT: A Step Forward in Speech Recognition

O-HuBERT enhances speech recognition by separating content and expressive information.

2025-06-24T20:04:24+00:00 ― 5 min read

Computation and Language Enhancing Hindi Speech Recognition with Pseudo-Labeling

A new method improves speech recognition for Hindi using pseudo-labeling techniques.

2025-06-24T06:02:40+00:00 ― 4 min read

Audio and Speech Processing Preserving Tamil Dialects Through Technology

A system to classify Literary and Colloquial Tamil dialects using sound features.

2025-06-23T13:51:00+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Recognition: Whispered vs. Normal

New methods enhance computer understanding of whispered and normal speech.

2025-06-23T08:59:30+00:00 ― 5 min read

Machine Learning Understanding Micro-batch Clipping in Machine Learning

A look at micro-batch clipping and its benefits for model training.

2025-06-23T05:45:10+00:00 ― 5 min read

Audio and Speech Processing Improving Japanese Speech Recognition with GER Techniques

Research shows how LLMs enhance automatic speech recognition in Japanese language.

2025-06-23T04:08:00+00:00 ― 6 min read

Computation and Language How Speech Models Learn Suprasegmentals

This article examines how models recognize tone, stress, and pitch accents.

2025-06-22T21:19:54+00:00 ― 5 min read

Computation and Language Introducing SALSA: A New Method for ASR Improvement

SALSA enhances speech recognition accuracy for low-resource languages by integrating ASR and language models.

2025-06-22T06:16:15+00:00 ― 5 min read

Computation and Language Improving Automatic Speech Recognition with Language Models

New method enhances ASR accuracy using language models for better transcriptions.

2025-06-21T20:33:15+00:00 ― 4 min read

Computation and Language Improving Speaker Tagging Accuracy in Conversations

A new system corrects speaker identification errors for clearer conversation transcripts.

2025-06-21T18:56:05+00:00 ― 7 min read

Sound Advancements in Speech Enhancement Techniques

Improving speech clarity through hybrid filterbanks and neural networks.

2025-06-21T17:18:55+00:00 ― 5 min read