Latest Articles for Speech Processing

Computation and Language RoDia: A New Dataset for Romanian Dialect Identification

RoDia provides crucial audio samples for identifying Romanian dialects.

2025-09-16T15:58:10+00:00 ― 5 min read

Sound Advancements in Automatic Speech Recognition Systems

New methods improve accuracy and speed in speech recognition technology.

2025-09-15T06:46:15+00:00 ― 6 min read

Sound Advancements in Speech Generation Technology

Introducing a framework for more natural and expressive speech synthesis.

2025-09-15T01:06:10+00:00 ― 6 min read

Computation and Language Advancements in Direct Text to Speech Translation

New systems improve translation from text to spoken language without intermediates.

2025-09-11T20:59:20+00:00 ― 4 min read

Sound New Method to Detect Synthetic Speech

A method improves detection of synthetic voices and identifies their creators.

2025-09-10T20:41:50+00:00 ― 5 min read

Sound Advancements in Tiny Speech Enhancement Models

New methods improve tiny models for better speech enhancement using less resources.

2025-09-10T19:53:15+00:00 ― 5 min read

Sound Improving Speaker Diarization with Semantic Information

A new approach enhances speaker diarization by integrating semantic data into the process.

2025-09-08T20:06:50+00:00 ― 5 min read

Audio and Speech Processing Advancements in Speech Emotion Recognition: A Multilingual Approach

Research shows improved accuracy in recognizing emotions from speech across languages.

2025-09-08T16:03:55+00:00 ― 4 min read

Sound Advancements in Text-Based Speech Editing

FluentEditor improves audio editing by focusing on natural flow and consistency.

2025-09-07T20:37:55+00:00 ― 4 min read

Audio and Speech Processing Improving Speech Recognition with Memory Networks

New techniques enhance ASR systems for better long speech recognition.

2025-09-06T03:20:10+00:00 ― 5 min read

Audio and Speech Processing Advancements in Speaker Anonymization Using Neural Audio Codecs

A new audio processing method enhances speaker anonymity while maintaining speech clarity.

2025-09-05T01:25:30+00:00 ― 5 min read

Sound Innovative Speech Separation Using Audio and Visual Data

Research introduces an effective method for improving speech clarity in noisy settings.

2025-09-02T00:33:00+00:00 ― 6 min read

Computer Vision and Pattern Recognition Transforming Avatar Movements for Realism

A new method enhances avatar speech through natural movements and expressions.

2025-08-24T01:06:54+00:00 ― 6 min read

Computer Vision and Pattern Recognition Improving Gesture Detection Through Speech Analysis

Research reveals new methods for detecting gestures in relation to speech patterns.

2025-08-17T01:14:24+00:00 ― 7 min read

Audio and Speech Processing CLaM-TTS: Advancing Text-to-Speech Technology

CLaM-TTS improves speech synthesis using advanced techniques for better efficiency and quality.

2025-08-13T08:28:55+00:00 ― 6 min read

Sound Navigating Vulnerabilities in Speech Emotion Recognition

This study examines the weaknesses of SER models against adversarial attacks across languages.

2025-08-08T21:35:55+00:00 ― 5 min read

Audio and Speech Processing Improving Voice Clarity in Noisy Environments

New techniques enhance voice reconstruction in challenging settings using limited data.

2025-08-05T02:06:00+00:00 ― 7 min read

Audio and Speech Processing Reducing Cross-Talk for Clearer Speech

A new system improves speech clarity in multi-speaker environments.

2025-08-02T14:10:50+00:00 ― 5 min read

Machine Learning Advancements in Speech Decoding Through Brain Data

Researchers utilize self-supervised learning to improve speech decoding from brain activity.

2025-08-01T14:12:12+00:00 ― 7 min read

Audio and Speech Processing Advancements in Speech-to-Singing Technology

New method improves conversion from speech to singing using self-supervised learning.

2025-08-01T09:50:25+00:00 ― 7 min read

Sound Advancements in Emotion Recognition Through Speech

New methods improve how machines recognize emotions in human speech.

2025-07-30T18:09:50+00:00 ― 5 min read

Sound Advancing Voice Conversion with Spatial Awareness

Introducing spatial voice conversion to enhance audio realism and immersion.

2025-07-27T01:54:15+00:00 ― 6 min read

Computation and Language Examining Italy's Language Diversity Through Speech Data

A study on Italy's regional languages using advanced speech analysis techniques.

2025-07-25T12:21:12+00:00 ― 9 min read

Audio and Speech Processing Advancements in Phoneme Alignment Techniques

A new method enhances phoneme alignment accuracy for various speech applications.

2025-07-24T10:44:45+00:00 ― 5 min read

Computation and Language A New Approach to Speech Representation Learning

This article presents a dual encoder system for effective speech representation learning.

2025-07-24T01:50:20+00:00 ― 6 min read

Sound Improving Speech Quality Monitoring on Devices

Advancements in predicting speech quality using efficient methods for mobile devices.

2025-07-21T13:55:10+00:00 ― 5 min read

Sound The Evolution of Automatic Speech Recognition Systems

A look at the progress in speech recognition technologies and methods.

2025-07-15T11:21:35+00:00 ― 5 min read

Computation and Language Enhancing Self-Supervised Learning for Speech Processing

A new model improves efficiency in speech processing with less energy consumption.

2025-07-14T00:32:30+00:00 ― 4 min read

Sound Advances in Hearing Aid Technology Using Machine Learning

New machine learning models improve speech clarity for hearing aid users.

2025-07-13T23:43:55+00:00 ― 6 min read

Sound Advancements in Speech Emotion Recognition Technology

New methods improve machine understanding of human emotions in speech.

2025-07-12T18:34:55+00:00 ― 4 min read

Computation and Language Improving Speaker Identification in Dialogues

New models enhance the identification of speakers in dialogue content.

2025-07-12T16:54:42+00:00 ― 6 min read

Audio and Speech Processing Speech Codecs and Emotional Preservation

Examining how codecs retain emotional tones in voice data.

2025-07-12T06:26:10+00:00 ― 5 min read

Audio and Speech Processing New Method for Acoustic Parameter Estimation Using AI

A novel approach to estimating sound traits in challenging environments using deep learning.

2025-07-09T03:07:55+00:00 ― 5 min read

Computation and Language Improving Speech Recognition for Specialized Terms

Research enhances ASR systems using language models for better accuracy.

2025-07-06T20:41:12+00:00 ― 7 min read

Audio and Speech Processing Advancing Speech Tech for Arabic Dialects

New framework enhances speech recognition for diverse Arabic dialects.

2025-07-05T10:52:20+00:00 ― 4 min read

Audio and Speech Processing Advancements in Voice Anonymisation Techniques

New methods improve privacy while preserving speech content and emotions.

2025-07-03T15:57:25+00:00 ― 6 min read

Computation and Language The Impact of Annotation Methods on Speech Summarization

This study examines how different summarization methods affect quality and content.

2025-07-02T05:56:55+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Recognition with Contextual Keywords

A new system enhances speech recognition by using contextual keywords for better accuracy.

2025-06-29T22:53:15+00:00 ― 5 min read

Sound Introducing NEST: A New Model for Speech Processing

NEST offers a faster, more efficient approach to self-supervised speech tasks.

2025-06-25T20:06:05+00:00 ― 5 min read

Sound Advancements in Speech Emotion Recognition with Wav2Small

Wav2Small enhances emotion detection in speech with reduced resource needs.

2025-06-25T10:23:05+00:00 ― 5 min read