Electrical Engineering and Systems Science - Audio and Speech Processing

RSS

Audio and Speech Processing Advancements in Speech Emotion Conversion Technology

A new approach to changing emotions in speech amidst real-world noise.

2025-10-28T23:21:40+00:00 ― 6 min read

Sound New Method for Improving Language Pronunciation Detection

This study presents a new system for detecting pronunciation errors in language learners.

2025-10-28T21:44:30+00:00 ― 6 min read

Sound A New Approach to Music Rearrangement

The Q A system uses self-supervised learning for innovative music rearrangement.

2025-10-28T20:07:20+00:00 ― 6 min read

Sound Improving Expressive Speech Synthesis with TVC-GMM

A new method enhances text-to-speech quality and emotional expression.

2025-10-28T18:30:10+00:00 ― 5 min read

Audio and Speech Processing Enhancing Speech Clarity with Audio-Visual Techniques

Researchers combine audio and visual data to improve speech understanding in noisy places.

2025-10-28T17:41:35+00:00 ― 4 min read

Audio and Speech Processing Active Noise Control: Reducing Unwanted Sound

Discover how active noise control technology is changing our sound experience.

2025-10-28T16:53:00+00:00 ― 5 min read

Audio and Speech Processing Advancing Speech Recognition with Smaller Models

Techniques to reduce model size while preserving performance are emerging.

2025-10-28T15:15:50+00:00 ― 4 min read

Audio and Speech Processing Advancements in Digital Phasing Effects

New model mimics analog phasing effects with improved learning techniques.

2025-10-28T12:50:05+00:00 ― 5 min read

Computation and Language Advancing Multilingual Speech Recognition with DistilXLSR

A new model reduces size while improving multilingual speech recognition.

2025-10-28T11:12:55+00:00 ― 6 min read

Computation and Language Enhancing Speech Recognition for Diverse Accents

A new method improves speech recognition accuracy for African accents.

2025-10-28T09:35:45+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Technology Evaluations through Detailed Reporting

Examining the impact of detailed evaluations on speech synthesis systems.

2025-10-28T07:58:35+00:00 ― 5 min read

Audio and Speech Processing Advancements in Echo Cancellation Technology

Improving voice clarity through effective echo cancellation techniques and machine learning.

2025-10-28T05:32:50+00:00 ― 6 min read

Audio and Speech Processing Real-Time Tracking of Singing Voices with SingNet

SingNet improves beat tracking in singing voices using past data.

2025-10-28T04:44:15+00:00 ― 6 min read

Computation and Language Advancements in Speech Recognition for Multiple Speakers

A new system improves speech recognition in multi-speaker settings.

2025-10-28T00:41:20+00:00 ― 6 min read

Audio and Speech Processing Advancements in Lip-to-Speech Technology

LipVoicer generates clear speech from silent videos using advanced lip-reading methods.

2025-10-27T21:27:00+00:00 ― 5 min read

Audio and Speech Processing Advancing Dysarthric Speech Recognition with Innovative Approaches

New methods aim to improve communication for individuals with dysarthria.

2025-10-27T21:01:09+00:00 ― 6 min read

Audio and Speech Processing Combining Speech Processing with Visual Learning

This study examines the benefits of merging speech processing with visual data.

2025-10-27T20:38:25+00:00 ― 6 min read

Computation and Language Advancing Predictions with Multiple Scores in Gaussian Processes

New method improves predictions by considering multiple expert scores.

2025-10-27T19:49:50+00:00 ― 6 min read

Audio and Speech Processing Reevaluating Speaker Anonymization and Vocoder Impact

A fresh look at speaker anonymization and the crucial role of vocoders.

2025-10-27T18:12:40+00:00 ― 5 min read

Computation and Language Assessing Whisper's Performance on Arabic Dialects

A look at how Whisper handles various Arabic dialects and accents.

2025-10-27T13:21:10+00:00 ― 5 min read

Computation and Language Video-LLaMA: A New Approach to Video Understanding

A program combining visual and audio data to enhance video comprehension.

2025-10-27T11:44:00+00:00 ― 5 min read

Computation and Language Advancing Speech Act Recognition in Bengali

A new method improves speech act recognition in Bengali using audio and text analysis.

2025-10-27T10:55:25+00:00 ― 5 min read

Audio and Speech Processing The Role of Laughter in Machine Interaction

Studying laughter can improve how machines interact with people.

2025-10-27T08:29:40+00:00 ― 5 min read

Sound Analyzing Music with BERT: A New Approach

Research explores BERT's potential in bar-level music analysis.

2025-10-27T07:41:05+00:00 ― 5 min read

Computers and Society Engaging Math Learning for Young Children

A new system enhances math learning at home through fun interactions.

2025-10-27T07:08:49+00:00 ― 6 min read

Computation and Language Efficient Speech Recognition Adaptation Using Text Data

A new method enhances speech recognition models using only text data for adaptation.

2025-10-27T06:52:30+00:00 ― 5 min read

Sound Advancing Melody Harmonization with Emotional Context

A new model improves melody harmonization by considering emotional factors.

2025-10-26T21:58:05+00:00 ― 6 min read

Machine Learning Innovative Dance Creation Using Sound Words

New methods use onomatopoeia to inspire unique dance movements.

2025-10-26T20:20:55+00:00 ― 5 min read

Sound Advancements in Speech Countermeasure Systems

Researchers improve detection of machine-generated speech using phase information adjustments.

2025-10-26T17:55:10+00:00 ― 6 min read

Digital Libraries Reproducibility Challenges at Interspeech Conferences

A look at reproducibility issues in speech processing research.

2025-10-26T16:18:00+00:00 ― 7 min read

Computation and Language Advancements in Language Identification with LASR Framework

A new approach improves speech language identification using self-supervised learning and labels.

2025-10-26T08:12:10+00:00 ― 6 min read

Sound Improving Arabic Dysarthric Speech Recognition

A new method enhances speech recognition for dysarthric Arabic speakers.

2025-10-26T07:23:35+00:00 ― 5 min read

Computation and Language Innovative Speech Recognition Tool for Low-Resource Languages

Allophant enhances phoneme recognition for languages with limited data.

2025-10-26T06:35:00+00:00 ― 5 min read

Sound SANGEET: A Structured Dataset for Hindustani Music

Introducing SANGEET, a detailed dataset on Hindustani Classical Music.

2025-10-26T04:57:50+00:00 ― 4 min read

Audio and Speech Processing Advancing Word Timing in Speech Recognition Systems

Improving how speech recognition systems estimate word timing for better accuracy.

2025-10-26T01:43:30+00:00 ― 4 min read

Audio and Speech Processing Improving Speech Recognition with Advanced Models

New methods enhance speech processing in language models.

2025-10-26T00:54:55+00:00 ― 5 min read

Sound Addressing the Challenge of Fake Audio Detection

A new method aims to improve fake audio detection without losing past knowledge.

2025-10-25T16:00:30+00:00 ― 6 min read

Audio and Speech Processing Advancements in Unsupervised Speech Recognition

A new framework enhances the study of unsupervised speech recognition systems.

2025-10-25T13:34:45+00:00 ― 6 min read

Sound Creating Melodies from Simple Beats

This project helps anyone compose music using basic beats and advanced computer methods.

2025-10-25T11:57:35+00:00 ― 5 min read

Computation and Language Advancements in Self-Supervised Learning for Speech Recognition

Self-supervised models reveal insights into phonetic and phonemic distinctions in speech.

2025-10-25T10:20:25+00:00 ― 5 min read