Electrical Engineering and Systems Science - Audio and Speech Processing

RSS

Multimedia Linking Emotions in Images to Music Search

A new system connects emotional images to music for improved discovery.

2025-09-21T16:37:05+00:00 ― 6 min read

Sound Improving Music Quality for Everyday Recordings

A new system enhances audio recordings for better listening experiences.

2025-09-21T15:48:30+00:00 ― 6 min read

Sound Improving Bioacoustics with Active Learning Techniques

A novel approach reduces data labeling while enhancing audio classification accuracy.

2025-09-21T14:11:20+00:00 ― 5 min read

Sound Advancements in Text-to-Speech Technology for Natural Speech

A new system improves speech quality and expressiveness for paragraph synthesis.

2025-09-21T11:45:35+00:00 ― 5 min read

Sound Evaluating the Quality of AI-Generated Music

Discover methods for assessing AI-created music quality through subjective and objective evaluation.

2025-09-21T10:08:25+00:00 ― 5 min read

Sound New Insights into Tongue Movement During Speech

Research focuses on tongue movements to aid speech therapy and language learning.

2025-09-21T04:28:20+00:00 ― 4 min read

Audio and Speech Processing Gender Impact on Voice Biometric Systems

This study examines how gender affects voice biometrics' utility, privacy, and fairness.

2025-09-20T19:33:55+00:00 ― 6 min read

Sound Improving Voice Synthesis with Pruning Techniques

New pruning methods enhance zero-shot multi-speaker text-to-speech model performance.

2025-09-20T15:31:00+00:00 ― 7 min read

Computation and Language Understanding Emotions in Emergency Conversations

Research on emotion recognition in emergency call interactions reveals significant insights.

2025-09-20T14:42:25+00:00 ― 4 min read

Audio and Speech Processing Advancements in Self-Supervised Learning for Speech Recognition

New methods for selecting speech data minimize labeling while improving recognition accuracy.

2025-09-20T13:53:50+00:00 ― 5 min read

Sound Advancing Speech Emotion Recognition with Time-Frequency Transformer

A new method enhances emotion recognition in speech by analyzing time and frequency.

2025-09-20T12:16:40+00:00 ― 5 min read

Quantum Physics Quantum Technology Meets Music Creation

Explore how quantum tools transform music production for artists.

2025-09-20T08:57:42+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Quality Assessment with Preference Scores

A new method enhances speech quality ranking using listener preference scores.

2025-09-20T07:25:10+00:00 ― 5 min read

Sound Improving Speech Recognition for Stutterers

A method to enhance ASR systems for users who stutter.

2025-09-20T06:36:35+00:00 ― 5 min read

Sound Access Issues in the Million Song Dataset

Challenges in accessing audio data hinder research opportunities.

2025-09-20T00:07:55+00:00 ― 5 min read

Sound Advancements in Voice Isolation Technology

New methods improve clarity in noisy environments through advanced sound processing.

2025-09-19T22:30:45+00:00 ― 5 min read

Audio and Speech Processing Advancements in French Speech Synthesis Technology

A newly developed system generates realistic French speech for a competition.

2025-09-19T21:42:10+00:00 ― 5 min read

Sound Advancements in Keyword Spotting Technology

New methods improve efficiency and accuracy in voice recognition systems.

2025-09-19T17:39:15+00:00 ― 5 min read

Computation and Language Advancements in Speech Language Modeling

New methods improve speech processing and generation in language models.

2025-09-19T16:02:05+00:00 ― 5 min read

Sound Advancements in Noise Suppression Technology

New techniques improve audio clarity in noisy environments.

2025-09-19T15:13:30+00:00 ― 6 min read

Audio and Speech Processing Advancing Few-Shot Keyword Spotting with Reading Speech Data

New methods improve keyword spotting using available reading speech data.

2025-09-19T13:36:20+00:00 ― 4 min read

Audio and Speech Processing Advancements in Sound Extraction Technology

A look into region-customizable sound extraction methods for clearer audio.

2025-09-19T07:56:15+00:00 ― 5 min read

Audio and Speech Processing Advancements in Formant Tracking for Speech Processing

New single-step methods improve accuracy in formant tracking for speech sounds.

2025-09-19T02:16:10+00:00 ― 4 min read

Audio and Speech Processing New Insights in Spoken Language Technology

A fresh look at advancements in spoken language science methods and applications.

2025-09-19T01:27:35+00:00 ― 6 min read

Information Retrieval Challenges in Learning from Music Videos

This study examines the difficulties of using contrastive learning for music video understanding.

2025-09-18T17:21:45+00:00 ― 6 min read

Computation and Language Connecting Speech with Language Models: The BLSP Method

A new approach enhances the integration of speech with language models.

2025-09-18T15:44:35+00:00 ― 7 min read

Audio and Speech Processing Advancing Speech Movement Prediction in Dysarthria

Using self-supervised learning to enhance predictions of speech movements in dysarthria.

2025-09-18T12:30:15+00:00 ― 5 min read

Sound Evaluating the Dance-Music Connection with MDSC

A new metric to assess the alignment of dance styles with music.

2025-09-18T11:41:40+00:00 ― 7 min read

Computation and Language The Role of Pretrained Language Models in TTS

Examining how pretrained language models improve text-to-speech quality.

2025-09-17T20:18:35+00:00 ― 5 min read

Audio and Speech Processing BWSNet: Advancing Audio Perception Evaluation

A new model evaluates audio perception through human feedback using Best-Worst Scaling.

2025-09-17T13:49:55+00:00 ― 5 min read

Sound Advancements in Music Source Separation Techniques

New methods improve the clarity of audio components in music tracks.

2025-09-17T08:09:50+00:00 ― 6 min read

Audio and Speech Processing Improving Cinematic Audio Separation with BandIt

BandIt enhances audio source separation using innovative deep learning techniques.

2025-09-17T06:32:40+00:00 ― 5 min read

Audio and Speech Processing Personalizing Speech Emotion Recognition Systems

Tailoring emotion recognition technology improves accuracy for diverse speakers.

2025-09-17T04:55:30+00:00 ― 6 min read

Sound Voice Identity Morphing: A Threat to Security

Study reveals serious threats in voice recognition using morph samples.

2025-09-17T04:06:55+00:00 ― 5 min read

Sound Batik-plays-Mozart: A Comprehensive Piano Dataset

A detailed dataset combining Mozart's sonatas with piano performances and expert annotations.

2025-09-17T03:18:20+00:00 ― 5 min read

Audio and Speech Processing Enhancing Audio Quality for Remote Meetings

A new earbud design improves sound clarity using bone conduction technology.

2025-09-17T02:29:45+00:00 ― 7 min read

Audio and Speech Processing Advancements in Pitch Estimation with Self-Supervised Learning

A new lightweight model improves pitch estimation using self-supervised learning techniques.

2025-09-17T00:04:00+00:00 ― 7 min read

Sound Advancements in Music Structure Analysis Techniques

A new approach to improve music segment identification and analysis.

2025-09-16T23:15:25+00:00 ― 5 min read

Sound Detecting Fake Songs: A New Dataset Approach

New methods developed to identify fake songs amidst growing concerns.

2025-09-16T22:26:50+00:00 ― 5 min read

Computation and Language Improving Speech Recognition with Cleancoder

Cleancoder enhances ASR systems by reducing background noise for clearer speech understanding.

2025-09-16T21:38:15+00:00 ― 4 min read