This study explores bias in audio models used for instrument recognition.
― 6 min read
This study explores a deep learning approach to accurately classify music genres.
― 7 min read
Research explores methods for identifying topics directly from audio recordings.
― 5 min read
A new method improves tracking of sound source locations in shallow-water environments.
― 7 min read
A new model connects phonetics and acoustics for better speech technology.
― 7 min read
This study highlights the role of self-supervised learning in detecting emotions from audio data.
― 6 min read
A new interface simplifies music creation for beginners using text-to-audio technology.
― 5 min read
Research highlights the improvements AI can bring to hearing aids in noisy settings.
― 5 min read
New method refines mislabeled data, enhancing music source separation.
― 6 min read
Advances in using brain activity to decode which sounds a person is focusing on.
― 5 min read
A new method improves sound clarity and localization using a hybrid approach.
― 5 min read
CMNet improves voice clarity by reducing echo in communication devices.
― 5 min read
A new method enhances the classification of underwater sounds from vessels using neural networks.
― 5 min read
Research aims to improve clarity in hearing aids for better communication.
― 5 min read
A new method to improve speech quality using energy-efficient networks.
― 5 min read
Research into cow communication aims to improve dairy farming practices.
― 5 min read
MuReNN combines parametric and nonparametric models for improved audio analysis.
― 5 min read
Combining audio and language technologies to advance animal communication research.
― 4 min read
Research shows benefits of multiple microphones for detecting and locating speakers.
― 5 min read
Introducing a new model for clearer speech in noisy environments.
― 5 min read
A new method uses images to improve audio matching, making audio environments sound more realistic.
― 7 min read
A dataset connects emotions to MIDI songs using song lyrics analysis.
― 7 min read
Improving speech quality through innovative methods and multilingual datasets.
― 6 min read
New techniques aim to improve audio quality by addressing packet loss.
― 5 min read
New systems are designed to detect fake audio recordings with improved accuracy.
― 5 min read
New systems improve speaker identification using both audio and visual data.
― 5 min read
MoisesDB offers a detailed dataset for advanced music sound separation.
― 6 min read
Using LLMs to create a vast dataset for music captioning.
― 6 min read
Researchers are using new technologies to improve pronunciation training for language learners.
― 5 min read
HierVST transforms voices seamlessly, enhancing audio quality without needing extensive data.
― 5 min read
A unified approach enhances music analysis by integrating multiple structural elements.
― 5 min read
Research focuses on classifying child versus adult speech using unlabelled data.
― 5 min read
Research develops a model to accurately measure engagement in conversations.
― 6 min read
DAVIS offers a fresh way to tackle audio and visual sound separation.
― 5 min read
A new method enhances accurate identification of sound-producing objects in videos.
― 6 min read
DiffProsody enhances speech synthesis speed and quality through innovative prosody generation.
― 4 min read
Deep learning models improve sound field reconstruction in complex environments.
― 7 min read
New technology aims to restore music quality lost in loudness compression.
― 5 min read
A new method promises quicker identification of speech and language disorders such as aphasia.
― 5 min read
A new method uses ultrasonic sounds to fool speech recognition systems without being detected.
― 6 min read