Urhythmic enhances voice conversion by focusing on speech rhythm.

2025-10-09T21:52:05+00:00 ― 5 min read

Sound Advancements in Real-Time Music Information Retrieval for Guitarists

Research enhances percussive fingerstyle techniques for guitarists using real-time sound retrieval.

2025-10-09T15:23:25+00:00 ― 7 min read

Computation and Language Advancements in Speech Intent Classification and Slot Filling

This article explores a new model for speech intent and slot identification.

2025-10-09T12:09:05+00:00 ― 6 min read

Sound Detecting the Truth in Synthetic Voices

As voice cloning technology advances, reliable detection methods are crucial.

2025-10-09T06:29:00+00:00 ― 6 min read

Computation and Language Improving Speech Recognition for Older Adults

A study enhances ASR for older speakers, using innovative techniques.

2025-10-09T01:37:30+00:00 ― 6 min read

Computation and Language Advancements in Speech Summarization with BASS

BASS improves summarization of long audio by processing in blocks.

2025-10-08T15:05:55+00:00 ― 5 min read

Sound Risks of Stealthy Backdoor Attacks in Speech Recognition Systems

New methods pose serious security risks for speech recognition technology.

2025-10-08T14:17:20+00:00 ― 7 min read

Audio and Speech Processing New Dataset Aims to Improve Hebrew Speech Recognition

ivrit.ai provides vital resources for enhancing Hebrew ASR technology.

2025-10-08T05:22:55+00:00 ― 6 min read

Computation and Language Advancements in Multilingual Speech Translation Technology

Innovative techniques are transforming how we translate spoken language.

2025-10-08T02:57:10+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speaker Anonymisation Techniques

New methods aim to hide speaker identities while maintaining speech clarity.

2025-10-08T01:20:00+00:00 ― 5 min read

Sound Advancing Speech Recognition with Time-Sparse Transducer

New model improves speech recognition speed and memory usage.

2025-10-07T23:42:50+00:00 ― 6 min read

Sound Introducing the JAZZVAR Dataset for Jazz Piano Variations

A new dataset highlights the creative interpretations of jazz pianists on classic standards.

2025-10-07T14:48:25+00:00 ― 5 min read

Audio and Speech Processing Advancements in HRTF Modeling for Realistic Sound

New methods improve sound representation in virtual and augmented reality.

2025-10-07T10:45:30+00:00 ― 7 min read

Sound FlexiAST: A Flexible Approach to Audio Processing

FlexiAST allows models to adapt to various audio patch sizes efficiently.

2025-10-07T09:56:55+00:00 ― 6 min read

Machine Learning Advances in Speech Analysis for Throat Cancer Detection

Researchers are using machine learning to improve throat cancer diagnosis through speech analysis.

2025-10-07T06:42:35+00:00 ― 6 min read

Sound Introducing Polyffusion: A New Way to Create Music Scores

Polyffusion uses visual techniques to generate and control music effectively.

2025-10-07T01:51:05+00:00 ― 6 min read

Audio and Speech Processing Advancements in Detecting Alzheimer's Through Speech Analysis

Researchers are using speech patterns to detect Alzheimer's earlier and more effectively.

2025-10-07T00:13:55+00:00 ― 6 min read

Sound New Framework Improves Speech Recognition with Metadata

Integrating metadata enhances performance in speech tasks like language identification.

2025-10-06T12:05:10+00:00 ― 6 min read

Audio and Speech Processing Advancements in Transducer Models for Speech Recognition

This article discusses the Transducer model's real-time capabilities and recent improvements.

2025-10-06T11:16:35+00:00 ― 6 min read

Audio and Speech Processing Bias in Transfer Learning for Music Recognition

This study explores bias in audio models used for instrument recognition.

2025-10-06T09:39:25+00:00 ― 6 min read

Sound Advancements in Music Genre Classification Using Deep Learning

This study explores a deep learning approach to accurately classify music genres.

2025-10-06T08:50:50+00:00 ― 7 min read

Sound Automated Sound Source Localization in Shallow Waters

New method improves sound source location tracking in shallow aquatic environments.

2025-10-05T13:27:48+00:00 ― 7 min read

Sound Advancing Speech Technology with SCRAPS

A new model connects phonetics and acoustics for better speech technology.

2025-10-05T13:24:50+00:00 ― 7 min read

Sound Advancements in Emotion Recognition with Self-Supervised Learning

This study highlights the role of self-supervised learning in detecting emotions from audio data.

2025-10-05T08:33:20+00:00 ― 6 min read

Audio and Speech Processing Making Music Easy for Everyone

A new interface simplifies music creation for beginners using text-to-audio technology.

2025-10-04T18:47:25+00:00 ― 5 min read

Sound Evaluating Hearing Aids and AI Speech Enhancement

Research highlights the improvements AI can bring to hearing aids in noisy settings.

2025-10-04T17:58:50+00:00 ― 5 min read

Audio and Speech Processing Improving Music Source Separation with Noisy Data

New method refines mislabeled data, enhancing music source separation.

2025-10-04T10:41:35+00:00 ― 6 min read

Sound New Methods in Auditory Attention Decoding

Advancements in decoding how people focus on sounds using brain activity.

2025-10-04T07:43:21+00:00 ― 5 min read

Audio and Speech Processing Advancements in Sound Field Synthesis Techniques

A new method improves sound clarity and localization using a hybrid approach.

2025-10-04T07:27:15+00:00 ― 5 min read

Audio and Speech Processing Advancements in Acoustic Echo Cancellation with CMNet

CMNet improves voice clarity by reducing echo in communication devices.

2025-10-04T06:38:40+00:00 ― 5 min read

Sound Improving Underwater Target Recognition with Neural Networks

A new method enhances the classification of underwater sounds from vessels using neural networks.

2025-10-04T05:01:30+00:00 ― 5 min read

Sound Advancements in Hearing Aid Technology

Research aims to improve clarity in hearing aids for better communication.

2025-10-04T02:35:45+00:00 ― 5 min read

Sound Advancements in Speech Enhancement Using Spiking Neural Networks

A new method to improve speech quality using energy-efficient networks.

2025-10-03T21:44:15+00:00 ― 5 min read

Sound Understanding Cow Vocalizations During Stress

Research highlights cow communication to improve dairy farming practices.

2025-10-03T15:15:35+00:00 ― 5 min read

Sound Introducing MuReNN: A New Model for Audio Processing

MuReNN combines parametric and nonparametric models for improved audio analysis.

2025-10-03T14:14:43+00:00 ― 5 min read

Machine Learning BioLingual: A New Era in Bioacoustics

Revolutionizing animal communication research with innovative audio and language integration.

2025-10-03T11:32:00+00:00 ― 4 min read

Audio and Speech Processing Advancements in Speech Enhancement with PCNN

Introducing a new model for clearer speech in noisy environments.

2025-10-03T07:58:20+00:00 ― 5 min read

Multimedia Advancements in Visual Acoustic Matching

A new method improves audio matching using images, enhancing realism in audio environments.

2025-10-03T03:55:25+00:00 ― 7 min read

Audio and Speech Processing Advancements in Speech Enhancement Techniques

Improving speech quality through innovative methods and multilingual datasets.

2025-10-02T23:52:30+00:00 ― 6 min read

Sound Effective Detection of Deepfake Audio

New systems are designed to detect fake audio recordings with improved accuracy.

2025-10-02T18:12:25+00:00 ― 5 min read

Computer Science - Sound