Electrical Engineering and Systems Science - Audio and Speech Processing

Sound Analyzing Music with Dependency Trees

A fresh method for understanding musical relationships through dependency trees.

2025-10-15T13:05:30+00:00 ― 6 min read

Computation and Language Improving Speech Recognition with Long-Context Models

This article discusses new models that enhance speech recognition accuracy by considering longer context.

2025-10-15T12:16:55+00:00 ― 5 min read

Computation and Language Introducing LyricWhiz: Transforming Lyric Transcription

LyricWhiz combines advanced models to improve lyric transcription accuracy across languages.

2025-10-15T09:51:10+00:00 ― 5 min read

Sound Classifying African Birdcalls Through Audio Analysis

A study on using sound recordings to identify different bird species in Africa.

2025-10-15T09:02:35+00:00 ― 6 min read

Neural and Evolutionary Computing Advancing Spiking Neural Networks Through Delay Learning

New method enhances learning in Spiking Neural Networks by incorporating delay adjustments.

2025-10-15T07:25:25+00:00 ― 6 min read

Information Retrieval How Music Recommendations Use Data Analysis

Learn how recommendation systems suggest songs based on user preferences.

2025-10-15T02:33:55+00:00 ― 5 min read

Machine Learning Addressing Dataset Imbalance in Audio Classification

This article discusses challenges and techniques for managing dataset imbalance in audio classification.

2025-10-15T00:08:10+00:00 ― 6 min read

Computation and Language Advancing Speech Recognition for Low-Resource Languages

A new approach improves speech recognition for Romanian using lateral inhibition.

2025-10-14T23:19:35+00:00 ― 5 min read

Audio and Speech Processing Advancing Gender Privacy in Audio: New Insights

Research highlights methods to protect gender privacy in spoken audio.

2025-10-14T21:42:25+00:00 ― 5 min read

Sound Understanding Emotions in Speech Recognition

A look into capturing emotions behind spoken words more accurately.

2025-10-14T16:02:20+00:00 ― 5 min read

Sound Advancing Music Classification with Audio Embeddings

Using pre-trained audio embeddings leads to better music classification models.

2025-10-14T13:36:35+00:00 ― 7 min read

Audio and Speech Processing New Model Enhances Understanding of Speech Processing in the Brain

Research highlights word boundaries' role in speech and EEG activity.

2025-10-14T11:59:25+00:00 ― 6 min read

Sound Advancements in Lip-to-Speech Synthesis Technology

New framework enhances speech clarity from silent videos through improved processing.

2025-10-13T19:47:45+00:00 ― 6 min read

Sound The Science Behind the Mridangam: A Unique Instrument

Discover the blend of art and science in studying the mridangam.

2025-10-13T17:19:15+00:00 ― 8 min read

Computation and Language Advancing Speech Recognition for Low-Resource Languages

A new method improves custom word recognition in ASR systems for languages with limited data.

2025-10-13T13:19:05+00:00 ― 5 min read

Sound Advancements in Fake Audio Detection Using Conformer Models

Researchers develop a Conformer model to improve fake audio detection.

2025-10-13T03:36:05+00:00 ― 5 min read

Audio and Speech Processing Protecting Gender Privacy in Voice Recognition Systems

A method to conceal gender information while ensuring identity verification in voice recognition.

2025-10-12T22:44:35+00:00 ― 5 min read

Audio and Speech Processing Advancing Alzheimer's Detection through Speech Analysis

New methods improve early detection of Alzheimer's using speech and audio analysis.

2025-10-12T19:30:15+00:00 ― 7 min read

Audio and Speech Processing New Database Reveals Insights into Musical Instrument Sounds

Explore sound data from 41 musical instruments with detailed recordings.

2025-10-12T15:27:20+00:00 ― 6 min read

Audio and Speech Processing Advancing Acoustic Word Embeddings for Spoken Language

Research on improving acoustic word embeddings with semantic understanding and multilingual data.

2025-10-12T14:38:45+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Recognition for Dysarthria

New technologies improve communication for individuals with speech disorders.

2025-10-12T13:01:35+00:00 ― 6 min read

Computation and Language Advancements in Real-Time Speech Processing Technology

A new system combines transcription and translation for better communication.

2025-10-12T11:24:25+00:00 ― 4 min read

Sound Advancements in Speech Recognition with Whisper-AT

Whisper-AT combines speech recognition and audio tagging for improved performance.

2025-10-12T08:10:05+00:00 ― 5 min read

Audio and Speech Processing Integrating Speech with Language Models: The Speech-LLaMA Method

A new approach that combines speech with language models for improved translation.

2025-10-11T18:24:10+00:00 ― 4 min read

Sound Advancements in Automatic Piano Transcription

New method improves accuracy in turning piano audio into sheet music.

2025-10-11T14:21:15+00:00 ― 4 min read

Audio and Speech Processing The Evolving Landscape of Generative Audio AI

This article discusses the needs and challenges in generative audio technology.

2025-10-11T13:32:40+00:00 ― 5 min read

Audio and Speech Processing Improving Tuberculosis Detection Through Cough Analysis

New methods use cough sounds and health data to better detect tuberculosis.

2025-10-11T09:29:45+00:00 ― 5 min read

Audio and Speech Processing Voice Changes in Oral Cancer Patients During Treatment

This study examines how voice characteristics evolve in oral cancer patients post-treatment.

2025-10-11T08:41:10+00:00 ― 5 min read

Audio and Speech Processing Advancing Timbre Transfer with DiffTransfer

A new method for changing musical timbre using advanced machine learning techniques.

2025-10-11T07:52:35+00:00 ― 5 min read

Computation and Language Improving Speech Recognition with RNN-Transducers

New methods enhance speech recognition accuracy, addressing common transcription errors.

2025-10-11T04:38:15+00:00 ― 4 min read

Sound Advancements in Articulatory Speech Synthesis

A study on improving vocal sound reproduction through advanced synthesis techniques.

2025-10-11T02:12:30+00:00 ― 5 min read

Sound Introducing VampNet: A New Approach to Music Creation

VampNet transforms music processing through innovative token modeling techniques.

2025-10-11T01:23:55+00:00 ― 4 min read

Sound EchoVest: A New Hope for Hearing Impairment

Affordable wearable technology for individuals with hearing loss.

2025-10-10T23:46:45+00:00 ― 5 min read

Sound Advancing Lyrics Alignment in Music Services

A new model improves timing accuracy for lyrics in music applications.

2025-10-10T18:55:15+00:00 ― 6 min read

Human-Computer Interaction Introducing SnakeSynth: A New Way to Create Sound

A web-based synthesizer that allows users to create music using simple gestures.

2025-10-10T16:29:30+00:00 ― 4 min read

Sound AI and Creativity in Progressive Metal Music

A study on AI's role in generating progressive metal music.

2025-10-10T13:15:10+00:00 ― 6 min read

Sound ShredGP: A New Way to Generate Guitar Music

A model that creates guitar tablature reflecting famous guitarists' styles.

2025-10-10T12:26:35+00:00 ― 5 min read

Sound Advancements in Self-Supervised Learning for Music Analysis

Exploring the potential of self-supervised learning in music information retrieval.

2025-10-10T10:00:50+00:00 ― 6 min read

Sound Audio Analysis in COVID-19 Detection

Using audio signals to identify respiratory health risks.

2025-10-10T09:12:15+00:00 ― 7 min read

Computation and Language SummaryMixing: A New Approach to Speech Recognition

A new method improves speech recognition speed and accuracy while reducing resource use.

2025-10-10T07:35:05+00:00 ― 5 min read