Electrical Engineering and Systems Science - Audio and Speech Processing

RSS

Audio and Speech Processing Vibravox: Advancing Speech Recognition Technology

A new dataset aims to improve speech capture using body-conduction sensors.

2025-07-15T14:35:55+00:00 ― 6 min read

Computer Vision and Pattern Recognition New Method for Detecting Deepfakes

A novel approach improves deepfake detection using audio-visual analysis.

2025-07-15T12:10:10+00:00 ― 5 min read

Sound The Evolution of Automatic Speech Recognition Systems

A look at the progress in speech recognition technologies and methods.

2025-07-15T11:21:35+00:00 ― 5 min read

Sound Improving Stuttering Detection with MMSD-Net

A new method enhances stuttering detection by combining audio, video, and text data.

2025-07-15T07:18:40+00:00 ― 5 min read

Audio and Speech Processing Advancements in Speaker and Language Diarization Systems

A team improves audio processing for speaker and language identification.

2025-07-15T03:15:45+00:00 ― 4 min read

Audio and Speech Processing Advancements in Emotion Recognition from Speech

Research on detecting human emotions through speech shows promise for various applications.

2025-07-15T00:50:00+00:00 ― 5 min read

Sound Innovative Sound Generation for 3D Human Models

A new method enhances sound creation for realistic 3D human models.

2025-07-15T00:01:25+00:00 ― 7 min read

Sound Estimating Breathing Rates Through Speech Analysis

This study reveals how speech can estimate breathing rates using advanced models.

2025-07-14T23:12:50+00:00 ― 5 min read

Sound GraphMuse: A New Tool for Music Analysis

GraphMuse streamlines the analysis of symbolic music data with advanced machine learning techniques.

2025-07-14T19:58:30+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Recognition for Polish Language

Research presents new methods for evaluating speech recognition systems in Polish.

2025-07-14T16:44:10+00:00 ― 6 min read

Audio and Speech Processing Improving Number Formatting in ASR Transcripts

This article discusses ways to enhance numeric expression formatting in automatic transcripts.

2025-07-14T15:55:35+00:00 ― 5 min read

Audio and Speech Processing Advancements in Music Classification Techniques

Self-supervised learning transforms music recognition through innovative methods.

2025-07-14T12:41:15+00:00 ― 6 min read

Audio and Speech Processing MSceneSpeech: Advancing Mandarin Speech Synthesis

A new dataset enhances machine speech for Mandarin, aiming for natural expression.

2025-07-14T09:26:55+00:00 ― 6 min read

Multimedia Advancing Sound Source Localization through Audio-Visual Integration

A study on improving sound source localization by better using audio and visual information.

2025-07-14T06:12:35+00:00 ― 7 min read

Machine Learning Assessing Cognitive Health through Speech Analysis

A new framework analyzes speech to identify mild cognitive impairment across languages.

2025-07-14T05:24:00+00:00 ― 5 min read

Sound AI and the Challenge of Diverse Music Genres

Exploring AI's impact on underrepresented music styles.

2025-07-14T02:58:15+00:00 ― 6 min read

Computation and Language Improving Text-to-Speech for Indian Languages

A method to enhance TTS systems for better pronunciation of OOV words in India.

2025-07-14T02:09:40+00:00 ― 5 min read

Computation and Language Enhancing Self-Supervised Learning for Speech Processing

A new model improves efficiency in speech processing with less energy consumption.

2025-07-14T00:32:30+00:00 ― 4 min read

Sound Advances in Hearing Aid Technology Using Machine Learning

New machine learning models improve speech clarity for hearing aid users.

2025-07-13T23:43:55+00:00 ― 6 min read

Sound Studying Social Interactions with Low-Frequency Audio

Research explores low-frequency audio to protect privacy in social behavior studies.

2025-07-13T21:18:10+00:00 ― 5 min read

Audio and Speech Processing Understanding Sound Propagation in Connected Spaces

Exploring how sound behaves in multi-room environments and its implications in technology.

2025-07-13T20:29:35+00:00 ― 6 min read

Audio and Speech Processing AI Tools Transform Music Editing Process

New AI tools are simplifying music editing with innovative techniques and improved precision.

2025-07-13T18:52:25+00:00 ― 5 min read

Computation and Language A New Approach to Speech Translation: Preset-Voice Matching

Preset-Voice Matching improves speech translation while ensuring privacy and reducing risks.

2025-07-13T18:03:50+00:00 ― 6 min read

Sound Composer's Assistant 2: A New Tool for Musicians

A new system helps musicians create music with greater control and precision.

2025-07-13T14:00:55+00:00 ― 7 min read

Sound Evaluating AI's Impact on Music Originality

A new tool to assess replication in AI-made music.

2025-07-13T12:23:45+00:00 ― 7 min read

Sound Open Audio Generation: A New Model

A new text-to-audio model using only public data.

2025-07-13T11:35:10+00:00 ― 5 min read

Computation and Language Challenges and Innovations in Code-Switching Research

A new dataset aims to improve understanding of code-switching across multiple languages.

2025-07-13T09:58:00+00:00 ― 5 min read

Computation and Language Gender Representation in French Broadcast News

This article examines gender balance in French news broadcasts across different topics.

2025-07-13T08:20:50+00:00 ― 5 min read

Computation and Language Rasa: A Breakthrough in Indian Language Speech Synthesis

Rasa dataset advances text-to-speech for Indian languages with neutral and expressive speech.

2025-07-13T05:55:05+00:00 ― 6 min read

Sound Advancements in Speech Emotion Recognition Technology

New methods improve machine understanding of human emotions in speech.

2025-07-12T18:34:55+00:00 ― 4 min read

Sound Making AI Tools Accessible for Artists

Simplifying AI tools can empower artists to enhance their creative expression.

2025-07-12T17:46:20+00:00 ― 5 min read

Sound MusiConGen: Advancing Text-to-Music Technology

MusiConGen enhances user control in text-to-music generation.

2025-07-12T16:57:45+00:00 ― 6 min read

Neurons and Cognition Advancements in EEG Technology for Speech Recovery

Researchers improve speech decoding using EEG to help those with speech impairments.

2025-07-12T16:20:33+00:00 ― 7 min read

Audio and Speech Processing Advancements in Speech Enhancement Techniques

A new model improves speech clarity by targeting noise and echoes.

2025-07-12T15:20:35+00:00 ― 6 min read

Computation and Language Introducing J-CHAT: A New Dataset for Spoken Dialogue Research

J-CHAT provides a large, open-source dataset for enhancing spoken dialogue systems.

2025-07-12T12:06:15+00:00 ― 5 min read

Audio and Speech Processing Advancements in Sample-Based Musical Instrument Creation

New methods enable musicians to create instruments from sound prompts.

2025-07-12T08:51:55+00:00 ― 5 min read

Audio and Speech Processing Speech Codecs and Emotional Preservation

Examining how codecs retain emotional tones in voice data.

2025-07-12T06:26:10+00:00 ― 5 min read

Audio and Speech Processing Transforming Broadcasting with IP Technology and Audio Tagging

Learn how IP broadcasting and audio tagging reshape content delivery.

2025-07-12T05:37:35+00:00 ― 5 min read

Human-Computer Interaction Humans and Robots Create Music Together

A look at how technology and musicians collaborate in a unique performance.

2025-07-12T03:11:50+00:00 ― 7 min read

Robotics Robotic Musician Enhances Shopping Experience

A robot plays music in a store to improve customer enjoyment.

2025-07-12T02:23:15+00:00 ― 7 min read