Computer Science - Sound

RSS

Sound ElasticAST: A Flexible Approach to Audio Classification

ElasticAST allows processing of variable length audio efficiently without losing important details.

2025-07-18T02:31:05+00:00 ― 5 min read

Sound Cloning Voices: A New Challenge for Music Rights

Analyzing singer identification methods amidst growing voice cloning concerns.

2025-07-18T01:42:30+00:00 ― 5 min read

Sound New Method for Detecting Partially Fake Audio

A novel approach improves detection of mixed real and fake audio clips.

2025-07-17T17:36:40+00:00 ― 6 min read

Audio and Speech Processing Evaluating Mamba's Efficiency in Speech Technology

Mamba shows promise against transformers in speech tasks, especially for long inputs.

2025-07-17T13:33:45+00:00 ― 4 min read

Sound Advancements in Singing Voice Synthesis with SingFlex

SingFlex offers innovative solutions for creating diverse singing voices efficiently.

2025-07-17T07:05:05+00:00 ― 5 min read

Information Theory Measuring Complexity in Irish Dance Music

A study on the complexity of Irish traditional dance tunes using compression methods.

2025-07-17T06:56:50+00:00 ― 5 min read

Sound RefinPaint: A New Approach to Music Generation

RefinPaint enhances music creation by identifying and refining weak areas effectively.

2025-07-17T06:16:30+00:00 ― 6 min read

Sound Adapting Whisper for Improved Speaker Verification

A new framework enhances speaker verification performance with limited data.

2025-07-17T00:36:25+00:00 ― 6 min read

Sound Bridging the Gap: AI and Musicians in Harmony

Exploring new ways AI can collaborate with musicians through interpretation.

2025-07-16T15:42:00+00:00 ― 5 min read

Audio and Speech Processing Advancing Audio Security with Continual Learning

CADE improves audio detection against evolving spoofing threats using continual learning techniques.

2025-07-16T10:50:30+00:00 ― 7 min read

Robotics Utilizing Sound for Object Location in Robotics

A new method helps robots find fallen objects using sound.

2025-07-16T06:47:35+00:00 ― 5 min read

Sound Advancements in Voice-Controlled Drone Systems

New voice command systems enhance drone control without the need for hands.

2025-07-16T05:42:18+00:00 ― 5 min read

Sound Advancements in Guitar Amplifier Modeling

New techniques allow for better emulation of guitar amplifiers and effects.

2025-07-15T23:30:20+00:00 ― 6 min read

Audio and Speech Processing Improving Code-Switching ASR with Knowledge Distillation

A new framework enhances ASR performance using limited data and resources.

2025-07-15T22:41:45+00:00 ― 5 min read

Sound Advancing Audio Synthesis with Diffusion Models

A new method improves audio generation efficiency using innovative attention techniques.

2025-07-15T20:16:00+00:00 ― 5 min read

Sound BandControlNet: A New Approach to Music Creation

Discover how AI is transforming music generation with BandControlNet.

2025-07-15T19:27:25+00:00 ― 5 min read

Computer Vision and Pattern Recognition New Method for Detecting Deepfakes

A novel approach improves deepfake detection using audio-visual analysis.

2025-07-15T12:10:10+00:00 ― 5 min read

Sound The Evolution of Automatic Speech Recognition Systems

A look at the progress in speech recognition technologies and methods.

2025-07-15T11:21:35+00:00 ― 5 min read

Sound Improving Stuttering Detection with MMSD-Net

A new method enhances stuttering detection by combining audio, video, and text data.

2025-07-15T07:18:40+00:00 ― 5 min read

Sound Innovative Sound Generation for 3D Human Models

A new method enhances sound creation for realistic 3D human models.

2025-07-15T00:01:25+00:00 ― 7 min read

Sound Estimating Breathing Rates Through Speech Analysis

This study reveals how speech can estimate breathing rates using advanced models.

2025-07-14T23:12:50+00:00 ― 5 min read

Sound GraphMuse: A New Tool for Music Analysis

GraphMuse streamlines the analysis of symbolic music data with advanced machine learning techniques.

2025-07-14T19:58:30+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Recognition for Polish Language

Research presents new methods for evaluating speech recognition systems in Polish.

2025-07-14T16:44:10+00:00 ― 6 min read

Audio and Speech Processing MSceneSpeech: Advancing Mandarin Speech Synthesis

A new dataset enhances machine speech for Mandarin, aiming for natural expression.

2025-07-14T09:26:55+00:00 ― 6 min read

Multimedia Advancing Sound Source Localization through Audio-Visual Integration

A study on improving sound source localization by better using audio and visual information.

2025-07-14T06:12:35+00:00 ― 7 min read

Machine Learning Assessing Cognitive Health through Speech Analysis

A new framework analyzes speech to identify mild cognitive impairment across languages.

2025-07-14T05:24:00+00:00 ― 5 min read

Sound AI and the Challenge of Diverse Music Genres

Exploring AI's impact on underrepresented music styles.

2025-07-14T02:58:15+00:00 ― 6 min read

Computation and Language Improving Text-to-Speech for Indian Languages

A method to enhance TTS systems for better pronunciation of OOV words in India.

2025-07-14T02:09:40+00:00 ― 5 min read

Sound Advances in Hearing Aid Technology Using Machine Learning

New machine learning models improve speech clarity for hearing aid users.

2025-07-13T23:43:55+00:00 ― 6 min read

Sound Studying Social Interactions with Low-Frequency Audio

Research explores low-frequency audio to protect privacy in social behavior studies.

2025-07-13T21:18:10+00:00 ― 5 min read

Audio and Speech Processing Understanding Sound Propagation in Connected Spaces

Exploring how sound behaves in multi-room environments and its implications in technology.

2025-07-13T20:29:35+00:00 ― 6 min read

Audio and Speech Processing AI Tools Transform Music Editing Process

New AI tools are simplifying music editing with innovative techniques and improved precision.

2025-07-13T18:52:25+00:00 ― 5 min read

Computation and Language A New Approach to Speech Translation: Preset-Voice Matching

Preset-Voice Matching improves speech translation while ensuring privacy and reducing risks.

2025-07-13T18:03:50+00:00 ― 6 min read

Sound Composer's Assistant 2: A New Tool for Musicians

A new system helps musicians create music with greater control and precision.

2025-07-13T14:00:55+00:00 ― 7 min read

Sound Evaluating AI's Impact on Music Originality

A new tool to assess replication in AI-made music.

2025-07-13T12:23:45+00:00 ― 7 min read

Sound Open Audio Generation: A New Model

A new text-to-audio model using only public data.

2025-07-13T11:35:10+00:00 ― 5 min read

Computation and Language Rasa: A Breakthrough in Indian Language Speech Synthesis

Rasa dataset advances text-to-speech for Indian languages with neutral and expressive speech.

2025-07-13T05:55:05+00:00 ― 6 min read

Sound Advancements in Speech Emotion Recognition Technology

New methods improve machine understanding of human emotions in speech.

2025-07-12T18:34:55+00:00 ― 4 min read

Sound Making AI Tools Accessible for Artists

Simplifying AI tools can empower artists to enhance their creative expression.

2025-07-12T17:46:20+00:00 ― 5 min read

Sound MusiConGen: Advancing Text-to-Music Technology

MusiConGen enhances user control in text-to-music generation.

2025-07-12T16:57:45+00:00 ― 6 min read