Computer Science - Sound

Sound Converting Mono Audio to Immersive Stereo

A new method transforms mono signals into engaging stereo experiences.

2025-10-17T01:31:45+00:00 ― 5 min read

Computation and Language Advancing Emotion Recognition Across Age and Languages

A study on improving emotion detection in speech for diverse groups.

2025-10-16T23:06:00+00:00 ― 5 min read

Multimedia Revolutionizing Infant Sleep Monitoring with LittleBeats

A study uses a multimodal device to track infant sleep patterns more accurately.

2025-10-16T17:25:55+00:00 ― 4 min read

Computation and Language Introducing 3D-Speaker: A New Resource for Speech Research

3D-Speaker provides a vast collection of audio recordings for advanced speech analysis.

2025-10-16T16:37:20+00:00 ― 5 min read

Audio and Speech Processing Advancing Text-to-Speech: GenerTTS Model Explained

GenerTTS enhances text-to-speech technology for cross-lingual applications.

2025-10-16T15:48:45+00:00 ― 5 min read

Sound Addressing the Challenge of Audio Deepfakes

A new system enhances detection of manipulated audio through innovative techniques.

2025-10-16T15:00:10+00:00 ― 5 min read

Sound Advancements in Multi-Talker Speech Recognition

Improving speech recognition for overlapping voices enhances usability in various settings.

2025-10-16T11:45:50+00:00 ― 5 min read

Sound Improving Speaker Extraction Techniques

New methods enhance voice separation in mixed audio environments.

2025-10-16T10:57:15+00:00 ― 5 min read

Sound Advancements in Target Speaker Extraction Technology

Learn how new techniques improve speech clarity in noisy environments.

2025-10-16T10:08:40+00:00 ― 5 min read

Sound UnitSpeech: Personalizing Text-to-Speech with Minimal Data

A new method for making voice synthesis more personal using less speech data.

2025-10-16T06:54:20+00:00 ― 5 min read

Sound Advancements in Audio Processing with Graph Neural Networks

New methods improve sound localization using distributed microphone arrays.

2025-10-16T06:05:45+00:00 ― 5 min read

Audio and Speech Processing Balancing Privacy and Utility in Speech Analysis

This study examines methods to protect privacy while analyzing spoken conversations.

2025-10-16T05:17:10+00:00 ― 5 min read

Sound New Vulnerabilities in Speaker Recognition Systems

Recent backdoor attacks expose risks in voice identification technologies.

2025-10-16T02:51:25+00:00 ― 7 min read

Sound Advancing Voice Isolation Technology

A new model improves speech extraction from noisy backgrounds using deep learning.

2025-10-16T02:02:50+00:00 ― 5 min read

Audio and Speech Processing Introducing GOLF: A New Era in Singing Voice Synthesis

GOLF offers a fresh approach to creating human-like singing using fewer resources.

2025-10-15T16:19:50+00:00 ― 6 min read

Sound Advancements in Voice-Based Age and Gender Prediction

Research on predicting age and gender from voice data using innovative models.

2025-10-15T13:54:05+00:00 ― 4 min read

Sound Analyzing Music with Dependency Trees

A fresh method for understanding musical relationships through dependency trees.

2025-10-15T13:05:30+00:00 ― 6 min read

Computation and Language Improving Speech Recognition with Long-Context Models

This article discusses new models that enhance speech recognition accuracy by considering longer context.

2025-10-15T12:16:55+00:00 ― 5 min read

Computation and Language Introducing LyricWhiz: Transforming Lyric Transcription

LyricWhiz combines advanced models to improve lyric transcription accuracy across languages.

2025-10-15T09:51:10+00:00 ― 5 min read

Sound Classifying African Birdcalls Through Audio Analysis

A study on using sound recordings to identify different bird species in Africa.

2025-10-15T09:02:35+00:00 ― 6 min read

Information Retrieval How Music Recommendations Use Data Analysis

Learn how recommendation systems suggest songs based on user preferences.

2025-10-15T02:33:55+00:00 ― 5 min read

Machine Learning Addressing Dataset Imbalance in Audio Classification

This article discusses challenges and techniques for managing dataset imbalance in audio classification.

2025-10-15T00:08:10+00:00 ― 6 min read

Computation and Language Advancing Speech Recognition for Low-Resource Languages

A new approach improves speech recognition for Romanian using lateral inhibition.

2025-10-14T23:19:35+00:00 ― 5 min read

Audio and Speech Processing Advancing Gender Privacy in Audio: New Insights

Research highlights methods to protect gender privacy in spoken audio.

2025-10-14T21:42:25+00:00 ― 5 min read

Sound Understanding Emotions in Speech Recognition

A look into capturing emotions behind spoken words more accurately.

2025-10-14T16:02:20+00:00 ― 5 min read

Sound Advancing Music Classification with Audio Embeddings

Using pre-trained audio embeddings leads to better music classification models.

2025-10-14T13:36:35+00:00 ― 7 min read

Sound Advancements in Lip-to-Speech Synthesis Technology

New framework enhances speech clarity from silent videos through improved processing.

2025-10-13T19:47:45+00:00 ― 6 min read

Sound The Science Behind the Mridangam: A Unique Instrument

Discover the blend of art and science in studying the mridangam.

2025-10-13T17:19:15+00:00 ― 8 min read

Computation and Language Advancing Speech Recognition for Low-Resource Languages

A new method improves custom word recognition in ASR systems for languages with limited data.

2025-10-13T13:19:05+00:00 ― 5 min read

Sound Advancements in Fake Audio Detection Using Conformer Models

Researchers develop a Conformer model to improve fake audio detection.

2025-10-13T03:36:05+00:00 ― 5 min read

Audio and Speech Processing Advancing Alzheimer's Detection through Speech Analysis

New methods improve early detection of Alzheimer's using speech and audio analysis.

2025-10-12T19:30:15+00:00 ― 7 min read

Audio and Speech Processing New Database Reveals Insights into Musical Instrument Sounds

Explore sound data from 41 musical instruments with detailed recordings.

2025-10-12T15:27:20+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Recognition for Dysarthria

New technologies improve communication for individuals with speech disorders.

2025-10-12T13:01:35+00:00 ― 6 min read

Computation and Language Advancements in Real-Time Speech Processing Technology

A new system combines transcription and translation for better communication.

2025-10-12T11:24:25+00:00 ― 4 min read

Sound Advancements in Speech Recognition with Whisper-AT

Whisper-AT combines speech recognition and audio tagging for improved performance.

2025-10-12T08:10:05+00:00 ― 5 min read

Audio and Speech Processing Integrating Speech with Language Models: The Speech-LLaMA Method

A new approach that combines speech with language models for improved translation.

2025-10-11T18:24:10+00:00 ― 4 min read

Sound Advancements in Automatic Piano Transcription

New method improves accuracy in turning piano audio into sheet music.

2025-10-11T14:21:15+00:00 ― 4 min read

Sound Advancements in Articulatory Speech Synthesis

A study on improving vocal sound reproduction through advanced synthesis techniques.

2025-10-11T02:12:30+00:00 ― 5 min read

Sound Introducing VampNet: A New Approach to Music Creation

VampNet transforms music processing through innovative token modeling techniques.

2025-10-11T01:23:55+00:00 ― 4 min read

Sound EchoVest: A New Hope for Hearing Impairment

Affordable wearable technology for individuals with hearing loss.

2025-10-10T23:46:45+00:00 ― 5 min read