Computer Science - Sound

RSS

Sound Improving Speech Recognition with mixPGD Training

A new method enhances Automatic Speech Recognition against adversarial challenges.

2025-12-07T11:32:45+00:00 ― 5 min read

Latest Articles

Sound Advancing Emotional Text-to-Speech Technology

A new method for emotional speech synthesis while preserving speaker identity.

2025-12-06T03:09:25+00:00 ― 6 min read

Sound Fairness in Speaker Recognition Systems

Analyzing bias in voice identification technology across different demographics.

2025-12-05T23:55:05+00:00 ― 5 min read

Audio and Speech Processing Advancements in Audio Coding Techniques

A new multi-band audio coding method improves sound quality and efficiency.

2025-12-05T23:06:30+00:00 ― 5 min read

Signal Processing New Method for Detecting Language Issues in Aphasia

Brain wave tracking shows promise in assessing language problems post-stroke.

2025-12-05T19:03:35+00:00 ― 8 min read

Sound Advancing Audio Recognition with Data-Free Techniques

New framework improves audio recognition without extensive data access.

2025-12-05T17:26:25+00:00 ― 5 min read

Sound Causal Audio Transformer: Advancements in Sound Classification

A new model improves audio classification using advanced techniques.

2025-12-05T16:37:50+00:00 ― 5 min read

Audio and Speech Processing Advances in Acoustic Source Localization

Researchers are finding new ways to locate sound sources accurately.

2025-12-05T15:00:40+00:00 ― 4 min read

Audio and Speech Processing Improving Speech Clarity in Noisy Environments

A new system enhances speech signals affected by various distortions.

2025-12-05T03:40:30+00:00 ― 5 min read

Sound Reconstructing Audio Processing Graphs Using Deep Learning

A new method to estimate audio processing setups from sound inputs.

2025-12-05T02:03:20+00:00 ― 7 min read

Sound The Role of Diffusion Models in Music Creation

Discover how diffusion models are changing music generation for composers.

2025-12-04T22:49:00+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Clarity in Noisy Environments with BCMs

Learn how body-conduction microphones enhance speech quality despite background noise.

2025-12-03T18:28:35+00:00 ― 7 min read

Sound Connecting Speech and Music Through Emotion

A new system matches music to speech based on emotions without needing text.

2025-12-03T16:02:50+00:00 ― 5 min read

Sound EarCough: A New Way to Monitor Coughs

EarCough uses smart earbuds to detect user coughs accurately.

2025-12-03T15:14:15+00:00 ― 5 min read

Sound Advancements in Acoustic Event Classification Technology

A new method enhances sound recognition across diverse smart devices.

2025-12-03T11:59:55+00:00 ― 5 min read

Multimedia Advancements in Continuous Emotion Recognition

A study on improving emotion detection through multiple data sources.

2025-12-03T11:11:20+00:00 ― 5 min read

Sound Advancements in Speech Clarity through Noise Suppression Challenges

Research teams compete in improving speech quality amidst background noise.

2025-12-02T19:48:15+00:00 ― 4 min read

Computation and Language Cocktail HuBERT: Advancing Speech Recognition

A new model that improves speech recognition in multi-speaker settings.

2025-12-02T14:56:45+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Recognition Technology

New methods improve speech recognition accuracy and efficiency.

2025-12-02T10:53:50+00:00 ― 5 min read

Computation and Language Advancements in Unsupervised Speech Recognition

Recent methods improve speech recognition without relying on labeled data.

2025-12-01T19:30:45+00:00 ― 5 min read

Sound LMCodec: A New Frontier in Speech Coding

LMCodec compresses audio effectively while preserving quality for clear communication.

2025-12-01T18:42:10+00:00 ― 5 min read

Audio and Speech Processing Advancing Speech Recognition with Self-Supervised Learning

This article highlights how self-supervised learning helps improve speech recognition systems.

2025-12-01T17:53:35+00:00 ― 5 min read

Multimedia Introducing AIOZ-GDANCE: A New Dataset for Group Dance Generation

AIOZ-GDANCE promotes research in creating group dance movements based on music.

2025-12-01T11:24:55+00:00 ― 5 min read

Machine Learning New Insights into Sperm Whale Communication

This study reveals patterns in sperm whale sounds and their potential meanings.

2025-12-01T05:12:16+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advances in Locating Sounds in Videos

Research improves machines' ability to locate objects that make sounds in videos.

2025-12-01T02:30:30+00:00 ― 8 min read

Audio and Speech Processing Advancements in Sound Field Reproduction Techniques

This article examines two methods for improving sound reproduction quality.

2025-11-30T19:13:15+00:00 ― 6 min read

Sound The Art of Music Segmentation: A Closer Look

Discover how music structures enhance our listening experience.

2025-11-30T16:47:30+00:00 ― 5 min read

Sound Advancements in Music Structure Analysis

Exploring new methods for segmenting music structure and their implications.

2025-11-30T11:07:25+00:00 ― 5 min read

Computation and Language Innovative Method for Song Translation

A new approach to translating songs that aligns lyrics with melodies effectively.

2025-11-29T10:49:55+00:00 ― 7 min read

Sound Enhancing Fighting Games with Adaptive Background Music

This research looks at using adaptive music in DareFightingICE for player insights.

2025-11-29T01:06:55+00:00 ― 6 min read

Sound New Dataset Aims to Improve Lip Reading Technology

Researchers develop LIPSFUS dataset for better lip reading systems.

2025-11-28T23:29:45+00:00 ― 5 min read

Machine Learning Advancements in Speaker Verification with Unlabeled Data

This framework enhances speaker verification using unlabeled data and clustering techniques.

2025-11-28T19:26:50+00:00 ― 5 min read

Human-Computer Interaction Advancements in Wearable Emotion Recognition Systems

A new framework enhances emotion detection using self-supervised learning.

2025-11-28T14:35:20+00:00 ― 6 min read

Computer Vision and Pattern Recognition New Method for Generating Realistic Sounds from Video

This approach links video actions and sound using physics for better sound effects.

2025-11-28T12:58:10+00:00 ― 7 min read

Computer Vision and Pattern Recognition Improving Audiovisual Speech Recognition with Visual Cues

A new method boosts speech recognition using visual data with existing models.

2025-11-28T10:32:25+00:00 ― 7 min read

Artificial Intelligence A New Framework for Music Annotations

This article discusses a systematic approach to music annotation.

2025-11-28T06:29:30+00:00 ― 6 min read

Artificial Intelligence Understanding the Music Note Ontology

A structured approach to music representation and performance analysis.

2025-11-28T05:40:55+00:00 ― 5 min read

Computation and Language Advancing Bilingual Visually Grounded Speech Models

This study improves bilingual speech models using strong language support.

2025-11-28T04:03:45+00:00 ― 4 min read

Computer Vision and Pattern Recognition Creating Images from Sounds: The Sound2Scene Model

New model transforms sounds into clear images, bridging audio and visual information.

2025-11-28T03:15:10+00:00 ― 6 min read

Audio and Speech Processing New Method for Audio Captioning with Limited Data

A novel approach to generate audio captions using pre-trained language models.

2025-11-28T02:26:35+00:00 ― 6 min read

Computation and Language Modular Innovations in Speech Recognition Systems

A new approach enhances the adaptability of speech recognition technology.

2025-11-27T16:43:35+00:00 ― 4 min read