Computer Science - Sound

Sound EarCough: A New Way to Monitor Coughs

EarCough uses smart earbuds to detect user coughs accurately.

2025-12-03T15:14:15+00:00 ― 5 min read

Sound Advancements in Acoustic Event Classification Technology

A new method enhances sound recognition across diverse smart devices.

2025-12-03T11:59:55+00:00 ― 5 min read

Multimedia Advancements in Continuous Emotion Recognition

A study on improving emotion detection through multiple data sources.

2025-12-03T11:11:20+00:00 ― 5 min read

Sound Advancements in Speech Clarity through Noise Suppression Challenges

Research teams compete in improving speech quality amidst background noise.

2025-12-02T19:48:15+00:00 ― 4 min read

Computation and Language Cocktail HuBERT: Advancing Speech Recognition

A new model that improves speech recognition in multi-speaker settings.

2025-12-02T14:56:45+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Recognition Technology

New methods improve speech recognition accuracy and efficiency.

2025-12-02T10:53:50+00:00 ― 5 min read

Computation and Language Advancements in Unsupervised Speech Recognition

Recent methods improve speech recognition without relying on labeled data.

2025-12-01T19:30:45+00:00 ― 5 min read

Sound LMCodec: A New Frontier in Speech Coding

LMCodec compresses audio effectively while preserving quality for clear communication.

2025-12-01T18:42:10+00:00 ― 5 min read

Audio and Speech Processing Advancing Speech Recognition with Self-Supervised Learning

This article highlights how self-supervised learning helps improve speech recognition systems.

2025-12-01T17:53:35+00:00 ― 5 min read

Multimedia Introducing AIOZ-GDANCE: A New Dataset for Group Dance Generation

AIOZ-GDANCE promotes research in creating group dance movements based on music.

2025-12-01T11:24:55+00:00 ― 5 min read

Machine Learning New Insights into Sperm Whale Communication

This study reveals patterns in sperm whale sounds and their potential meanings.

2025-12-01T05:12:16+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advances in Locating Sounds in Videos

Research improves machines' ability to locate objects that make sounds in videos.

2025-12-01T02:30:30+00:00 ― 8 min read

Audio and Speech Processing Advancements in Sound Field Reproduction Techniques

This article examines two methods for improving sound reproduction quality.

2025-11-30T19:13:15+00:00 ― 6 min read

Sound The Art of Music Segmentation: A Closer Look

Discover how music structures enhance our listening experience.

2025-11-30T16:47:30+00:00 ― 5 min read

Sound Advancements in Music Structure Analysis

Exploring new methods for segmenting music structure and their implications.

2025-11-30T11:07:25+00:00 ― 5 min read

Computation and Language Innovative Method for Song Translation

A new approach to translating songs that aligns lyrics with melodies effectively.

2025-11-29T10:49:55+00:00 ― 7 min read

Sound Enhancing Fighting Games with Adaptive Background Music

This research looks at using adaptive music in DareFightingICE for player insights.

2025-11-29T01:06:55+00:00 ― 6 min read

Sound New Dataset Aims to Improve Lip Reading Technology

Researchers develop the LIPSFUS dataset for better lip reading systems.

2025-11-28T23:29:45+00:00 ― 5 min read

Machine Learning Advancements in Speaker Verification with Unlabeled Data

This framework enhances speaker verification using unlabeled data and clustering techniques.

2025-11-28T19:26:50+00:00 ― 5 min read

Human-Computer Interaction Advancements in Wearable Emotion Recognition Systems

A new framework enhances emotion detection using self-supervised learning.

2025-11-28T14:35:20+00:00 ― 6 min read

Computer Vision and Pattern Recognition New Method for Generating Realistic Sounds from Video

This approach links video actions and sound using physics for better sound effects.

2025-11-28T12:58:10+00:00 ― 7 min read

Computer Vision and Pattern Recognition Improving Audiovisual Speech Recognition with Visual Cues

A new method boosts speech recognition using visual data with existing models.

2025-11-28T10:32:25+00:00 ― 7 min read

Artificial Intelligence A New Framework for Music Annotations

This article discusses a systematic approach to music annotation.

2025-11-28T06:29:30+00:00 ― 6 min read

Artificial Intelligence Understanding the Music Note Ontology

A structured approach to music representation and performance analysis.

2025-11-28T05:40:55+00:00 ― 5 min read

Computation and Language Advancing Bilingual Visually Grounded Speech Models

This study improves bilingual speech models using strong language support.

2025-11-28T04:03:45+00:00 ― 4 min read

Computer Vision and Pattern Recognition Creating Images from Sounds: The Sound2Scene Model

New model transforms sounds into clear images, bridging audio and visual information.

2025-11-28T03:15:10+00:00 ― 6 min read

Audio and Speech Processing New Method for Audio Captioning with Limited Data

A novel approach to generate audio captions using pre-trained language models.

2025-11-28T02:26:35+00:00 ― 6 min read

Computation and Language Modular Innovations in Speech Recognition Systems

A new approach enhances the adaptability of speech recognition technology.

2025-11-27T16:43:35+00:00 ― 4 min read

Computation and Language New Approaches in Speech Recognition Technology

A look at advancements in speech recognition models for efficiency and accuracy.

2025-11-27T15:55:00+00:00 ― 5 min read

Computation and Language New Method for Evaluating Speech Recognition Systems

A novel approach to measure speech recognition performance without manual transcription.

2025-11-26T22:06:10+00:00 ― 5 min read

Computation and Language Voice Anonymization in COVID-19 Diagnostics: Balancing Privacy and Accuracy

Examining how voice anonymization affects COVID-19 diagnostic systems and user privacy.

2025-11-26T01:03:00+00:00 ― 7 min read

Human-Computer Interaction Revolutionizing Drumming: The Air Drumming System

Experience drumming with just two sticks and a smartphone, no heavy equipment needed.

2025-11-25T21:48:40+00:00 ― 5 min read

Human-Computer Interaction How AI is Shaping Music Mixing

AI tools simplify mixing, offering new options for amateurs and professionals alike.

2025-11-24T20:42:35+00:00 ― 7 min read

Sound Bubbles in Water: A New Sound Frontier

Bubbles may hold the key to innovative music generation.

2025-11-24T18:42:00+00:00 ― 7 min read

Sound Advancements in Automated Audio Captioning

A look at new methods improving audio captioning for better accessibility.

2025-11-24T10:11:00+00:00 ― 5 min read

Computers and Society Voice Biometrics: Datasets, Bias, and Privacy Challenges

Analyzing dataset use in voice biometrics reveals significant bias and privacy concerns.

2025-11-24T06:56:40+00:00 ― 6 min read

Audio and Speech Processing Improving Speaker Verification with Margin-Mixup

A new method enhances speaker verification systems for overlapping voices.

2025-11-24T01:16:35+00:00 ― 5 min read

Machine Learning New Techniques for Speech Processing

Innovative methods for effective speech segment representation in processing tasks.

2025-11-23T14:45:00+00:00 ― 6 min read

Audio and Speech Processing Improving Speech Synthesis with Pause Prediction

Enhancing TTS systems for better storytelling through effective pause placement.

2025-11-23T09:53:30+00:00 ― 4 min read

Sound AffectMachine-Classical: A New Way to Create Emotional Music

AffectMachine-Classical generates real-time classical music to help manage emotions.

2025-11-23T06:39:10+00:00 ― 6 min read