Computer Science - Sound

RSS

Multimedia The New Age of Lie Detection

Researchers combine audio and visual cues to detect lies more accurately.

2025-05-29T11:09:31+00:00 ― 6 min read

Human-Computer Interaction Innovative Communication System for Disaster Response

A new voice-based network bridges language gaps in emergencies.

2025-05-29T09:49:20+00:00 ― 6 min read

Audio and Speech Processing Advancements in Device-Directed Speech Detection

Learn how virtual assistants understand user commands better.

2025-05-29T05:48:47+00:00 ― 6 min read

Sound Revolutionizing Audio Captioning with MACE

MACE improves audio captioning by linking sounds to accurate text descriptions.

2025-05-28T17:47:08+00:00 ― 5 min read

Sound Predicting Song Cover Success with Machine Learning

Using machine learning to forecast audience reaction to song covers.

2025-05-28T15:06:46+00:00 ― 7 min read

Sound Improving Audio Classification with ADD Loss

A new approach to enhance classification through Angular Distance Distribution Loss.

2025-05-28T13:46:35+00:00 ― 6 min read

Computation and Language Advancements in Speech Recognition for People with Disabilities

New methods improve communication tools for individuals with speech difficulties.

2025-05-28T11:06:13+00:00 ― 7 min read

Sound Estimating Human Poses Using Sound Waves

Researchers use sound waves to estimate human poses without cameras.

2025-05-27T23:13:12+00:00 ― 8 min read

Audio and Speech Processing Improving Sound Detection in Noisy Environments

New methods using language models enhance sound detection amidst background noise.

2025-05-27T03:01:49+00:00 ― 6 min read

Sound Fish-Speech: A New Era in Text-to-Speech

Fish-Speech enhances voice technology for a more natural communication experience.

2025-05-27T01:41:38+00:00 ― 6 min read

Sound EmoSphere++: A New Era in Emotional Machines

EmoSphere++ enables machines to express emotions like humans, enhancing interactions.

2025-05-26T05:38:53+00:00 ― 7 min read

Sound New Method for Underwater Boundary Estimation

U-COTANS improves underwater boundary detection using deep learning techniques.

2025-05-26T02:58:31+00:00 ― 6 min read

Sound Introducing PIAST: A New Dataset for Piano Music Research

PIAST offers a unique collection of piano music for researchers.

2025-05-26T01:38:20+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancing Technology with 3D Audio-Visual Segmentation

Machines learn to connect sound and visuals in 3D spaces.

2025-05-25T21:37:47+00:00 ― 7 min read

Audio and Speech Processing The Evolution of Speaker Diarization

How new methods are transforming speaker identification in audio recordings.

2025-05-25T18:57:25+00:00 ― 6 min read

Sound The Soul of Ghanaian Seperewa Music

A look into the traditional sounds of the seperewa harp-lute.

2025-05-25T06:50:24+00:00 ― 6 min read

Sound Target Speaker Extraction: Enhancing Clarity in Noisy Settings

Learn how TSE improves speech recognition in crowded environments using text cues.

2025-05-25T00:14:51+00:00 ― 6 min read

Sound Innovative Audio System Enhances Construction Site Safety

A new system detects screams to improve worker safety on construction sites.

2025-05-24T22:54:40+00:00 ― 7 min read

Sound Advancements in Speaker Emotion Recognition Technology

Exploring new methods for recognizing emotions in speech using advanced models.

2025-05-24T20:14:18+00:00 ― 7 min read

Sound The Concatenator: A New Way to Create Music

A fresh system for merging audio samples to help music creators innovate easily.

2025-05-24T05:32:17+00:00 ― 6 min read

Sound Dynamic Range Compression: Improving Sound Quality

A look at how dynamic range compression enhances audio experiences.

2025-05-24T04:12:06+00:00 ― 6 min read

Audio and Speech Processing Using Voice Assistants to Detect Mild Cognitive Impairment

Voice assistants help identify early signs of memory issues in older adults.

2025-05-24T01:31:44+00:00 ― 7 min read

Sound Dynamic Music Generation for Tabletop RPGs

A system creates real-time music based on tabletop role-playing game narratives.

2025-05-23T16:10:27+00:00 ― 7 min read

Computation and Language SLAM-ASR: A Look at Speech Recognition's Potential

Examining SLAM-ASR's strengths, weaknesses, and future in speech recognition.

2025-05-23T14:50:16+00:00 ― 5 min read

Signal Processing Clearing Up Sound: The SoundSil-DS Method

A new method to clarify and visualize sound-field images.

2025-05-23T13:48:54+00:00 ― 7 min read

Computation and Language Innovating Speech Recognition for Malasar Language

A project improves speech recognition for the Malasar language using Tamil resources.

2025-05-23T02:48:37+00:00 ― 5 min read

Sound Acoustic Volume Rendering: A Leap in Sound Realism

Discover how sound enhances virtual experiences through acoustic volume rendering.

2025-05-21T22:44:46+00:00 ― 7 min read

Machine Learning Listening to Machines: A New Diagnostic Approach

This study uses sound analysis to identify machine faults effectively.

2025-05-21T21:24:35+00:00 ― 5 min read

Audio and Speech Processing Advances in Sound Event Localization and Detection

A new model improves identifying and locating sounds effectively.

2025-05-21T08:02:45+00:00 ― 7 min read

Sound AuscultaBase: Transforming Body Sound Diagnostics

AuscultaBase enhances accuracy in diagnosing health conditions using diverse body sound data.

2025-05-20T22:41:28+00:00 ― 4 min read

Sound Introducing ArPA: A New Tool for Kids' Pronunciation

ArPA helps Arabic-speaking kids improve their pronunciation through interactive activities.

2025-05-20T21:34:12+00:00 ― 5 min read

Sound Creating a Conversational Music Retrieval System

A new dataset helps find music through friendly dialogue.

2025-05-20T18:40:55+00:00 ― 7 min read

Sound Aligning Audio with Sheet Music: A New Approach

Combining audio recordings with sheet music for better practice.

2025-05-20T17:20:44+00:00 ― 6 min read

Audio and Speech Processing AEROMamba: The Future of Audio Quality

AEROMamba enhances low-quality audio into rich, high-fidelity sound.

2025-05-20T13:20:11+00:00 ― 5 min read

Sound New Tool Transforms Animal Sound Research

A groundbreaking audio-language model aids in studying animal sounds and behaviors.

2025-05-20T09:02:58+00:00 ― 7 min read

Computation and Language Building a Chatbot for Taiwanese Mandarin Conversations

Creating an AI model for natural conversations in Taiwanese Mandarin.

2025-05-20T03:51:26+00:00 ― 5 min read

Sound Mamba: Advancing Speech Recognition Technology

Mamba enhances speech recognition with speed and accuracy, reshaping interaction with devices.

2025-05-19T22:39:54+00:00 ― 4 min read

Sound Using Visual Cues to Clear Up Speech in Noise

New method enhances speech clarity using visual information from surroundings.

2025-05-18T20:42:14+00:00 ― 5 min read

Computer Vision and Pattern Recognition The Rise of Deepfakes and Their Impact

Exploring the challenges and implications of deepfake technology in today’s media landscape.

2025-05-18T12:54:56+00:00 ― 6 min read

Sound Brain Waves: A New Way to Communicate

Research reveals how brain waves can aid silent communication.

2025-05-15T01:50:24+00:00 ― 6 min read