Computer Science - Sound

RSS

Computation and Language Refining Song Lyrics with a New Model

A new model transforms plain texts into fitting song lyrics.

2025-06-21T23:47:35+00:00 ― 6 min read

Computation and Language The Movement of English Vowels: Diphthongs vs. Monophthongs

This study analyzes how diphthongs and monophthongs differ in production and movement.

2025-06-21T22:10:25+00:00 ― 5 min read

Computation and Language Improving Automatic Speech Recognition with Language Models

New method enhances ASR accuracy using language models for better transcriptions.

2025-06-21T20:33:15+00:00 ― 4 min read

Sound Advancements in Speech Enhancement Techniques

Improving speech clarity through hybrid filterbanks and neural networks.

2025-06-21T17:18:55+00:00 ― 5 min read

Sound AASIST3: Advanced Solution for Voice Verification

AASIST3 improves fake voice detection in automatic speaker verification systems.

2025-06-21T16:30:20+00:00 ― 6 min read

Audio and Speech Processing Advancements in Audio Technology: Introducing X-Codec

X-Codec improves audio generation by integrating semantic understanding into processing.

2025-06-21T15:41:45+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancements in Gesture Recognition Technology

Researchers enhance gesture recognition using innovative learning techniques.

2025-06-21T12:27:25+00:00 ― 6 min read

Audio and Speech Processing Innovative Noise Control for Construction Sites

Portable system reduces construction noise, enhancing worker comfort and community well-being.

2025-06-21T11:38:50+00:00 ― 5 min read

Sound Advancements in Text-to-Music Generation Technology

New models like FluxMusic improve music creation from written text.

2025-06-21T10:50:15+00:00 ― 5 min read

Information Retrieval Advancements in Optical Music Recognition Technology

Discover how new techniques improve the conversion of music notation to digital formats.

2025-06-21T09:54:48+00:00 ― 5 min read

Audio and Speech Processing Combining Voice and Face for Better Identity Recognition

This article discusses the benefits of merging voice and facial recognition systems.

2025-06-21T08:24:30+00:00 ― 5 min read

Audio and Speech Processing Advancements in Audio-Visual Speech Recognition Technology

A new model enhances speech recognition by combining audio and visual inputs effectively.

2025-06-21T05:58:45+00:00 ― 5 min read

Sound Advancing Depression Detection Through Speech Analysis

New models improve accuracy in detecting depression via voice recordings.

2025-06-21T03:33:00+00:00 ― 6 min read

Audio and Speech Processing Advancements in Self-Supervised Learning for Speech Processing

A new method improves speech model performance across various tasks.

2025-06-21T02:44:25+00:00 ― 6 min read

Sound Advancing Keyword Spotting with Unlabeled Data

A new method improves keyword spotting accuracy using unlabeled audio data.

2025-06-21T01:55:50+00:00 ― 6 min read

Neurons and Cognition Automatic Detection of Mild Cognitive Impairment through Speech Analysis

Research shows speech analysis can aid in early detection of Mild Cognitive Impairment.

2025-06-21T01:11:33+00:00 ― 5 min read

Sound Advancements in Automatic Music Generation

A new method improves music generation by focusing on chords and representation.

2025-06-20T23:30:05+00:00 ― 6 min read

Sound New Dataset Enhances Speech Recognition Technology

Researchers create LibriheavyMix to improve speech recognition in noisy environments.

2025-06-20T22:41:30+00:00 ― 5 min read

Sound Advancements in Multi-Speaker Speech Recognition

New methods improve speech recognition in challenging multi-speaker situations.

2025-06-20T21:52:55+00:00 ― 4 min read

Signal Processing New Dataset Aims to Transform Heart Disease Diagnosis

A groundbreaking dataset enhances AI tools for diagnosing heart conditions.

2025-06-20T21:04:20+00:00 ― 7 min read

Sound VoxHakka: Preserving Taiwanese Hakka with Technology

A new system helps bring Taiwanese Hakka language back to life.

2025-06-20T16:12:50+00:00 ― 5 min read

Sound Advancements in Speech Enhancement Techniques

New methods improve speech clarity in noisy environments using advanced technologies.

2025-06-20T15:24:15+00:00 ― 5 min read

Sound Advancements in Target Speaker Extraction Technology

New methods improve voice separation in noisy environments.

2025-06-20T13:47:05+00:00 ― 5 min read

Computation and Language Enhancing TTS for Low-Resource Languages

This article explores methods for improving text-to-speech systems for underrepresented languages.

2025-06-20T10:32:45+00:00 ― 6 min read

Sound Melodies Across Cultures: A Deep Dive

This study examines how melody varies and connects across different cultures.

2025-06-20T06:00:33+00:00 ― 6 min read

Sound ConversaSynth: Advancing Synthetic Audio Conversations

A framework using large language models to create authentic audio dialogues.

2025-06-20T05:41:15+00:00 ― 6 min read

Computation and Language Advancements in Speech Tokenization: A Framework for Evaluation

A new benchmark aids in assessing speech tokenizers for better performance.

2025-06-20T00:01:10+00:00 ― 6 min read

Sound Advancing ASR Performance through Temporal Order Preservation

A new method improves automatic speech recognition by preserving sound order in knowledge transfer.

2025-06-19T19:58:15+00:00 ― 4 min read

Computation and Language Advancements in Speech Recognition for Code-Switching

A new model improves speech recognition in multilingual conversations.

2025-06-19T16:43:55+00:00 ― 5 min read

Sound Evaluating Large Language Models in Musicology

This study examines the effectiveness of LLMs in musicology and their reliability.

2025-06-19T15:55:20+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Recognition with Noise-Augmented Training

This study examines how noise can enhance speech recognition resilience against challenges.

2025-06-19T14:18:10+00:00 ― 5 min read

Audio and Speech Processing Improving Sound Direction Estimation with Extra Microphone

Discover how an additional microphone enhances sound direction detection in noisy environments.

2025-06-19T12:41:00+00:00 ― 5 min read

Sound Advancements in One-Shot Voice Conversion Technology

A new method improves voice conversion using fewer samples.

2025-06-19T11:03:50+00:00 ― 5 min read

Computation and Language Advancements in Lightweight Speech Recognition Models

Innovative lightweight transducer enhances speech recognition efficiency and accuracy.

2025-06-19T07:00:55+00:00 ― 6 min read

Sound New Method for Efficient Speech Generation

A novel system generates speech from text using minimal data.

2025-06-19T04:27:24+00:00 ― 4 min read

Sound Advancing Symbolic Music Generation with Audio Data

New methods improve music creation through audio analysis and user control.

2025-06-19T01:20:50+00:00 ― 6 min read

Sound Watermarking in Audio Generative Models: A New Approach

New watermarking methods protect creators in audio generative models.

2025-06-18T23:43:40+00:00 ― 4 min read

Audio and Speech Processing Advancements in Speech Synthesis Using DDSP

Discover how DDSP improves speech synthesis efficiency and quality.

2025-06-18T17:15:00+00:00 ― 6 min read

Sound Advancements in Speech Emotion Recognition Systems

This study enhances SER through improved preprocessing and efficient attention models.

2025-06-18T12:23:30+00:00 ― 4 min read

Sound Dynamic Background Music Generation for Interactive Media

A framework for real-time music adjustment in games and films.

2025-06-18T10:46:20+00:00 ― 5 min read