Computer Science - Sound

RSS

Audio and Speech Processing Advancements in Lip-to-Speech Technology

LipVoicer generates clear speech from silent videos using advanced lip-reading methods.

2025-10-27T21:27:00+00:00 ― 5 min read

Audio and Speech Processing Advancing Dysarthric Speech Recognition with Innovative Approaches

New methods aim to improve communication for individuals with dysarthria.

2025-10-27T21:01:09+00:00 ― 6 min read

Computation and Language Advancing Predictions with Multiple Scores in Gaussian Processes

New method improves predictions by considering multiple expert scores.

2025-10-27T19:49:50+00:00 ― 6 min read

Computation and Language Assessing Whisper's Performance on Arabic Dialects

A look at how Whisper handles various Arabic dialects and accents.

2025-10-27T13:21:10+00:00 ― 5 min read

Computation and Language Video-LLaMA: A New Approach to Video Understanding

A program combining visual and audio data to enhance video comprehension.

2025-10-27T11:44:00+00:00 ― 5 min read

Computation and Language Advancing Speech Act Recognition in Bengali

A new method improves speech act recognition in Bengali using audio and text analysis.

2025-10-27T10:55:25+00:00 ― 5 min read

Sound Analyzing Music with BERT: A New Approach

Research explores BERT's potential in bar-level music analysis.

2025-10-27T07:41:05+00:00 ― 5 min read

Computers and Society Engaging Math Learning for Young Children

A new system enhances math learning at home through fun interactions.

2025-10-27T07:08:49+00:00 ― 6 min read

Computation and Language Efficient Speech Recognition Adaptation Using Text Data

A new method enhances speech recognition models using only text data for adaptation.

2025-10-27T06:52:30+00:00 ― 5 min read

Sound Advancing Melody Harmonization with Emotional Context

A new model improves melody harmonization by considering emotional factors.

2025-10-26T21:58:05+00:00 ― 6 min read

Machine Learning Innovative Dance Creation Using Sound Words

New methods use onomatopoeia to inspire unique dance movements.

2025-10-26T20:20:55+00:00 ― 5 min read

Sound Advancements in Speech Countermeasure Systems

Researchers improve detection of machine-generated speech using phase information adjustments.

2025-10-26T17:55:10+00:00 ― 6 min read

Computation and Language Advancements in Language Identification with LASR Framework

A new approach improves speech language identification using self-supervised learning and labels.

2025-10-26T08:12:10+00:00 ― 6 min read

Sound Improving Arabic Dysarthric Speech Recognition

A new method enhances speech recognition for dysarthric Arabic speakers.

2025-10-26T07:23:35+00:00 ― 5 min read

Computation and Language Innovative Speech Recognition Tool for Low-Resource Languages

Allophant enhances phoneme recognition for languages with limited data.

2025-10-26T06:35:00+00:00 ― 5 min read

Sound SANGEET: A Structured Dataset for Hindustani Music

Introducing SANGEET, a detailed dataset on Hindustani Classical Music.

2025-10-26T04:57:50+00:00 ― 4 min read

Sound Addressing the Challenge of Fake Audio Detection

A new method aims to improve fake audio detection without losing past knowledge.

2025-10-25T16:00:30+00:00 ― 6 min read

Audio and Speech Processing Advancements in Unsupervised Speech Recognition

A new framework enhances the study of unsupervised speech recognition systems.

2025-10-25T13:34:45+00:00 ― 6 min read

Sound Creating Melodies from Simple Beats

This project helps anyone compose music using basic beats and advanced computer methods.

2025-10-25T11:57:35+00:00 ― 5 min read

Computation and Language Advancements in Self-Supervised Learning for Speech Recognition

Self-supervised models reveal insights into phonetic and phonemic distinctions in speech.

2025-10-25T10:20:25+00:00 ― 5 min read

Computation and Language Enhancing Police Accountability with Speech Recognition Technology

Research explores the use of speech recognition in police body camera footage analysis.

2025-10-25T07:54:40+00:00 ― 6 min read

Sound New Ways Computers Create Music

A look at how computers are changing music composition.

2025-10-25T04:31:16+00:00 ― 4 min read

Audio and Speech Processing Improving Emotional Recognition and Synthesis in Speech Models

New techniques enhance emotional understanding in speech processing tasks.

2025-10-25T01:26:00+00:00 ― 6 min read

Sound LinDiff: A Leap Forward in Speech Synthesis

New model LinDiff improves speech synthesis speed and quality.

2025-10-25T00:37:25+00:00 ― 4 min read

Sound New Method Transforms Audio Compression Technology

A new approach to audio compression reduces file size without losing quality.

2025-10-24T18:57:20+00:00 ― 5 min read

Sound Enhancing Speech Clarity in Noisy Environments

Techniques to improve speech recognition amidst background noise.

2025-10-24T16:50:20+00:00 ― 5 min read

Audio and Speech Processing HiddenSinger: A New Era in Singing Voice Synthesis

HiddenSinger improves singing voice quality using advanced AI techniques.

2025-10-24T14:54:25+00:00 ― 5 min read

Sound Advancements in Electrolaryngeal Voice Conversion Technology

New methods improve speech clarity for electrolarynx users.

2025-10-24T13:17:15+00:00 ― 6 min read

Sound Innovative Advances in Electrolaryngeal Speech Technology

Researchers blend visual and sound features to improve speech for electrolarynx users.

2025-10-24T12:28:40+00:00 ― 5 min read

Audio and Speech Processing The Impact of Age on Voice Recognition Systems

A study highlights how ageing affects automatic speaker verification performance.

2025-10-24T10:02:55+00:00 ― 5 min read

Audio and Speech Processing PauseSpeech: Advancing Text-to-Speech Technology

PauseSpeech enhances TTS systems with natural-sounding speech through improved pausing.

2025-10-24T09:14:20+00:00 ― 5 min read

Multimedia A New System for Music and Video Matching

This research introduces a system for matching music to video content effectively.

2025-10-24T07:37:10+00:00 ― 6 min read

Audio and Speech Processing Enhancing Speech Recognition in Noisy Environments

New methods improve automatic speech recognition performance amid background noise.

2025-10-24T02:45:40+00:00 ― 5 min read

Audio and Speech Processing Efficient Management of Large Speech Models

A new method optimizes speech models for better performance with fewer resources.

2025-10-23T21:54:10+00:00 ― 5 min read

Audio and Speech Processing New Method for Objective Spatial Audio Evaluation

A fresh approach improves how we assess spatial audio quality.

2025-10-23T19:28:25+00:00 ― 5 min read

Sound Identifying Read vs. Spontaneous Speech in Interviews

A study on how to tell apart read and spontaneous speech.

2025-10-23T18:39:50+00:00 ― 6 min read

Audio and Speech Processing StyleTTS 2: Advancing Text-to-Speech Technology

A new model enhances the realism of synthetic speech.

2025-10-23T15:25:30+00:00 ― 8 min read

Audio and Speech Processing Advancements in Sound Source Tracking with PI-RNN

A new model improves accuracy and efficiency in tracking sound sources.

2025-10-23T10:34:00+00:00 ― 5 min read

Computation and Language Introducing the ITALIC Dataset for Spoken Italian

A new dataset enhances spoken language understanding for Italian.

2025-10-23T08:56:50+00:00 ― 6 min read

Audio and Speech Processing Advances in Bilingual and Code-Switched ASR Models

New methods improve multilingual speech recognition using existing data sources.

2025-10-23T04:05:20+00:00 ― 6 min read