Computer Science - Sound

RSS

Computation and Language Wav2Vec2.0 and the Sound of Speech Recognition

This article discusses how Wav2Vec2.0 processes speech sounds using phonology.

2025-07-23T05:35:45+00:00 ― 5 min read

Computation and Language Advancements in Multilingual Speaker Anonymization

Improving speaker anonymization technology for nine languages to ensure privacy.

2025-07-23T03:58:35+00:00 ― 5 min read

Quantitative Methods Digital Aquaculture: The Future of Fish Farming

Exploring technology's role in enhancing fish farming efficiency and welfare.

2025-07-23T03:15:54+00:00 ― 5 min read

Sound New Method for Early Dementia Detection via Voice Analysis

A novel approach combines voice analysis with privacy protection for dementia detection.

2025-07-22T19:04:10+00:00 ― 6 min read

Sound Advancing Automated Animal Sound Classification

New methods improve accuracy in identifying animal sounds for wildlife monitoring.

2025-07-22T18:15:35+00:00 ― 4 min read

Sound Advancements in Multi-Talker Speech Recognition

A new method improves accuracy in recognizing speech from multiple speakers.

2025-07-22T10:58:20+00:00 ― 5 min read

Sound Advancements in Speech Synthesis Using Acoustic BPE

Acoustic BPE improves speech intelligibility and quality in TTS systems.

2025-07-22T08:32:35+00:00 ― 6 min read

Sound Advancements in Speech Enhancement Technology

A new method improves speech clarity in noisy environments using dual neural networks.

2025-07-22T06:55:25+00:00 ― 5 min read

Computation and Language Advancing Speech Recognition with Accent-Specific Codebooks

New method improves ASR systems' handling of various accents through specialized codebooks.

2025-07-22T04:29:40+00:00 ― 5 min read

Computation and Language Advancements in Automatic Speech Recognition Technology

New methods improve accuracy and efficiency in speech recognition systems.

2025-07-22T03:41:05+00:00 ― 6 min read

Audio and Speech Processing Advancing Sound Source Localization with DOA-PNN

A new method improves sound localization in varied environments by focusing on continuous learning.

2025-07-22T02:03:55+00:00 ― 6 min read

Audio and Speech Processing Advancements in Sound Event Detection with UCIL

A new method enhances sound event detection by integrating new audio classes effectively.

2025-07-22T01:15:20+00:00 ― 6 min read

Audio and Speech Processing Advancing Sound Event Detection with WildDESED Dataset

WildDESED improves sound detection systems in noisy home environments.

2025-07-22T00:26:45+00:00 ― 6 min read

Neurons and Cognition Exploring How Music Affects the Brain

A study reveals how different music genres activate distinct brain areas.

2025-07-21T22:25:24+00:00 ― 5 min read

Audio and Speech Processing Guidelines for NeurIPS 2024 Paper Submissions

Essential rules for submitting papers to NeurIPS 2024.

2025-07-21T22:01:00+00:00 ― 4 min read

Hardware Architecture Improving MUSIC Efficiency through Approximate Computing

This article discusses enhancing MUSIC with approximate computing for better performance.

2025-07-21T16:20:55+00:00 ― 6 min read

Audio and Speech Processing YourMT3+: Advancements in Music Transcription Technology

A new system improves multi-instrument music transcription accuracy and efficiency.

2025-07-21T15:32:20+00:00 ― 5 min read

Audio and Speech Processing Seed-ASR: Advancing Speech Recognition Technology

A new model improves accuracy in speech-to-text capabilities across multiple languages.

2025-07-21T14:43:45+00:00 ― 5 min read

Sound Improving Speech Quality Monitoring on Devices

Advancements in predicting speech quality using efficient methods for mobile devices.

2025-07-21T13:55:10+00:00 ― 5 min read

Sound Harnessing Timbre in Music Production with Synthesizers

A method to enhance timbre in music production through synthesizers.

2025-07-21T13:06:35+00:00 ― 6 min read

Computation and Language Advancing Speech Technology for Tunisian Arabic

This study evaluates speech technology in low-resource languages like Tunisian Arabic.

2025-07-21T12:18:00+00:00 ― 5 min read

Sound Vulnerability in Speech Recognition Systems Exposed

Research reveals risks in multi-task speech models like Whisper.

2025-07-21T09:52:15+00:00 ― 5 min read

Computation and Language TokenVerse: Streamlining Conversation Analysis

TokenVerse simplifies the analysis of spoken conversations by integrating multiple tasks into a single model.

2025-07-21T08:15:05+00:00 ― 6 min read

Sound Advancing Audio Generation with Sound-VECaps Dataset

New dataset improves audio generation from detailed text descriptions.

2025-07-21T07:26:30+00:00 ― 4 min read

Sound Bridging Art and AI: New Interaction Methods

A fresh approach for artists to connect creativity with AI audio generation.

2025-07-21T06:37:55+00:00 ― 6 min read

Sound The Rise of Text-to-Music Models in Music Creation

Exploring the impact of TTM models on music creation and user experiences.

2025-07-21T05:49:20+00:00 ― 6 min read

Computation and Language Evaluating Online Speaker Diarization Systems

This article examines the latency of various speaker diarization systems in audio processing.

2025-07-21T04:12:10+00:00 ― 6 min read

Computation and Language LearnerVoice: Advancing Voice Recognition for Language Learners

New dataset aims to improve voice recognition for non-native English speakers.

2025-07-21T02:35:00+00:00 ― 6 min read

Computation and Language Advancing Emotion Recognition in Conversations

A new framework, BiosERC, improves emotion recognition by considering speaker traits.

2025-07-21T01:46:25+00:00 ― 6 min read

Audio and Speech Processing Understanding Voice Likability in Technology Design

This study examines how voice preferences vary among different listeners.

2025-07-21T00:57:50+00:00 ― 4 min read

Computer Vision and Pattern Recognition New Method for Creating Sound from Video and Text

This article presents a method to generate accurate sound from videos and text.

2025-07-20T16:03:25+00:00 ― 7 min read

Audio and Speech Processing Advancements in String Sound Synthesis

A new model enhances the simulation of string instruments for realistic sound.

2025-07-20T15:14:50+00:00 ― 6 min read

Audio and Speech Processing A New Way to Edit Speech Sounds

Introducing a method for better control in speech editing.

2025-07-20T12:49:05+00:00 ― 5 min read

Sound Recognizing Music Eras Through Audio and Artist Data

A study on classifying music by its era using audio features and artist insights.

2025-07-20T10:23:20+00:00 ― 6 min read

Sound New Framework for Analyzing Animal Sounds

A new model enhances the study of animal communication using raw audio data.

2025-07-20T10:15:44+00:00 ― 5 min read

Signal Processing Advancements in Signal Processing with Spiking Neural Networks

A new system improves signal processing efficiency through innovative encoding methods.

2025-07-20T07:09:00+00:00 ― 5 min read

Sound Innovative Approaches to Birdcall Classification

A team tackles birdcall identification challenges in the BirdCLEF 2024 competition.

2025-07-20T01:28:55+00:00 ― 6 min read

Sound New Datasets for Music Emotion Recognition

Introducing MERGE datasets to improve emotion classification in music.

2025-07-19T20:37:25+00:00 ― 6 min read

Sound Advancing Few-Shot Keyword Spotting with Mix-Training

This study examines Mix-Training for keyword spotting in noisy speech conditions.

2025-07-19T16:39:18+00:00 ― 5 min read

Machine Learning Boosting Small Models with Large Model Insights

A new method helps smaller models perform better using hints from larger models.

2025-07-19T14:08:45+00:00 ― 6 min read