Computer Science - Sound

RSS

Audio and Speech Processing Efficient Management of Large Speech Models

A new method optimizes speech models for better performance with fewer resources.

2025-10-23T21:54:10+00:00 ― 5 min read

Audio and Speech Processing New Method for Objective Spatial Audio Evaluation

A fresh approach improves how we assess spatial audio quality.

2025-10-23T19:28:25+00:00 ― 5 min read

Sound Identifying Read vs. Spontaneous Speech in Interviews

A study on how to tell apart read and spontaneous speech.

2025-10-23T18:39:50+00:00 ― 6 min read

Audio and Speech Processing StyleTTS 2: Advancing Text-to-Speech Technology

A new model enhances the realism of synthetic speech.

2025-10-23T15:25:30+00:00 ― 8 min read

Audio and Speech Processing Advancements in Sound Source Tracking with PI-RNN

A new model improves accuracy and efficiency in tracking sound sources.

2025-10-23T10:34:00+00:00 ― 5 min read

Computation and Language Introducing the ITALIC Dataset for Spoken Italian

A new dataset enhances spoken language understanding for Italian.

2025-10-23T08:56:50+00:00 ― 6 min read

Audio and Speech Processing Advances in Bilingual and Code-Switched ASR Models

New methods improve multilingual speech recognition using existing data sources.

2025-10-23T04:05:20+00:00 ― 6 min read

Computation and Language Improving Speech Recognition for Low-Resource Languages

Research focuses on enhancing speech tech for languages lacking sufficient data.

2025-10-22T23:13:50+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Enhancement Techniques

A look at recent developments in improving audio clarity using advanced models.

2025-10-22T21:36:40+00:00 ― 5 min read

Sound Assessing Piano Piece Difficulty with New Dataset

A new dataset aims to classify piano scores by difficulty level.

2025-10-22T20:48:05+00:00 ― 7 min read

Sound Advancements in Speech Quality Improvement

Gesper framework enhances speech clarity in noisy environments.

2025-10-22T19:59:30+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Enhancement with Normalization Techniques

This study presents a new method to enhance speech quality using pre-trained models.

2025-10-22T19:10:55+00:00 ― 6 min read

Artificial Intelligence Improving Hate Speech Detection in Multimedia

Combining audio, video, and text enhances detection of hate speech.

2025-10-22T15:08:00+00:00 ― 5 min read

Sound A Simplified Approach to Hybrid HMM for ASR

This article discusses a new method for building efficient ASR systems.

2025-10-22T14:19:25+00:00 ― 5 min read

Sound Innovative Approach to Roman Numeral Analysis in Music

A new method using Graph Neural Networks improves Roman Numeral analysis for music.

2025-10-22T13:26:06+00:00 ― 6 min read

Sound Advancements in Few-shot Bioacoustic Event Detection

Teams improve animal sound identification with few examples in DCASE challenge.

2025-10-22T07:50:45+00:00 ― 5 min read

Sound Harnessing Audio Tagging on Small Computers

Learn about audio tagging systems and their use on Raspberry Pi.

2025-10-22T06:13:35+00:00 ― 5 min read

Sound Advancements in Cover Song Identification Algorithms

New techniques improve accuracy and efficiency in identifying cover songs.

2025-10-22T05:25:00+00:00 ― 5 min read

Audio and Speech Processing Advancements in Active Noise Control Technology

New method improves noise control in 3D spaces.

2025-10-22T01:22:05+00:00 ― 4 min read

Sound Evaluating Speech Quality with Machine Learning Models

This study assesses various models for predicting synthesized speech quality.

2025-10-21T16:27:40+00:00 ― 5 min read

Sound Advancements in Bird Sound Classification Methods

Researchers automate bird sound classification, enhancing accuracy in monitoring species.

2025-10-21T14:50:30+00:00 ― 5 min read

Audio and Speech Processing FALL-E: A New Era in Sound Creation

FALL-E creates high-quality sound effects from text descriptions.

2025-10-21T13:13:20+00:00 ― 5 min read

Audio and Speech Processing Advancements in Multi-Talker Speech Recognition with SURT 2.0

SURT 2.0 improves speech recognition for multiple speakers in real-time settings.

2025-10-21T05:07:30+00:00 ― 5 min read

Sound Introducing MARBLE: A Benchmark for Music AI

MARBLE sets a standard for evaluating music AI models across multiple tasks.

2025-10-21T04:18:55+00:00 ― 6 min read

Audio and Speech Processing New Model Enhances Bird Sound Detection

A new method improves the accuracy of identifying bird calls.

2025-10-21T03:30:20+00:00 ― 6 min read

Sound Improving Audio Processing with SFI Layers

New algorithms enhance audio processing performance across varying sample rates.

2025-10-21T00:16:00+00:00 ― 5 min read

Sound Using Sound to Sort Male Mosquitoes for Pest Control

Research explores sound analysis to improve mosquito sorting for disease control.

2025-10-20T21:50:15+00:00 ― 5 min read

Sound Transforming Vocal Sounds with DSP Techniques

Explore two innovative methods for altering vocal timbre using Digital Signal Processing.

2025-10-20T14:33:00+00:00 ― 4 min read

Audio and Speech Processing Advancements in Automatic Speech Recognition Learning

A new method enhances speech recognition technology without losing previously learned knowledge.

2025-10-20T13:44:25+00:00 ― 6 min read

Sound Advances in Multitrack Music Transcription with Perceiver TF

A new model improves music transcription accuracy for multiple instruments.

2025-10-20T12:07:15+00:00 ― 5 min read

Sound Advancements in Audio Processing with DAMAS-FISTA

A new method combines traditional and deep learning for efficient sound imaging.

2025-10-20T11:18:40+00:00 ― 6 min read

Audio and Speech Processing Advancements in Sound Field Reconstruction

New methods improve realism in audio technologies using physics-informed techniques.

2025-10-20T10:30:05+00:00 ― 6 min read

Audio and Speech Processing Voice Recognition's Role in Clinical Trial Integrity

Investigating how voice technology can prevent duplicate patient participation in trials.

2025-10-20T07:15:45+00:00 ― 6 min read

Audio and Speech Processing Analyzing Speech to Detect Mental Health Issues

A new dataset helps identify signs of depression and anxiety through speech analysis.

2025-10-20T06:27:10+00:00 ― 6 min read

Sound Reconstructing Sound from Brain Activity

New method reconstructs sound from brain signals, revealing insights into auditory processing.

2025-10-20T01:35:40+00:00 ― 5 min read

Sound Bringing AI to Music Creation on Bela

A guide to using AI models for music on the Bela platform.

2025-10-19T22:21:20+00:00 ― 5 min read

Computation and Language Evaluating ASR Quality Without Reference Texts

NoRefER offers a new way to assess speech recognition outputs without needing transcripts.

2025-10-19T16:41:15+00:00 ― 6 min read

Computer Vision and Pattern Recognition Improving Video Captioning with Audio Integration

This article discusses a method to enhance video captioning by incorporating audio.

2025-10-19T15:52:40+00:00 ― 5 min read

Sound Advancements in Voice Conversion Technology

A new model improves voice conversion by simplifying speech separation techniques.

2025-10-19T12:38:20+00:00 ― 6 min read

Sound Advancements in Measuring Music Similarity

Research aims to combine audio and symbolic data for music similarity analysis.

2025-10-19T11:49:45+00:00 ― 7 min read