Electrical Engineering and Systems Science - Audio and Speech Processing

Computation and Language Enhancing Police Accountability with Speech Recognition Technology

Research explores the use of speech recognition in police body camera footage analysis.

2025-10-25T07:54:40+00:00 ― 6 min read

Audio and Speech Processing Advancements in Voice Stress Detection Technology

New methods improve short-duration voice stress detection accuracy.

2025-10-25T06:17:30+00:00 ― 6 min read

Sound New Ways Computers Create Music

A look at how computers are changing music composition.

2025-10-25T04:31:16+00:00 ― 4 min read

Audio and Speech Processing Improving Emotional Recognition and Synthesis in Speech Models

New techniques enhance emotional understanding in speech processing tasks.

2025-10-25T01:26:00+00:00 ― 6 min read

Sound LinDiff: A Leap Forward in Speech Synthesis

The new LinDiff model improves speech synthesis speed and quality.

2025-10-25T00:37:25+00:00 ― 4 min read

Sound New Method Transforms Audio Compression Technology

A new approach to audio compression reduces file size without losing quality.

2025-10-24T18:57:20+00:00 ― 5 min read

Sound Enhancing Speech Clarity in Noisy Environments

Techniques to improve speech recognition amidst background noise.

2025-10-24T16:50:20+00:00 ― 5 min read

Computation and Language Improving Voice Assistants with Multimodal Language Understanding

Multimodal language understanding enhances voice assistant performance in real-world conditions.

2025-10-24T15:43:00+00:00 ― 5 min read

Audio and Speech Processing HiddenSinger: A New Era in Singing Voice Synthesis

HiddenSinger improves singing voice quality using advanced AI techniques.

2025-10-24T14:54:25+00:00 ― 5 min read

Sound Advancements in Electrolaryngeal Voice Conversion Technology

New methods improve speech clarity for electrolarynx users.

2025-10-24T13:17:15+00:00 ― 6 min read

Sound Innovative Advances in Electrolaryngeal Speech Technology

Researchers blend visual and sound features to improve speech for electrolarynx users.

2025-10-24T12:28:40+00:00 ― 5 min read

Audio and Speech Processing The Impact of Age on Voice Recognition Systems

A study highlights how ageing affects automatic speaker verification performance.

2025-10-24T10:02:55+00:00 ― 5 min read

Audio and Speech Processing PauseSpeech: Advancing Text-to-Speech Technology

PauseSpeech enhances TTS systems with natural-sounding speech through improved pausing.

2025-10-24T09:14:20+00:00 ― 5 min read

Multimedia A New System for Music and Video Matching

This research introduces a system for matching music to video content effectively.

2025-10-24T07:37:10+00:00 ― 6 min read

Audio and Speech Processing Enhancing Speech Recognition in Noisy Environments

New methods improve automatic speech recognition performance amid background noise.

2025-10-24T02:45:40+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Recognition with Large Language Models

This research highlights how LLMs enhance speech understanding in long videos.

2025-10-23T22:42:45+00:00 ― 4 min read

Audio and Speech Processing Efficient Management of Large Speech Models

A new method optimizes speech models for better performance with fewer resources.

2025-10-23T21:54:10+00:00 ― 5 min read

Audio and Speech Processing New Method for Objective Spatial Audio Evaluation

A fresh approach improves how we assess spatial audio quality.

2025-10-23T19:28:25+00:00 ― 5 min read

Sound Identifying Read vs. Spontaneous Speech in Interviews

A study on how to distinguish read speech from spontaneous speech.

2025-10-23T18:39:50+00:00 ― 6 min read

Audio and Speech Processing StyleTTS 2: Advancing Text-to-Speech Technology

A new model enhances the realism of synthetic speech.

2025-10-23T15:25:30+00:00 ― 8 min read

Audio and Speech Processing Malafide: A New Challenge for Voice Recognition Systems

Malafide introduces sophisticated spoofing techniques, complicating countermeasures in speech recognition.

2025-10-23T14:36:55+00:00 ― 5 min read

Audio and Speech Processing Advancements in Sound Source Tracking with PI-RNN

A new model improves accuracy and efficiency in tracking sound sources.

2025-10-23T10:34:00+00:00 ― 5 min read

Computation and Language Introducing the ITALIC Dataset for Spoken Italian

A new dataset enhances spoken language understanding for Italian.

2025-10-23T08:56:50+00:00 ― 6 min read

Audio and Speech Processing Advancements in Self-Supervised Learning for Speech Processing

MCR-Data2vec 2.0 enhances speech recognition by improving model consistency.

2025-10-23T08:08:15+00:00 ― 4 min read

Machine Learning EM-Network: A New Approach in Sequence Learning

EM-Network enhances sequence learning in speech and language processing tasks.

2025-10-23T07:19:40+00:00 ― 5 min read

Audio and Speech Processing Advances in Bilingual and Code-Switched ASR Models

New methods improve multilingual speech recognition using existing data sources.

2025-10-23T04:05:20+00:00 ― 6 min read

Computation and Language Improving Speech Recognition for Low-Resource Languages

Research focuses on enhancing speech tech for languages lacking sufficient data.

2025-10-22T23:13:50+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Enhancement Techniques

A look at recent developments in improving audio clarity using advanced models.

2025-10-22T21:36:40+00:00 ― 5 min read

Sound Assessing Piano Piece Difficulty with New Dataset

A new dataset aims to classify piano scores by difficulty level.

2025-10-22T20:48:05+00:00 ― 7 min read

Sound Advancements in Speech Quality Improvement

Gesper framework enhances speech clarity in noisy environments.

2025-10-22T19:59:30+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Enhancement with Normalization Techniques

This study presents a new method to enhance speech quality using pre-trained models.

2025-10-22T19:10:55+00:00 ― 6 min read

Artificial Intelligence Improving Hate Speech Detection in Multimedia

Combining audio, video, and text enhances detection of hate speech.

2025-10-22T15:08:00+00:00 ― 5 min read

Sound A Simplified Approach to Hybrid HMM for ASR

This article discusses a new method for building efficient ASR systems.

2025-10-22T14:19:25+00:00 ― 5 min read

Audio and Speech Processing Personalizing Voice Recognition on Mobile Devices

A new approach enhances voice recognition directly on smartphones while ensuring user privacy.

2025-10-22T10:16:30+00:00 ― 6 min read

Audio and Speech Processing New System Improves Speaker Identification in Audio

A new method enhances accuracy in identifying speakers during conversations.

2025-10-22T09:27:55+00:00 ― 5 min read

Sound Advancements in Few-shot Bioacoustic Event Detection

Teams improve animal sound identification from few examples in the DCASE challenge.

2025-10-22T07:50:45+00:00 ― 5 min read

Sound Harnessing Audio Tagging on Small Computers

Learn about audio tagging systems and their use on Raspberry Pi.

2025-10-22T06:13:35+00:00 ― 5 min read

Sound Advancements in Cover Song Identification Algorithms

New techniques improve accuracy and efficiency in identifying cover songs.

2025-10-22T05:25:00+00:00 ― 5 min read

Audio and Speech Processing Advancements in Active Noise Control Technology

New method improves noise control in 3D spaces.

2025-10-22T01:22:05+00:00 ― 4 min read

Audio and Speech Processing New Dataset and Model for Multilingual Text-to-Speech

CML-TTS enables better text-to-speech systems across seven languages.

2025-10-21T18:04:50+00:00 ― 5 min read