Electrical Engineering and Systems Science - Audio and Speech Processing

Human-Computer Interaction Improving Sound Localization in XR with Auptimize

Auptimize enhances audio cue placement for better user interaction in XR.

2025-06-29T04:15:50+00:00 ― 6 min read

Audio and Speech Processing Malacopula: A New Threat to Voice Verification Systems

Malacopula challenges the reliability of automatic speaker verification technologies.

2025-06-29T03:27:15+00:00 ― 6 min read

Graphics MetaFace: Advancing 3D Talking Face Animations

A new method for more realistic 3D face animations adjusting to personal speaking styles.

2025-06-28T19:21:25+00:00 ― 5 min read

Sound Improving Keyword Spotting with Adversarial Training

Adversarial training enhances keyword spotting accuracy in synthetic and real speech.

2025-06-28T13:41:20+00:00 ― 5 min read

Sound Advancements in Few-Shot Learning for Audio Processing

This piece discusses few-shot learning and its impact on audio tasks.

2025-06-28T12:04:10+00:00 ― 6 min read

Sound Transforming Communication: Face-based Voice Conversion

New technology links facial features to voice, aiding communication for those without a voice.

2025-06-28T06:24:05+00:00 ― 5 min read

Machine Learning Advancements in Audio Compositional Learning

A new method enhances audio separation and generation without labeled data.

2025-06-28T05:35:30+00:00 ― 6 min read

Sound ASVspoof Challenge: Advancements in Voice Authentication

Addressing the challenges of fake audio and speaker verification.

2025-06-28T00:44:00+00:00 ― 5 min read

Computation and Language Improving Pronunciation for Non-Native Speakers

A new system enhances speech clarity for language learners by focusing on accent training.

2025-06-27T23:55:25+00:00 ― 4 min read

Sound Classifying Rage Music: A Machine Learning Approach

Analyzing rage music features through machine learning for better genre classification.

2025-06-27T20:41:05+00:00 ― 5 min read

Sound The Rise of Fake Audio and Detection Challenges

Fake audio clips are a serious concern; effective detection methods are essential.

2025-06-27T19:52:30+00:00 ― 6 min read

Sound Enhancing Fake Audio Detection with Color Quantization

A new method improves the accuracy of detecting synthetic audio.

2025-06-27T19:03:55+00:00 ― 5 min read

Sound DisMix: Transforming Music Manipulation

A new method for separating and manipulating musical sounds.

2025-06-27T17:26:45+00:00 ― 5 min read

Audio and Speech Processing Advancements in Text-to-Speech Technology with SSL-TTS

SSL-TTS simplifies voice synthesis using minimal training data for high-quality results.

2025-06-27T15:49:35+00:00 ― 6 min read

Computation and Language Improving Multilingual Speech Recognition Without Original Data

New methods enhance ASR models for multiple languages, preserving past knowledge.

2025-06-27T15:01:00+00:00 ― 5 min read

Computation and Language Improving Bilingual Speech Recognition with XCB

A new approach enhances recognition of code-switched phrases in bilingual speech.

2025-06-27T11:46:40+00:00 ― 5 min read

Sound Video-Foley: Transforming Sound Design in Multimedia

An innovative system automates sound generation for films and games.

2025-06-26T23:37:55+00:00 ― 8 min read

Sound Advancements in Speaker Verification Technology

New methods improve speaker recognition in noisy environments.

2025-06-26T18:46:25+00:00 ― 5 min read

Sound Advancements in Zero-Shot Voice Conversion Technology

New model improves voice conversion, especially for whispered speech and real-time applications.

2025-06-26T17:57:50+00:00 ― 6 min read

Sound A Fresh Look at Guitar Amplifier Modeling

Exploring a new digital approach to guitar amplifier sound modeling.

2025-06-26T16:20:40+00:00 ― 5 min read

Sound GaMaDHaNi: A New System for Hindustani Melodies

Introducing a groundbreaking system to generate Hindustani vocal music.

2025-06-26T11:29:10+00:00 ― 6 min read

Sound Advancements in Modeling Dynamic Range Compressors with Neural Networks

A new method for accurately modeling optical compressors using neural networks.

2025-06-26T10:40:35+00:00 ― 7 min read

Human-Computer Interaction WhisperMask: A Game Changer in Voice Communication

WhisperMask captures voice clearly in noisy places, enhancing communication.

2025-06-26T09:03:25+00:00 ― 6 min read

Sound Advancements in Voice Quality Assessment Using Technology

New methods improve voice quality assessments for patients with vocal system issues.

2025-06-26T07:26:15+00:00 ― 6 min read

Human-Computer Interaction VoiceX: A New Era in Voice Creation

VoiceX simplifies the process of creating personalized voices for various applications.

2025-06-26T05:49:05+00:00 ― 4 min read

Computation and Language The Role of Prosody and Pragmatics in Speech Technology

Examining how voice patterns affect meaning and technology performance.

2025-06-25T21:43:15+00:00 ― 4 min read

Sound Introducing NEST: A New Model for Speech Processing

NEST offers a faster, more efficient approach to self-supervised speech tasks.

2025-06-25T20:06:05+00:00 ― 5 min read

Audio and Speech Processing Assessing Bias in Speaker Verification Systems

A look into bias measurement methods for speaker verification.

2025-06-25T17:40:20+00:00 ― 5 min read

Multimedia Rethinking Audio-Visual Source Localization Benchmarks

Current benchmarks misjudge models' ability to connect audio and visual data.

2025-06-25T16:03:10+00:00 ― 5 min read

Audio and Speech Processing Advancements in Musical Onset Detection Methods

New algorithms improve accuracy in identifying musical note beginnings.

2025-06-25T14:26:00+00:00 ― 6 min read

Sound Advancements in Speech Emotion Recognition with Wav2Small

Wav2Small enhances emotion detection in speech with reduced resource needs.

2025-06-25T10:23:05+00:00 ― 5 min read

Sound Challenges in Detecting Partially Fake Speech Signals

A look into the complexities of identifying audio that mixes genuine and fake speech.

2025-06-25T06:20:10+00:00 ― 6 min read

Audio and Speech Processing Advancements in Whispered Speech Recognition Technology

New methods improve speech recognition for whispered communication.

2025-06-25T05:31:35+00:00 ― 5 min read

Audio and Speech Processing Understanding Tamil Language Dialects

An overview of Tamil's rich dialects and identification methods.

2025-06-25T04:43:00+00:00 ― 5 min read

Audio and Speech Processing Advancements in Spoken-Term Discovery with DUSTED

DUSTED improves efficiency in identifying spoken words by analyzing phonetic patterns.

2025-06-25T02:17:15+00:00 ― 5 min read

Audio and Speech Processing Efficient Sound Recognition Using Continuous Wavelet Transform

A new method improves sound recognition with less computing power.

2025-06-24T23:51:30+00:00 ― 5 min read

Sound Innovative Framework for Machine Sound Detection

A new approach to detect machine issues without compromising data privacy.

2025-06-24T16:34:15+00:00 ― 5 min read

Sound VoiceTailor: Personalizing Text-to-Speech Technology

VoiceTailor transforms TTS systems for efficient, personalized voice outputs.

2025-06-24T15:45:40+00:00 ― 5 min read

Sound Understanding Sound Field Estimation: A Practical Approach

Learn how sound spreads through a space and where sound field estimation is applied.

2025-06-24T14:57:05+00:00 ― 6 min read

Sound StyleSpeech: The Future of Text-to-Speech Technology

StyleSpeech advances TTS systems by capturing natural speech nuances.

2025-06-24T14:08:30+00:00 ― 6 min read