Latest Articles for Speech Technology

Audio and Speech Processing Advancements in Whispered Speech Recognition Technology

New methods improve speech recognition for whispered communication.

2025-06-25T05:31:35+00:00 ― 5 min read

Sound StyleSpeech: The Future of Text-to-Speech Technology

StyleSpeech advances TTS systems by capturing natural speech nuances.

2025-06-24T14:08:30+00:00 ― 6 min read

Sound EmoAttack: A New Threat in Speech Technology

EmoAttack leverages emotional voice conversion to exploit vulnerabilities in speech systems.

2025-06-24T01:59:45+00:00 ― 5 min read

Audio and Speech Processing Advancing Whispered Speech Conversion with MaskCycleGAN

A new method improves converting whispered speech to normal speech using advanced techniques.

2025-06-23T09:48:05+00:00 ― 5 min read

Sound VoxInstruct: A New Way to Generate Speech

VoxInstruct combines content and style for more natural speech generation.

2025-06-22T23:16:30+00:00 ― 5 min read

Sound Advancements in Speaker Verification Using Whisper

A novel method improves voice recognition accuracy across multiple languages.

2025-06-22T18:25:00+00:00 ― 5 min read

Audio and Speech Processing Advancements in Speech Enhancement with Time-Context Windowing

Exploring a new approach to improving speech quality using time-context windowing.

2025-06-22T17:36:25+00:00 ― 5 min read

Sound Advancements in Text-to-Speech Technology

New methods improve the quality of speech synthesis in TTS systems.

2025-06-22T05:27:40+00:00 ― 4 min read

Audio and Speech Processing Introducing SelectTTS: A Streamlined Text-to-Speech Method

SelectTTS simplifies speech generation for unseen speakers with effective frame selection.

2025-06-21T18:07:30+00:00 ― 5 min read

Audio and Speech Processing Advancements in Self-Supervised Learning for Speech Processing

A new method improves speech model performance across various tasks.

2025-06-21T02:44:25+00:00 ― 6 min read

Sound Advancing Keyword Spotting with Unlabeled Data

A new method improves keyword spotting accuracy using unlabeled audio data.

2025-06-21T01:55:50+00:00 ― 6 min read

Neurons and Cognition Automatic Detection of Mild Cognitive Impairment through Speech Analysis

Research shows speech analysis can aid in early detection of Mild Cognitive Impairment.

2025-06-21T01:11:33+00:00 ― 5 min read

Sound New Dataset Enhances Speech Recognition Technology

Researchers create LibriheavyMix to improve speech recognition in noisy environments.

2025-06-20T22:41:30+00:00 ― 5 min read

Computation and Language Advancements in Speech Tokenization: A Framework for Evaluation

A new benchmark aids in assessing speech tokenizers for better performance.

2025-06-20T00:01:10+00:00 ― 6 min read

Computation and Language Using Speech Data for Autism Diagnosis

A new method leverages speech data to improve autism assessments.

2025-06-19T19:12:12+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech Synthesis Using DDSP

Discover how DDSP improves speech synthesis efficiency and quality.

2025-06-18T17:15:00+00:00 ― 6 min read

Computation and Language Challenges in Speaker Recognition for Speech Language Models

SpeechLLMs show promise but struggle with speaker identification in conversations.

2025-06-17T08:03:05+00:00 ― 4 min read

Audio and Speech Processing Efficient Training of Speech Models Under Limited Resources

This article discusses efficient training methods for speech models using self-supervised learning.

2025-06-16T15:02:50+00:00 ― 4 min read

Computation and Language Improving Speech Systems for Indian Languages

A new dataset enhances multilingual speech technology in India.

2025-06-15T18:48:15+00:00 ― 5 min read

Sound Advancements in Emotional Text-to-Speech Technology

ParaEVITS improves emotional expression in TTS through natural language guidance.

2025-06-15T05:50:55+00:00 ― 5 min read

Computation and Language Advancing Speech Recognition for Faetar Language

Efforts to improve speech technology for the under-resourced Faetar language.

2025-06-13T09:18:50+00:00 ― 5 min read

Computation and Language WhisperNER: Merging Speech Recognition and Entity Detection

A new model combines speech recognition and entity recognition for better results.

2025-06-13T03:29:30+00:00 ― 5 min read

Audio and Speech Processing Advancing Speech Recognition for Individuals with Disorders

A project aims to improve speech technology for those with communication challenges.

2025-06-12T12:15:40+00:00 ― 5 min read

Sound Improving Accents in Text-to-Speech Technology

A new system enhances accent accuracy in TTS for better communication.

2025-06-12T08:12:45+00:00 ― 5 min read

Sound ESPnet-EZ: Simplifying Speech Model Development

An easy-to-use tool for fine-tuning speech models without complex code.

2025-06-11T15:12:30+00:00 ― 6 min read

Quantum Physics Advancing Speech Recognition with Quantum Computing

A new method improving speech recognition while ensuring data privacy.

2025-06-11T07:18:42+00:00 ― 5 min read

Sound Advancements in Accent Conversion Techniques

A new method for generating accented speech using text transliteration.

2025-06-11T06:18:05+00:00 ― 6 min read

Audio and Speech Processing Improving Speech Quality with Wave-U-Mamba

Wave-U-Mamba enhances low-quality speech recordings for clearer communication.

2025-06-11T04:40:55+00:00 ― 5 min read

Sound Advancements in Speech Quality Assessment

A new system predicts naturalness scores for synthetic speech using innovative methods.

2025-06-11T03:52:20+00:00 ― 5 min read

Computation and Language Advancements in Speech Recognition with LLMs

Exploring the GenSEC challenge to improve speech transcription accuracy.

2025-06-10T18:57:55+00:00 ― 4 min read

Audio and Speech Processing Evaluating Speech Models with Rank Measurement

A new method assesses self-supervised speech models using rank measurement.

2025-06-10T05:12:00+00:00 ― 5 min read

Audio and Speech Processing Enhancing Speech Clarity with MCMamba Model

MCMamba model improves speech quality in noisy environments using spatial and spectral information.

2025-06-09T21:54:45+00:00 ― 4 min read

Audio and Speech Processing Advancements in Speech Recognition Through Human-Like Thinking

A new framework enhances speech recognition by modeling sound relationships effectively.

2025-06-09T07:20:15+00:00 ― 4 min read

Audio and Speech Processing Improving Speech Spoof Detection with Explainable Methods

A new approach enhances the interpretability of spoof speech detection.

2025-06-08T11:05:40+00:00 ― 5 min read

Audio and Speech Processing Advancements in Multilingual Speech Technology

A model improves speech tasks in multilingual settings, addressing code-switching challenges.

2025-06-08T06:14:10+00:00 ― 5 min read

Audio and Speech Processing EVA: A New Era in Audiovisual Speech Recognition

EVA combines audio and visual signals for better speech recognition accuracy.

2025-06-07T22:08:20+00:00 ― 4 min read

Computation and Language Advancing Speech Recognition with Implicit Techniques

A new method improves speech interactions by integrating recognition and response processes.

2025-06-06T03:21:12+00:00 ― 5 min read

Computation and Language Combining Speech and Language Models for Better Performance

Research evaluates connections between speech and language models for improved recognition and translation.

2025-06-05T22:13:06+00:00 ― 5 min read

Computation and Language Innovative Methods for Speech Recognition with Limited Data

Learn how to effectively train speech models with fewer labeled resources.

2025-06-05T19:07:35+00:00 ― 7 min read

Computation and Language Reevaluating Gender in Speech Technology Research

An analysis of gender terminology in speech technology and its societal implications.

2025-06-05T15:53:15+00:00 ― 7 min read