Latest Articles for Speech Recognition

Computation and Language: Advancing Speech Act Recognition in Bengali

A new method improves speech act recognition in Bengali using audio and text analysis.

2025-10-27T10:55:25+00:00 ― 5 min read

Computation and Language: Advancements in Language Identification with LASR Framework

A new approach improves speech language identification using self-supervised learning and labels.

2025-10-26T08:12:10+00:00 ― 6 min read

Sound: Improving Arabic Dysarthric Speech Recognition

A new method enhances speech recognition for dysarthric Arabic speakers.

2025-10-26T07:23:35+00:00 ― 5 min read

Computation and Language: Innovative Speech Recognition Tool for Low-Resource Languages

Allophant enhances phoneme recognition for languages with limited data.

2025-10-26T06:35:00+00:00 ― 5 min read

Audio and Speech Processing: Advancing Word Timing in Speech Recognition Systems

Improving how speech recognition systems estimate word timing for better accuracy.

2025-10-26T01:43:30+00:00 ― 4 min read

Audio and Speech Processing: Improving Speech Recognition with Advanced Models

New methods enhance speech processing in language models.

2025-10-26T00:54:55+00:00 ― 5 min read

Computer Vision and Pattern Recognition: Alternative Telescopic Displacement: A New Method for Multimodal Data Alignment

Discover a new method for combining different types of data effectively.

2025-10-25T22:40:24+00:00 ― 5 min read

Computation and Language: Advancements in Self-Supervised Learning for Speech Recognition

Self-supervised models reveal insights into phonetic and phonemic distinctions in speech.

2025-10-25T10:20:25+00:00 ― 5 min read

Neuroscience: Examining Brain Responses to Speech: Key Insights

Research reveals how our brain tracks speech features during comprehension.

2025-10-25T09:40:42+00:00 ― 6 min read

Computation and Language: Advancements in Spoken Named Entity Recognition

This study focuses on improving spoken named entity recognition (NER) through transfer learning and end-to-end (E2E) models.

2025-10-24T10:59:30+00:00 ― 6 min read

Computation and Language: Improving Slot Filling in Dialogue Systems

A new method enhances task-oriented dialogue systems using audio and knowledge integration.

2025-10-23T22:13:12+00:00 ― 6 min read

Computation and Language: Advancements in Automatic Speech Recognition for Norwegian Languages

Recent research improves ASR models for Norwegian, enhancing performance in Bokmål and Nynorsk.

2025-10-23T21:10:00+00:00 ― 4 min read

Audio and Speech Processing: Advances in Bilingual and Code-Switched ASR Models

New methods improve multilingual speech recognition using existing data sources.

2025-10-23T04:05:20+00:00 ― 6 min read

Computation and Language: Improving Speech Recognition for Low-Resource Languages

Research focuses on enhancing speech tech for languages lacking sufficient data.

2025-10-22T23:13:50+00:00 ― 6 min read

Sound: A Simplified Approach to Hybrid HMM for ASR

This article discusses a new method for building efficient ASR systems.

2025-10-22T14:19:25+00:00 ― 5 min read

Audio and Speech Processing: New Dataset and Model for Multilingual Text-to-Speech

CML-TTS enables better text-to-speech systems across seven languages.

2025-10-21T18:04:50+00:00 ― 5 min read

Audio and Speech Processing: Advancements in Multi-Talker Speech Recognition with SURT 2.0

SURT 2.0 improves speech recognition for multiple speakers in real-time settings.

2025-10-21T05:07:30+00:00 ― 5 min read

Audio and Speech Processing: Advancements in Automatic Speech Recognition Learning

A new method enhances speech recognition technology without losing previously learned knowledge.

2025-10-20T13:44:25+00:00 ― 6 min read

Computation and Language: New Metrics for Assessing Speech Recognition Quality

A new method evaluates ASR systems without needing reference texts.

2025-10-19T19:07:00+00:00 ― 5 min read

Computation and Language: Evaluating ASR Quality Without Reference Texts

NoRefER offers a new way to assess speech recognition outputs without needing transcripts.

2025-10-19T16:41:15+00:00 ― 6 min read

Audio and Speech Processing: Advancements in Spoken Language Diarization Techniques

New methods enhance speech segmentation in multi-language conversations.

2025-10-19T02:06:45+00:00 ― 6 min read

Audio and Speech Processing: Advancements in Automatic Speech Recognition for Multilingual Use

A new framework improves ASR for low-resource languages and multilingual scalability.

2025-10-18T19:38:05+00:00 ― 5 min read

Computer Vision and Pattern Recognition: Improving Lip Reading with Viseme Training

A new method enhances lip reading accuracy using visemes in speech recognition.

2025-10-18T03:42:24+00:00 ― 5 min read

Sound: Advancing Speech Recognition for Deaf Users

Personalized ASR systems significantly improve communication for deaf and hard-of-hearing (DHH) individuals.

2025-10-18T03:26:25+00:00 ― 5 min read

Sound: Advancements in Speaker Diarization Techniques

New methods leverage conversational summaries for better speaker recognition.

2025-10-18T00:12:05+00:00 ― 5 min read

Computation and Language: Improving Automatic Speech Scoring for Language Learners

Enhancing feedback systems for English learners by addressing the cold start problem.

2025-10-17T16:54:50+00:00 ― 6 min read

Multimedia: Improving Target Speaker Extraction with Visual Cues

A new model enhances speech extraction using audio and visual information.

2025-10-17T12:51:55+00:00 ― 5 min read

Sound: Advancements in Target Speaker Extraction Technology

Learn how new techniques improve speech clarity in noisy environments.

2025-10-16T10:08:40+00:00 ― 5 min read

Computation and Language: Improving Speech Recognition with Long-Context Models

This article discusses new models that enhance speech recognition accuracy by considering longer context.

2025-10-15T12:16:55+00:00 ― 5 min read

Neural and Evolutionary Computing: Advancing Spiking Neural Networks Through Delay Learning

A new method enhances learning in spiking neural networks by incorporating delay adjustments.

2025-10-15T07:25:25+00:00 ― 6 min read

Audio and Speech Processing: Advancing Gender Privacy in Audio: New Insights

Research highlights methods to protect gender privacy in spoken audio.

2025-10-14T21:42:25+00:00 ― 5 min read

Sound: Advancements in Lip-to-Speech Synthesis Technology

A new framework enhances speech clarity from silent videos through improved processing.

2025-10-13T19:47:45+00:00 ― 6 min read

Sound: Advancements in Fake Audio Detection Using Conformer Models

Researchers develop a Conformer model to improve fake audio detection.

2025-10-13T03:36:05+00:00 ― 5 min read

Audio and Speech Processing: Advancing Acoustic Word Embeddings for Spoken Language

Research on improving acoustic word embeddings with semantic understanding and multilingual data.

2025-10-12T14:38:45+00:00 ― 6 min read

Audio and Speech Processing: Integrating Speech with Language Models: The Speech-LLaMA Method

A new approach that combines speech with language models for improved translation.

2025-10-11T18:24:10+00:00 ― 4 min read

Computation and Language: Improving Speech Recognition with RNN-Transducers

New methods enhance speech recognition accuracy, addressing common transcription errors.

2025-10-11T04:38:15+00:00 ― 4 min read

Computation and Language: Advancements in Speech Intent Classification and Slot Filling

This article explores a new model for speech intent and slot identification.

2025-10-09T12:09:05+00:00 ― 6 min read

Audio and Speech Processing: Advancements in Speech Recognition Without Text

A new method improves speech recognition using only raw audio data.

2025-10-09T02:26:05+00:00 ― 5 min read

Computation and Language: Improving Speech Recognition for Older Adults

A study enhances ASR for older speakers using innovative techniques.

2025-10-09T01:37:30+00:00 ― 6 min read

Audio and Speech Processing: New Dataset Aims to Improve Hebrew Speech Recognition

ivrit.ai provides vital resources for enhancing Hebrew ASR technology.

2025-10-08T05:22:55+00:00 ― 6 min read