A resource-efficient approach to backdoor attacks on advanced machine learning models.
― 5 min read
Cutting edge science explained simply
Harnessing early-exit models for efficient federated learning in ASR systems.
― 8 min read
Denoising Language Models improve error correction in speech recognition systems using synthetic data.
― 7 min read
New model VPIDM improves clarity of speech in noisy environments.
― 6 min read
A study on desktop robots using natural language and visual recognition technologies.
― 12 min read
New methods improve language model predictions under varying input conditions.
― 6 min read
A new model improves speech recognition using multiple decoding methods.
― 6 min read
A fresh method for testing language model safety and multilingual skills.
― 7 min read
A new defense strategy for LLMs against backdoor attacks.
― 5 min read
A new method combines acoustic features and confidence scores for better error correction.
― 5 min read
This study presents a dataset and method to enhance Chinese ASR accuracy using Pinyin.
― 7 min read
This study evaluates speech technology in low-resource languages like Tunisian Arabic.
― 5 min read
Emilia provides a diverse dataset for improving speech generation models.
― 6 min read
This article discusses ways to enhance numeric expression formatting in automatic transcripts.
― 5 min read
A new model aims to improve speech translation quality through integrated systems.
― 5 min read
AI models enhance accuracy of speech-to-text conversions.
― 5 min read
Research enhances ASR systems using language models for better accuracy.
― 7 min read
A method to enhance speech recognition quality in noisy environments.
― 6 min read
A new method enhances product searches across different media formats.
― 6 min read
SAGE-RT creates synthetic data to improve language model safety assessments.
― 5 min read
New methods improve voice quality assessments for patients with vocal system issues.
― 6 min read
A look at measuring accuracy in speech recognition systems with new methods.
― 5 min read
New method enhances ASR accuracy using language models for better transcriptions.
― 4 min read
New methods improve speech recognition in challenging multi-speaker situations.
― 4 min read
A new method leverages speech data to improve autism assessments.
― 6 min read
Research on modular ASR systems aims to improve performance in noisy environments.
― 4 min read
Sortformer integrates speaker diarization and ASR for improved audio processing.
― 5 min read
A new approach enhances ASR by focusing on specific speaker details.
― 5 min read
An easy-to-use tool for fine-tuning speech models without complex code.
― 6 min read
A new model helps robots follow unclear human instructions more effectively.
― 6 min read
CADA-GAN enhances ASR systems' performance across various recording environments.
― 6 min read
A new method improves speech interactions by integrating recognition and response processes.
― 5 min read
A look at the Codec-SUPERB challenge results and codec performance metrics.
― 5 min read
A project improves speech recognition for the Malasar language using Tamil resources.
― 5 min read
Mamba enhances speech recognition with speed and accuracy, reshaping interaction with devices.
― 4 min read
This project aims to standardize Bangla dialects for clearer communication.
― 6 min read
A new ASR system enhances medical speech recognition for accurate patient care.
― 6 min read
A method for efficiently tracking speakers in multilingual settings using automatic speech recognition.
― 6 min read
New model improves Chinese speech recognition accuracy significantly.
― 6 min read
Efforts to document and preserve the endangered Neo-Aramaic language.
― 6 min read