Computer Science - Sound

Computation and Language Analyzing Speech to Assess Suicide Risk

Research explores how speech analysis can predict suicide risk, considering gender differences.

2025-07-26T13:45:30+00:00 ― 5 min read

Sound A New Tool for Music Visualization

This paper presents a system to create visuals that respond to music.

2025-07-26T10:31:10+00:00 ― 7 min read

Robotics Learning with Sound: A New Era for Robots

A new system helps robots learn tasks using audio from real-life demonstrations.

2025-07-26T09:42:35+00:00 ― 7 min read

Audio and Speech Processing Advancements in Sound Event Detection for 2024

New methods improve accuracy in recognizing overlapping sounds across diverse audio sources.

2025-07-26T07:16:50+00:00 ― 6 min read

Computation and Language Improving Speech Error Correction in ASR Systems

A new method combines acoustic features and confidence scores for better error correction.

2025-07-25T20:45:15+00:00 ― 5 min read

Cryptography and Security Protecting Voices in the Age of Deepfakes

SecureSpectra offers a new way to safeguard audio identity against deepfake threats.

2025-07-25T16:42:20+00:00 ― 5 min read

Machine Learning Advancements in Predicting Acoustic Scattering with PGI-DeepONet

Combining physics and geometry for improved acoustic scattering predictions.

2025-07-25T15:54:09+00:00 ― 5 min read

Computation and Language Advancements in Real-Time Speech Translation Systems

A new system for accurate and fast speech translation across multiple languages.

2025-07-25T15:05:10+00:00 ― 6 min read

Sound New Method for Voice Creation in Speech Synthesis

A simple method to create voices and control emotions in speech synthesis.

2025-07-25T14:16:35+00:00 ― 5 min read

Sound Advancements in Real-Time Music Source Separation

Improving MMDenseNet for quick and efficient music separation.

2025-07-25T12:39:25+00:00 ― 5 min read

Computation and Language Advancements in Spoken Dialogue Systems

A new method improves machine dialogue through pseudo-stereo data.

2025-07-25T08:36:30+00:00 ― 6 min read

Computation and Language Improving Chinese Speech Recognition Through Pinyin Regularization

This study presents a dataset and method to enhance Chinese ASR accuracy using Pinyin.

2025-07-25T07:47:55+00:00 ― 7 min read

Sound Advancing Loudspeaker Technology and Sound Control

Innovative techniques improve loudspeaker design and sound direction.

2025-07-25T06:10:45+00:00 ― 4 min read

Sound Breaking Down Deepfake Audio Detection Techniques

This study focuses on improving detection of deepfake audio using advanced methods.

2025-07-25T02:56:25+00:00 ― 5 min read

Sound Innovative Approaches to Music Creation with Technology

Using visual interfaces and models to enhance music generation.

2025-07-25T00:30:40+00:00 ― 5 min read

Computer Vision and Pattern Recognition Innovative Approach to Automatic Sound Effects Generation

A new framework for creating synchronized sound effects in videos.

2025-07-24T23:42:05+00:00 ― 6 min read

Sound Improving Speaker Diarization with Speaker Embeddings

A study on enhancing audio segmentation by integrating speaker embeddings.

2025-07-24T21:16:20+00:00 ― 5 min read

Sound A New Lightweight Method for Text-to-Speech Technology

This article introduces a more efficient TTS system that adapts to speakers.

2025-07-24T20:27:45+00:00 ― 5 min read

Computation and Language Innovative Techniques in Speech Recognition for Low-Resource Languages

New methods improve speech models for languages with limited data.

2025-07-24T19:39:10+00:00 ― 5 min read

Sound The Importance of Measuring Uncertainty in Speech Emotion Recognition

Understanding uncertainty boosts the accuracy of emotion recognition in real-world scenarios.

2025-07-24T17:13:25+00:00 ― 6 min read

Audio and Speech Processing Advancements in Phoneme Alignment Techniques

A new method enhances phoneme alignment accuracy for various speech applications.

2025-07-24T10:44:45+00:00 ― 5 min read

Computation and Language Nollywood's Language Challenge: Bridging Dialects

A study on translating Nigerian English for better accessibility in Nollywood films.

2025-07-24T04:16:05+00:00 ― 6 min read

Computation and Language A New Approach to Speech Representation Learning

This article presents a dual encoder system for effective speech representation learning.

2025-07-24T01:50:20+00:00 ― 6 min read

Sound Advancing Symbolic Music Processing with MelodyT5

MelodyT5 offers a new approach to music creation and analysis using symbolic notation.

2025-07-23T21:47:25+00:00 ― 6 min read

Sound Synthetic Music Dataset Aims to Improve Genre Classification

GTZAN-synth dataset leverages synthetic music for better music tagging systems.

2025-07-23T17:44:30+00:00 ― 5 min read

Audio and Speech Processing MelodyLM: The Future of Song Creation

MelodyLM simplifies music creation using text and voice inputs.

2025-07-23T16:55:55+00:00 ― 6 min read

Computer Vision and Pattern Recognition Introducing the SAVE Model for Audio-Visual Segmentation

SAVE model enhances audio-visual segmentation with efficiency and precision.

2025-07-23T16:07:20+00:00 ― 6 min read

Computation and Language Advancements in Speech-to-Text Translation with LLMs

New model improves speech-to-text translation using large language models.

2025-07-23T08:01:30+00:00 ― 6 min read

Sound New Model Estimates Mouth Movements in Speech

Research presents a model that links sound recordings to mouth movements during speech.

2025-07-23T07:12:55+00:00 ― 6 min read

Computation and Language Wav2Vec2.0 and the Sound of Speech Recognition

This article discusses how Wav2Vec2.0 processes speech sounds using phonology.

2025-07-23T05:35:45+00:00 ― 5 min read

Computation and Language Advancements in Multilingual Speaker Anonymization

Improving speaker anonymization technology for nine languages to ensure privacy.

2025-07-23T03:58:35+00:00 ― 5 min read

Quantitative Methods Digital Aquaculture: The Future of Fish Farming

Exploring technology's role in enhancing fish farming efficiency and welfare.

2025-07-23T03:15:54+00:00 ― 5 min read

Sound New Method for Early Dementia Detection via Voice Analysis

A novel approach combines voice analysis with privacy protection for dementia detection.

2025-07-22T19:04:10+00:00 ― 6 min read

Sound Advancing Automated Animal Sound Classification

New methods improve accuracy in identifying animal sounds for wildlife monitoring.

2025-07-22T18:15:35+00:00 ― 4 min read

Sound Advancements in Multi-Talker Speech Recognition

A new method improves accuracy in recognizing speech from multiple speakers.

2025-07-22T10:58:20+00:00 ― 5 min read

Sound Advancements in Speech Synthesis Using Acoustic BPE

Acoustic BPE improves speech intelligibility and quality in TTS systems.

2025-07-22T08:32:35+00:00 ― 6 min read

Sound Advancements in Speech Enhancement Technology

A new method improves speech clarity in noisy environments using dual neural networks.

2025-07-22T06:55:25+00:00 ― 5 min read

Computation and Language Advancing Speech Recognition with Accent-Specific Codebooks

New method improves ASR systems' handling of various accents through specialized codebooks.

2025-07-22T04:29:40+00:00 ― 5 min read

Computation and Language Advancements in Automatic Speech Recognition Technology

New methods improve accuracy and efficiency in speech recognition systems.

2025-07-22T03:41:05+00:00 ― 6 min read

Audio and Speech Processing Advancing Sound Source Localization with DOA-PNN

A new method improves sound localization in varied environments by focusing on continuous learning.

2025-07-22T02:03:55+00:00 ― 6 min read