Latest Articles for Audio Technology

Sound Advancing AI in Text-to-Audio Generation

A study on improving audio outputs from text prompts using preference optimization.

2025-08-11T07:05:20+00:00 ― 6 min read

Sound SemantiCodec: The Next Step in Audio Technology

A new audio codec offering high-quality compression and rich semantic content.

2025-08-08T19:10:10+00:00 ― 6 min read

Sound Advancing Audio Editing with Diffusion Models

A new method improves audio editing using diffusion models for precise changes.

2025-08-06T16:09:25+00:00 ― 5 min read

Audio and Speech Processing Reducing Cross-Talk for Clearer Speech

A new system improves speech clarity in multi-speaker environments.

2025-08-02T14:10:50+00:00 ― 5 min read

Audio and Speech Processing Advancements in Speech Separation Techniques

New methods improve clarity in isolating voices from audio mixtures.

2025-07-31T04:41:25+00:00 ― 4 min read

Sound Advancements in 3D Audio Rendering with AVGS

New model improves realistic audio experiences in virtual environments.

2025-07-29T20:18:05+00:00 ― 7 min read

Audio and Speech Processing Advancing Foley Audio with the MINT Dataset

A new dataset improves the creation of foley audio for multimedia content.

2025-07-29T17:03:45+00:00 ― 6 min read

Sound Real-Time Speaker Diarization: An Overview

Learn about online speaker diarization and its significance in various applications.

2025-07-28T06:14:40+00:00 ― 6 min read

Sound Advancements in Audio Modeling with GANs

New techniques improve guitar amplifier modeling using unpaired data and GANs.

2025-07-27T22:08:50+00:00 ― 7 min read

Sound Advancing Voice Conversion with Spatial Awareness

Introducing spatial voice conversion to enhance audio realism and immersion.

2025-07-27T01:54:15+00:00 ― 6 min read

Robotics Learning with Sound: A New Era for Robots

A new system helps robots learn tasks using audio from real-life demonstrations.

2025-07-26T09:42:35+00:00 ― 7 min read

Sound New Method for Voice Creation in Speech Synthesis

A simple method to create voices and control emotions in speech synthesis.

2025-07-25T14:16:35+00:00 ― 5 min read

Audio and Speech Processing New Method for Clearer Sound in Noisy Environments

A novel approach to enhance sound clarity using advanced deep learning techniques.

2025-07-25T11:02:15+00:00 ― 7 min read

Sound Advancing Loudspeaker Technology and Sound Control

Innovative techniques improve loudspeaker design and sound direction.

2025-07-25T06:10:45+00:00 ― 4 min read

Sound Breaking Down Deepfake Audio Detection Techniques

This study focuses on improving detection of deepfake audio using advanced methods.

2025-07-25T02:56:25+00:00 ― 5 min read

Audio and Speech Processing Advancements in Audio-Visual Speech Recognition

Research highlights the role of video in improving speech recognition in noisy environments.

2025-07-22T20:41:20+00:00 ― 5 min read

Audio and Speech Processing Improving Sound Event Detection with New Techniques

Advancements in sound classification enhance audio recognition accuracy.

2025-07-22T15:01:15+00:00 ― 6 min read

Sound Advancing Audio Generation with Sound-VECaps Dataset

New dataset improves audio generation from detailed text descriptions.

2025-07-21T07:26:30+00:00 ― 4 min read

Machine Learning Boosting Small Models with Large Model Insights

A new method helps smaller models perform better using hints from larger models.

2025-07-19T14:08:45+00:00 ― 6 min read

Sound ElasticAST: A Flexible Approach to Audio Classification

ElasticAST allows processing of variable length audio efficiently without losing important details.

2025-07-18T02:31:05+00:00 ― 5 min read

Sound New Method for Detecting Partially Fake Audio

A novel approach improves detection of mixed real and fake audio clips.

2025-07-17T17:36:40+00:00 ― 6 min read

Computer Vision and Pattern Recognition Introducing the MMIS Dataset for Interior Design Research

A new dataset combining images, text, and audio for interior scene research.

2025-07-17T07:38:36+00:00 ― 4 min read

Audio and Speech Processing Advancing Audio Security with Continual Learning

CADE improves audio detection against evolving spoofing threats using continual learning techniques.

2025-07-16T10:50:30+00:00 ― 7 min read

Audio and Speech Processing Vibravox: Advancing Speech Recognition Technology

A new dataset aims to improve speech capture using body-conduction sensors.

2025-07-15T14:35:55+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speaker and Language Diarization Systems

A team improves audio processing for speaker and language identification.

2025-07-15T03:15:45+00:00 ― 4 min read

Sound Open Audio Generation: A New Model

A new text-to-audio model using only public data.

2025-07-13T11:35:10+00:00 ― 5 min read

Audio and Speech Processing Automatic EQ System Revolutionizes Music Production

A new technology simplifies equalization for audio recordings.

2025-07-11T23:08:55+00:00 ― 5 min read

Sound Advancements in Speech Bandwidth Expansion

Improving audio quality in devices through bandwidth expansion techniques.

2025-07-10T00:11:05+00:00 ― 5 min read

Sound Advancements in Audio-Visual Speech Separation Techniques

A new method improves voice separation in noisy settings with multiple speakers.

2025-07-09T16:53:50+00:00 ― 5 min read

Sound Wavespace: Changing the Game in Sound Design

Wavespace offers innovative tools for better sound creation and control.

2025-07-08T19:02:05+00:00 ― 6 min read

Sound Addressing Abusive Speech in Audio

Research focuses on identifying abusive speech in audio recordings across languages.

2025-07-08T02:50:25+00:00 ― 5 min read

Computer Vision and Pattern Recognition Generating Synchronized Audio for Silent Videos

A method to create audio that matches first-person viewpoint videos.

2025-07-07T23:36:05+00:00 ― 7 min read

Sound Advancing Detection of Lossy Audio Compression

A study on improving methods to detect lossy audio compression for better sound quality.

2025-07-07T12:15:55+00:00 ― 6 min read

Audio and Speech Processing Balancing Privacy and Utility in Conversation Analysis

Examining techniques to protect privacy while analyzing recorded conversations.

2025-07-07T04:10:05+00:00 ― 5 min read

Audio and Speech Processing Advancements in Binaural Signal Matching Techniques

Improving binaural sound reproduction for better audio experiences in various devices.

2025-07-04T07:20:30+00:00 ― 7 min read

Sound Advancements in Audio Source Separation with RQ-VAE

New machine learning model enhances audio source separation techniques.

2025-07-02T05:08:20+00:00 ― 5 min read

Sound Music2Latent: A New Tool for Audio Compression

Music2Latent simplifies audio compression while maintaining high quality for various applications.

2025-07-02T04:19:45+00:00 ― 5 min read

Sound New Method Improves Speech Clarity in Smart Glasses

A system to enhance speech clarity in noisy environments using smart glasses.

2025-07-02T02:42:35+00:00 ― 5 min read

Computation and Language Detecting Hate Speech in Audio: New Approaches

A study on identifying hate speech moments in audio using novel techniques.

2025-07-02T00:16:50+00:00 ― 5 min read

Sound PeriodWave: A New Approach to Waveform Generation

Introducing PeriodWave, a model improving audio generation speed and quality.

2025-06-30T15:53:30+00:00 ― 5 min read