Bhiksha Raj

Machine Learning Improving Deep Neural Networks with Biological Insights

A new layer enhances DNNs' resistance to subtle input changes.

2025-10-11T15:09:18+00:00 ― 6 min read

Computation and Language Advancements in Speech Summarization with BASS

BASS improves summarization of long audio by processing in blocks.

2025-10-08T15:05:55+00:00 ― 5 min read

Machine Learning Addressing Label Errors in Model Training

This article examines challenges and solutions related to noisy labels in training data.

2025-09-20T07:53:36+00:00 ― 6 min read

Audio and Speech Processing Advancements in Audio Captioning with Text-Only Training

A new method trains audio captioning systems using only text descriptions.

2025-09-13T02:56:55+00:00 ― 6 min read

Machine Learning Advancements in Weakly Supervised Learning Techniques

A new framework improves learning from incomplete data labels.

2025-09-12T00:01:24+00:00 ― 6 min read

Robotics Testing Robots for Unexpected Challenges

Exploring methods to improve robot performance in unpredictable environments.

2025-09-09T02:53:54+00:00 ― 4 min read

Machine Learning Improving Weak Label Learning Through Negative Example Selection

New strategies enhance weak label learning by selecting relevant negative examples.

2025-09-06T04:57:20+00:00 ― 6 min read

Machine Learning The Challenges of Noisy Model Learning

Examining how noise in pre-training data impacts model performance.

2025-08-30T14:35:18+00:00 ― 6 min read

Audio and Speech Processing A New Approach to Audio Quality Assessment with PAM

PAM offers a novel way to measure audio quality without needing reference recordings.

2025-08-26T21:10:50+00:00 ― 6 min read

Audio and Speech Processing Evaluating Voice Recognition in Noisy Environments

A new benchmark assesses voice recognition systems' performance amidst various disturbances.

2025-08-19T14:16:50+00:00 ― 5 min read

Computer Vision and Pattern Recognition The Benefits of Slight Corruption in Diffusion Models

Investigating how small errors in training data enhance AI-generated content.

2025-08-04T09:29:06+00:00 ― 5 min read

Computer Vision and Pattern Recognition Assessing SLAM Models in Noisy Environments

New framework evaluates SLAM performance under challenging conditions.

2025-07-25T00:06:30+00:00 ― 7 min read

Computation and Language Innovative Techniques in Speech Recognition for Low-Resource Languages

New methods improve speech models for languages with limited data.

2025-07-24T19:39:10+00:00 ― 5 min read

Sound Advancements in Speech Emotion Recognition Technology

New methods improve machine understanding of human emotions in speech.

2025-07-12T18:34:55+00:00 ― 4 min read

Sound Evaluating Reasoning in Audio-Language Models

This study assesses the reasoning skills of audio-language models with a new task.

2025-07-10T09:54:05+00:00 ― 7 min read

Computation and Language The Impact of Annotation Methods on Speech Summarization

This study examines how different summarization methods affect quality and content.

2025-07-02T05:56:55+00:00 ― 5 min read

Sound Improving Speaker Verification with Phonetic Features

A new framework enhances voice identity confirmation accuracy.

2025-06-15T01:50:18+00:00 ― 5 min read

Sound Improving Speech Recognition with Human-Inspired Features

New acoustic features enhance ASR systems' performance in noisy environments.

2025-06-03T14:29:40+00:00 ― 4 min read

Audio and Speech Processing Advancements in Neural Codecs with ESPnet-Codec

ESPnet-Codec enhances training and evaluation of neural codecs for audio and speech.

2025-06-03T03:09:30+00:00 ― 7 min read

Sound Revolutionizing Audio Captioning with MACE

MACE improves audio captioning by linking sounds to accurate text descriptions.

2025-05-28T17:47:08+00:00 ― 5 min read

Machine Learning Understanding Graphs: From Nodes to Knowledge

Explore how POGAT enhances the analysis of complex graph structures.

2025-05-04T12:20:00+00:00 ― 6 min read

Computer Vision and Pattern Recognition SoftVQ-VAE: Transforming Image Generation

Discover how SoftVQ-VAE enhances image creation with efficiency and quality.

2025-03-08T21:22:03+00:00 ― 6 min read