Cutting-edge science explained simply
A look at RTRL's potential and obstacles in machine learning.
― 6 min read
Study reveals how deep networks excel despite noise in training data.
― 6 min read
A look at how benign overfitting can benefit machine learning models.
― 5 min read
A review of smaller Vision Transformers suitable for mobile applications.
― 5 min read
Examining the effectiveness and challenges of unlearnable datasets in protecting private information.
― 5 min read
A look into the mechanics and applications of spiking neural networks.
― 6 min read
Weight normalization improves neural network training and performance, even with larger weights.
― 5 min read
Aligned-MTL addresses challenges in multi-task learning for better performance.
― 4 min read
A study on how CoT improves learning in multilayer perceptrons.
― 8 min read
A novel approach to improve neural network training through quantized optimization.
― 5 min read
Examining how transformers learn to understand language hierarchies through extended training.
― 5 min read
This study introduces innovative metrics to evaluate RNNs and transformers without training.
― 7 min read
Exploring the effectiveness of evolutionary strategies in finding sparse network initializations.
― 4 min read
A new method leveraging graphs to identify adversarial attacks on neural networks.
― 6 min read
A new method enhances how neural networks explain their decisions.
― 5 min read
A new method enhances generalization of sequence models across varying lengths.
― 6 min read
BT-Cell enhances recursive neural networks for improved language understanding.
― 5 min read
This article examines how deep networks split into an extractor and a tunnel.
― 6 min read
Exploring the potential and challenges of spiking neural networks in computing.
― 5 min read
LLMatic combines large language models and quality-diversity strategies for efficient neural architecture search.
― 6 min read
Examining how gradient descent favors simpler solutions in deep learning models.
― 6 min read
A new system improves image quality by merging event camera data with blurry images.
― 5 min read
Exploring various generative models and their unifying framework.
― 5 min read
Cone attention better captures hierarchical relationships in data.
― 8 min read
Examining OODF and its impact on continual learning in artificial intelligence.
― 6 min read
Examining the role of frequency and compositionality in subword tokenization methods.
― 6 min read
A new approach enhances generative modeling efficiency and flexibility.
― 7 min read
A new approach to improve transformer training efficiency using information pathways.
― 7 min read
A method combining symbolic reasoning and neural networks for better decision making.
― 5 min read
New techniques enhance image matching in medical analysis and diagnostics.
― 4 min read
A study on how scaling and complexity affect neural network performance.
― 5 min read
This article explores how transformers memorize data through multi-head attention.
― 5 min read
Yoked Neural Networks improve information sharing and processing in neural systems.
― 5 min read
A new method enhances 3D modeling from sparse and noisy inputs using depth images.
― 7 min read
This study explores training Ising machines for AI tasks using a novel method.
― 9 min read
A new GNN architecture improves attention mechanisms for better performance in deep layers.
― 5 min read
Exploring feedback alignment as an alternative to traditional backpropagation in neural networks.
― 5 min read
Exploring why SGD generalizes better than traditional optimization methods.
― 6 min read
A new method improves how models handle unexpected data.
― 6 min read
A new method leverages symmetry in data for better learning outcomes.
― 6 min read