A new framework enhances learning from pre-trained models without original data.
― 6 min read
New dataset improves model performance on multi-image tasks.
― 5 min read
This method enhances language model fine-tuning using open, unlabeled datasets.
― 6 min read
A closer look at self-attention mechanisms in language processing models.
― 7 min read
Exploring reasons behind accuracy issues in synthetic data training and potential improvements.
― 6 min read
A method to improve model learning despite errors in data labels.
― 6 min read
A new method speeds up training of complex models.
― 6 min read
XDomainMix improves model performance by enhancing feature diversity in domain generalization.
― 9 min read
New method improves neural networks' performance against adversarial attacks.
― 9 min read
EchoAlign modifies data features to align with noisy labels, improving machine learning performance.
― 6 min read
This paper examines the use of TD learning in transformers for in-context learning.
― 7 min read
Learn how to adjust weight decay for better model performance in AdamW.
― 7 min read
New language models show promise in understanding and generating human language.
― 5 min read
Weak models can help strong AI models learn more effectively.
― 6 min read
Dynamic datasets enhance model learning and reduce resource needs.
― 6 min read
A new method, smup, improves efficiency in training sparse neural networks.
― 5 min read
Exploring the use of LLMs for enhancing low-level vision tasks like denoising and deblurring.
― 6 min read
This research focuses on generating pseudo-programs to enhance reasoning tasks in models.
― 5 min read
Exploring Task Groupings Regularization to manage model heterogeneity.
― 5 min read
A new method reduces time and cost in training diffusion models.
― 7 min read
FedHPL enhances federated learning efficiency while ensuring data privacy across devices.
― 5 min read
A new method enables the transfer of LoRA modules with synthetic data, minimizing reliance on original data.
― 6 min read
A new method improves model performance using data with noisy labels.
― 6 min read
Exploring efficient training methods for large machine learning models.
― 6 min read
Analyzing how LoRA affects knowledge retention in pretrained models during continual learning.
― 7 min read
A new model concept shows how to test AI capabilities effectively.
― 7 min read
Examining the effects of outlier features on neural network training.
― 5 min read
This article details an innovative approach to improve language models using smaller models.
― 7 min read
This article discusses Domain-Inspired Sharpness-Aware Minimization for better model adaptation.
― 4 min read
A new method aims to address bias in language model outputs.
― 7 min read
A new method improves reward models using synthetic critiques for better alignment.
― 11 min read
Analyzing how AI learns from data reveals significant gaps in logic and reasoning.
― 6 min read
Skywork-MoE improves language processing with efficient techniques and innovative architecture.
― 6 min read
Introducing PART, a method to boost machine learning models' accuracy and robustness.
― 5 min read
DEFT enhances diffusion models for effective conditional sampling with minimal resources.
― 6 min read
This study examines how LLMs handle reasoning in abstract and contextual scenarios.
― 5 min read
A new method enhances privacy protection while training deep learning models.
― 5 min read
This article presents a new approach to improving language model training efficiency.
― 4 min read
Introducing a universal framework for sharpness measures in machine learning.
― 5 min read
A new method sheds light on how language models remember training data.
― 8 min read