WGQA enhances the efficiency of language models while reducing memory needs.
― 5 min read
LIAR offers a new way to prune models without retraining, enhancing efficiency and performance.
― 6 min read
A new framework improves knowledge distillation by focusing on hard samples.
― 7 min read
DDK enhances knowledge distillation, making smaller language models more efficient.
― 5 min read
SINDER enhances Vision Transformers by addressing image analysis defects.
― 6 min read
A new framework enhances diffusion models' efficiency while preserving image quality.
― 5 min read
A new method improves accuracy in quantizing Vision Transformers without original data.
― 5 min read
MoFO helps large language models retain knowledge during fine-tuning without losing performance.
― 5 min read
A look into how diffusion models generate data and their practical uses.
― 5 min read
A new method enhances architecture search for deep learning models.
― 6 min read
A new method enhances sparse language model training while minimizing performance loss.
― 7 min read
A new method improves multi-task learning in language models by sharing knowledge.
― 6 min read
A new framework called CoRa improves model performance during low-bit quantization.
― 5 min read
Learn methods to optimize large language models for better performance and efficiency.
― 7 min read
Eigen Attention improves memory efficiency for large language models processing long texts.
― 6 min read
Research reveals how to make speech models smaller and more efficient.
― 5 min read
A new method improves performance of Vision Transformers through effective token compression.
― 6 min read
Learn how PQV-Mobile enhances ViTs for efficient mobile applications.
― 5 min read
BAM enhances MoE efficiency by integrating attention and FFN parameters.
― 4 min read
Techniques to reduce model size for effective deployment in limited-resource environments.
― 7 min read
A new technique enhances the efficiency of pre-trained language models.
― 6 min read
Using Transformers to enhance State-Space Models for more efficient NLP.
― 6 min read
Discover strategies to enhance few-shot learning in large vision language models.
― 5 min read
A new approach to merge machine learning models based on user preferences for better outcomes.
― 6 min read
A method using pruning and distillation to shrink language models without sacrificing effectiveness.
― 4 min read
A new approach to enhance decision tree models in reinforcement learning.
― 7 min read
Introducing FISTAPruner, a method to prune language models efficiently while keeping performance high.
― 6 min read
This article explores a new method for better merging of machine learning models.
― 4 min read
LLaMA3-70B faces unique 8-bit quantization issues that affect its performance.
― 3 min read
Combine trained models to improve performance and reduce costs.
― 5 min read
An innovative approach to compress advanced models efficiently without losing performance.
― 6 min read
Learn how new methods enhance weight-ensembling in machine learning.
― 5 min read
RoLoRA enhances federated learning with robust fine-tuning and efficient communication.
― 5 min read
This article discusses the benefits of simplifying transformer models for speech tasks.
― 4 min read
RPP improves fitting and generalization in Vision-Language Models using refined prompts.
― 7 min read
A new method enhances model performance while ensuring privacy in deep learning.
― 7 min read
ETAGE improves model performance when tested on new types of data.
― 5 min read
Examining how flexibility in models enhances predictive accuracy through dynamic adjustments.
― 7 min read
A new technique cuts memory needs for large language models while keeping performance.
― 5 min read
Improving model efficiency in remote sensing through knowledge distillation techniques.
― 6 min read