An innovative approach to compressing advanced models efficiently without losing performance.
― 6 min read
Learn how new methods enhance weight-ensembling in machine learning.
― 5 min read
RoLoRA enhances federated learning with robust fine-tuning and efficient communication.
― 5 min read
This article discusses the benefits of simplifying transformer models for speech tasks.
― 4 min read
RPP improves fitting and generalization in Vision-Language Models using refined prompts.
― 7 min read
A new method enhances model performance while ensuring privacy in deep learning.
― 7 min read
ETAGE improves model performance when testing on new types of data.
― 5 min read
Examining how flexibility in models enhances predictive accuracy through dynamic adjustments.
― 7 min read
A new technique cuts memory needs for large language models while preserving performance.
― 5 min read
Improving model efficiency in remote sensing through knowledge distillation techniques.
― 6 min read
A new method speeds up diffusion models while maintaining image quality.
― 6 min read
A new method enhances Flash Attention performance for sparse attention masks.
― 5 min read
A look at adapting large pre-trained models with small, lightweight updates.
― 5 min read
This framework enhances model performance by addressing low-quality augmented data.
― 6 min read
New methods optimize large language model quantization, enhancing efficiency and accuracy.
― 6 min read
A novel approach to addressing memory issues in machine learning.
― 5 min read
This study analyzes how well Transformers can memorize data in various contexts.
― 10 min read
A new method enhances model efficiency while reducing model size.
― 5 min read
A framework that merges different types of knowledge to improve model performance.
― 5 min read
A new method to speed up diffusion model output without losing quality.
― 7 min read
LinChain offers a fresh way to fine-tune large language models efficiently.
― 6 min read
Learn how CleaR enhances AI performance by filtering noisy data.
― 8 min read
A new method improves model efficiency while maintaining performance.
― 6 min read
New strategies enhance sparse autoencoders' efficiency and effectiveness in learning features.
― 5 min read
Discover how PolyCom affects neural network performance.
― 6 min read
A closer look at how causal attention shapes AI language models.
― 7 min read
Discover methods to shrink neural networks for smaller devices without losing performance.
― 6 min read
Exploring activation sparsity to improve language model efficiency.
― 5 min read
Model compression techniques let large models run smoothly on smaller devices.
― 6 min read
Understanding Mamba's efficiency and the ProDiaL method for fine-tuning.
― 6 min read
Learn how layer pruning enhances model efficiency and performance.
― 5 min read
Research shows how to compress diffusion models while maintaining quality.
― 6 min read
Discover how Task Switch and Auto-Switch optimize multi-tasking in AI models.
― 6 min read
New methods improve model merging while reducing task interference.
― 6 min read
Transform discarded models into powerful new solutions through model merging.
― 7 min read
Smarter AI for smaller devices through model quantization techniques.
― 6 min read
Learn how lightweight AI models retain knowledge efficiently.
― 6 min read
Innovative pruning techniques make AI models more efficient and effective.
― 7 min read
Learn how Mixture-of-Experts enhances retrieval models for better performance.
― 5 min read
A new method called SHIP makes AI image tasks more efficient.
― 6 min read