A framework for safer data processing in machine learning.
― 6 min read
Cutting edge science explained simply
This article discusses effective gradient estimators for quantization-aware training in deep learning.
― 6 min read
Explore methods to enhance efficiency and security of deep neural networks.
― 5 min read
Learn effective methods to quantize LLMs while maintaining accuracy and performance.
― 7 min read
This study investigates memory efficiency in large language models through low-rank decomposition.
― 5 min read
Combining SmoothQuant and GPTQ improves efficiency and performance of large language models.
― 6 min read
Examining the vulnerability of DNNs to adversarial examples and its implications.
― 5 min read
A new method for compressing CNNs while maintaining accuracy for efficient image processing.
― 7 min read
PV-Tuning improves fine-tuning and compression for large language models.
― 6 min read
New methods improve model performance during quantization in language models.
― 6 min read
New techniques enable training large neural networks on consumer-grade hardware with reduced memory.
― 8 min read
Examining the dangers of quantized language models and their potential misuse.
― 5 min read
Learn how HGQ optimizes deep learning models for speed and accuracy.
― 6 min read
A new method for running Diffusion Transformers more effectively on smaller devices.
― 6 min read
Research on optimizing deep learning models with sparsity and quantization techniques.
― 6 min read
Examining the impact of calibration set quality on LLM performance post-quantization.
― 7 min read
A new method improves data privacy for discrete data analysis.
― 6 min read
Deep spiking neural networks (DSNNs) process information like biological neurons, offering improved efficiency for data handling.
― 5 min read
A method to enhance decision-making in reinforcement learning using representation learning.
― 6 min read
A new method improves image and video generation speed and quality.
― 6 min read
Research on quantization techniques for efficient data transmission in noisy channels.
― 5 min read
Introducing a method to fine-tune LLMs on low-resource devices.
― 5 min read
Tender offers a novel solution for running large language models efficiently.
― 6 min read
Explore methods for compressing images while saving energy without sacrificing quality.
― 6 min read
A study on the decision-making processes of large language models.
― 4 min read
A new approach to machine translation evaluation metrics for better accessibility.
― 5 min read
QuEE combines quantization and early exiting for efficient machine learning.
― 6 min read
This article presents a method to protect personal data in machine learning systems.
― 8 min read
BrightFit enhances course suggestions through a new two-stage retrieval approach.
― 6 min read
Evaluating methods to enhance long context performance in language models.
― 7 min read
Advancements in predicting speech quality using efficient methods for mobile devices.
― 5 min read
A method to convert continuous data into a simpler, discrete form.
― 7 min read
Combining pruning and quantization makes DNNs more efficient for deployment on smaller devices.
― 6 min read
Examining quantization techniques for better distributed learning across various network structures.
― 7 min read
This article explores the benefits of using FP8 in federated learning.
― 5 min read
Studying how quantization affects performance in different languages.
― 5 min read
GPTQT enhances efficiency and performance in large language model quantization, making AI more accessible.
― 5 min read
This paper presents a method to identify and manage harmful memes effectively.
― 5 min read
ISQuant offers a new approach to quantization for efficient model deployment.
― 5 min read
Evaluating quantization and pruning to optimize DRL models for limited resources.
― 5 min read