ZipNN compresses AI models efficiently, keeping essential details intact.
Moshik Hershcovitch, Andrew Wood, Leshem Choshen
― 5 min read
Smaller LLMs can assist with code generation but show significant quality issues.
Eric L. Melin, Adam J. Torek, Nasir U. Eisty
― 5 min read
A new method speeds up AI processing without losing accuracy.
Jintao Zhang, Haofeng Huang, Pengle Zhang
― 5 min read
Learn how ShiftQuant and L1 normalization improve neural network efficiency.
Wenjin Guo, Donglai Liu, Weiying Xie
― 4 min read
Keeping AI conversations safe on the go with Llama Guard.
Igor Fedorov, Kate Plawiak, Lemeng Wu
― 6 min read
Model compression techniques enable heavy models to run smoothly on smaller devices.
Jie Shao, Hanxiao Zhang, Jianxin Wu
― 6 min read
A new method to optimize large language models efficiently.
Changhai Zhou, Shiyang Zhang, Yuhua Zhou
― 7 min read
A study showcasing hybrid architecture for improving SNN performance and energy efficiency.
Ilkin Aliyev, Jesus Lopez, Tosiron Adegbija
― 5 min read
Research shows how to compress diffusion models while maintaining quality.
Samarth N Ramesh, Zhixue Zhao
― 6 min read
Learn about Anda, a new method for managing activation data in LLMs.
Chao Fang, Man Shi, Robin Geens
― 7 min read
Learn how reinforcement learning enhances machine communication and decision-making.
Evelyn Hubbard, Liam Cregg, Serdar Yüksel
― 6 min read
A look into hadrons and their interactions using lattice quantum chromodynamics.
Sebastian M. Dawid, Andrew W. Jackura, Adam P. Szczepaniak
― 4 min read
QABBA streamlines time series data analysis for clearer insights.
Erin Carson, Xinye Chen, Cheng Kang
― 6 min read
Discover how AI models can be fast and easy to understand.
Alireza Maleki, Mahsa Lavaei, Mohsen Bagheritabar
― 8 min read
Learn how lossless compression is reshaping data storage and processing.
Boyang Zhang, Daning Cheng, Yunquan Zhang
― 7 min read
Smarter AI for smaller devices through model quantization techniques.
Ahmed Luqman, Khuzemah Qazi, Imdadullah Khan
― 6 min read
Discover how B3FA attacks compromise deep neural networks with minimal knowledge.
Behnam Ghavami, Mani Sadati, Mohammad Shahidzadeh
― 7 min read
Research finds ways to reduce AI model size while maintaining accuracy.
Meyer Scetbon, James Hensman
― 5 min read
Panacea enhances DNN performance while saving energy and maintaining accuracy.
Dongyun Kam, Myeongji Yun, Sunwoo Yoo
― 6 min read
Learn how TTAQ improves AI model efficiency and adaptability.
Junrui Xiao, Zhikai Li, Lianwei Yang
― 7 min read
Discover how finetuning language models improves financial data analysis and privacy.
Dannong Wang, Daniel Kim, Bo Jin
― 6 min read
Discover how ProFe improves communication in decentralized federated learning.
Pedro Miguel Sánchez Sánchez, Enrique Tomás Martínez Beltrán, Miguel Fernández Llamas
― 7 min read
DQA offers a smart solution for efficient deep quantization in resource-limited devices.
Wenhao Hu, Paul Henderson, José Cano
― 6 min read
New methods make language models faster and more efficient for real-world tasks.
Jonathan Svirsky, Yehonathan Refael, Ofir Lindenbaum
― 6 min read
ResQ optimizes large language models, enhancing performance and reducing costs.
Utkarsh Saxena, Sayeh Sharify, Kaushik Roy
― 6 min read
Researchers aim to optimize language models to enhance efficiency and reduce costs.
Giordano d'Aloisio, Luca Traini, Federica Sarro
― 7 min read
Researchers refine large language models for better efficiency and task focus.
Jorge García-Carrasco, Alejandro Maté, Juan Trujillo
― 7 min read
Low-bit language models make AI smarter and more efficient for everyday devices.
Yeonhong Park, Jake Hyun, Hojoon Kim
― 6 min read