Latest Articles for Quantization

Computation and Language Assessing Safety in Compressed Language Models

A look into the safety concerns of compressed language models.

2025-07-18T03:31:30+00:00 ― 6 min read

Artificial Intelligence Enhancing Reliability in Binary Neural Networks

New method improves the performance of Binary Neural Networks under faults.

2025-07-18T03:23:36+00:00 ― 4 min read

High Energy Physics - Theory Carrollian Conformal Scalar Theories in Physics

Study of Carrollian symmetries and their implications in modern physics.

2025-07-17T20:32:57+00:00 ― 6 min read

High Energy Physics - Theory Investigating Quantum Aspects of ModMax Electrodynamics

Research focuses on the quantum behavior of ModMax, a modified electrodynamics model.

2025-07-16T21:07:48+00:00 ― 7 min read

Computation and Language RoLoRA: Improving Fine-Tuning for Large Language Models

A new method enhancing model performance through effective outlier management.

2025-07-16T02:24:48+00:00 ― 6 min read

Image and Video Processing Advancements in Image Compression Techniques

New deep learning methods improve image compression efficiency and quality.

2025-07-15T09:44:25+00:00 ― 5 min read

Quantum Physics Improving Qubit State Readings with Neural Networks

This study enhances qubit measurements using machine learning and FPGA technology.

2025-07-13T15:32:45+00:00 ― 7 min read

Machine Learning On-Device Training for Smart Devices

Training DNNs on microcontrollers boosts efficiency and privacy in smart technology.

2025-07-13T05:41:00+00:00 ― 6 min read

Symplectic Geometry Insights into Regular Lagrangians

An overview of regular Lagrangians and their role in mathematics and physics.

2025-07-13T00:42:08+00:00 ― 5 min read

Machine Learning Enhancing Efficiency in Transformer Training Through Quantization

This article examines how quantization can improve Transformer language model training efficiency.

2025-07-12T10:43:24+00:00 ― 5 min read

Hardware Architecture Optimizing AI Models on Microcontrollers with MCU-MixQ

MCU-MixQ enhances AI model performance on microcontrollers by optimizing resource use.

2025-07-12T02:17:48+00:00 ― 5 min read

Computation and Language Enhancing Sentiment Analysis with Local LLMs

Study reveals improved sentiment analysis through local LLMs and majority voting.

2025-07-12T00:58:48+00:00 ― 10 min read

Machine Learning Advancing Recurrent Neural Networks for Efficient Use

Techniques for optimizing RNNs, focusing on Mamba and quantization challenges.

2025-07-11T13:15:42+00:00 ― 6 min read

Machine Learning The Rise of Specialized Language Models in Medicine

Smaller models tailored for specific fields, like medicine, show great potential.

2025-07-10T01:58:30+00:00 ― 6 min read

Computer Vision and Pattern Recognition Improving Deep Learning Efficiency with Structured Ternary Patterns

New method enhances deep learning models for limited-resource devices.

2025-07-09T19:55:06+00:00 ― 5 min read

Sound MIDI Music Generation: Current Challenges and Future Directions

An overview of MIDI music creation and its expressive potential.

2025-07-07T00:55:45+00:00 ― 5 min read

Sound Optimizing Speaker Diarization for Faster Results

Methods to speed up speaker diarization without sacrificing accuracy.

2025-07-05T00:20:45+00:00 ― 6 min read

Hardware Architecture Making Large Language Models Work on Smaller Devices

New methods aim to run powerful models on limited hardware efficiently.

2025-07-03T16:42:54+00:00 ― 4 min read

Machine Learning The Efficiency of Low Precision in Deep Learning

Reducing model size and improving efficiency with lower precision formats.

2025-07-02T07:00:30+00:00 ― 5 min read

Computation and Language Efficient Techniques for Large Language Models

Learn methods to optimize large language models for better performance and efficiency.

2025-07-01T10:51:48+00:00 ― 7 min read

Computation and Language Addressing E-Commerce Challenges with LLMs

Utilizing LLMs to enhance e-commerce tasks through instruction tuning and quantization.

2025-07-01T08:37:30+00:00 ― 5 min read

Information Theory Capacity Analysis of 1-Bit MIMO Fading Channels

Examining how antenna numbers influence 1-bit MIMO communication performance.

2025-07-01T08:37:15+00:00 ― 6 min read

Machine Learning Optimizing Deep Learning Models for Limited Resources

Combining hardware-aware neural architecture search (HW-NAS) and ant colony optimization (ACO) for efficient neural networks.

2025-07-01T04:40:30+00:00 ― 6 min read

Machine Learning Improving Efficiency in Large Language Models

Exploring techniques to enhance LLM performance during inference.

2025-07-01T04:32:36+00:00 ― 5 min read

Computer Vision and Pattern Recognition Improving Efficiency in Multimodal Model Training

A new method enhances the efficiency and performance of multimodal large language models.

2025-06-30T21:33:54+00:00 ― 5 min read

Computer Vision and Pattern Recognition Optimizing Vision Transformers for Mobile Devices

Learn how PQV-Mobile enhances ViTs for efficient mobile applications.

2025-06-27T14:41:48+00:00 ― 5 min read

High Energy Physics - Theory Insights into String Theory and Its Frameworks

A look into the principles and challenges of string theory.

2025-06-26T22:16:48+00:00 ― 4 min read

General Relativity and Quantum Cosmology New Insights into Black Holes Using Loop Quantum Gravity

Research offers fresh views on black holes through a new quantization scheme.

2025-06-26T08:48:15+00:00 ― 6 min read

Distributed, Parallel, and Cluster Computing HoSZp: A New Era in Scientific Data Compression

HoSZp allows efficient computations on compressed scientific data, improving analysis workflows.

2025-06-24T15:43:42+00:00 ― 6 min read

Computation and Language The Future of On-Device Language Models

Learn how language models on devices improve speed and privacy.

2025-06-22T16:03:54+00:00 ― 7 min read

Computation and Language Improving Language Models for Mobile Devices

A new method makes using large language models on mobile devices more efficient.

2025-06-22T14:21:12+00:00 ― 10 min read

Computer Vision and Pattern Recognition Advancements in Zero-Shot Quantization for Infrared Imaging

This article explores zero-shot quantization and its applications in infrared imaging.

2025-06-22T13:49:36+00:00 ― 5 min read

Information Theory Advancements in LDPC Code Decoding Techniques

New strategies simplify decoding of LDPC codes for faster communications.

2025-06-21T14:25:07+00:00 ― 5 min read

Machine Learning Challenges of LLaMA3-70B with 8-bit Quantization

LLaMA3-70B faces unique issues under 8-bit quantization that degrade its performance.

2025-06-21T13:51:48+00:00 ― 3 min read

Machine Learning New Techniques in Fine-Tuning Language Models

Discover efficient methods for fine-tuning large language models using Gaussian noise.

2025-06-21T13:43:54+00:00 ― 5 min read

Signal Processing Advancements in Blood Pressure Monitoring Using Wearable Tech

New methods enable non-invasive blood pressure monitoring through wearable devices.

2025-06-20T03:15:30+00:00 ― 5 min read

Machine Learning Revolutionizing Model Compression with Hyper-Compression Techniques

An innovative approach to compress advanced models efficiently without losing performance.

2025-06-19T08:48:06+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancements in Text-to-Image Compression Techniques

New methods improve image generation efficiency on limited devices.

2025-06-19T06:57:30+00:00 ― 4 min read

Computational Physics The Stern-Gerlach Experiment: A Snapshot of Quantum Mechanics

Exploring key concepts and implications of the Stern-Gerlach experiment in quantum physics.

2025-06-19T03:03:27+00:00 ― 4 min read

Signal Processing Innovative Approach to Quantization Errors

A new method adapts to input signals, improving quantization accuracy.

2025-06-17T21:00:25+00:00 ― 5 min read