Latest Articles for Fine-tuning

High Energy Physics - Phenomenology Investigating Neutrino Mass with Trimaximal Mixing

This article explores neutrino mass through minor zeros in the mass matrix.

2025-09-18T15:52:48+00:00 ― 5 min read

Machine Learning New Method for Private Fine-Tuning of Language Models

DP-ZO balances privacy and performance in language model training.

2025-09-18T12:02:54+00:00 ― 5 min read

Computer Vision and Pattern Recognition Improving Model Adaptation with Targeted Augmentations

A new framework enhances model performance on unseen data using targeted changes.

2025-09-18T07:10:36+00:00 ― 6 min read

Computer Vision and Pattern Recognition Efficient Image Editing with EGAN Framework

New methods improve image editing speed and quality using smaller models.

2025-09-17T14:11:30+00:00 ― 5 min read

High Energy Physics - Phenomenology Addressing the Hierarchy Problem with Composite Higgs Models

New models explore stability of the weak scale in high energy physics.

2025-09-17T08:13:06+00:00 ― 5 min read

Software Engineering Addressing Inter-dataset Code Duplication in Model Evaluation

Examining the effects of inter-dataset code duplication on model performance metrics.

2025-09-17T01:33:06+00:00 ― 7 min read

Machine Learning Fine-Tuning Pruned Neural Networks with Stochastic Subnetwork Annealing

A new method that improves pruned neural networks for better performance.

2025-09-16T19:13:54+00:00 ― 7 min read

Computer Vision and Pattern Recognition AI's Role in Mapping Permafrost Features

AI tools like SAM are reshaping how we map permafrost and understand climate change.

2025-09-16T18:58:06+00:00 ― 7 min read

Biological Physics Understanding Criticality in Biological Systems

An overview of intrinsic and extrinsic criticality in biological systems.

2025-09-16T15:45:00+00:00 ― 6 min read

Computation and Language Advancements in Mathematical Reasoning for LLMs

This study enhances how language models handle math reasoning tasks.

2025-09-16T14:37:24+00:00 ― 5 min read

Computation and Language Improving Question Answering with Limited Data

Strategies to enhance QA models when labeled data is scarce.

2025-09-16T08:41:54+00:00 ― 6 min read

Machine Learning A New Method for Fine-Tuning Foundation Models

AutoFT improves model performance on unseen data through innovative fine-tuning techniques.

2025-09-16T04:29:06+00:00 ― 6 min read

Machine Learning Speeding Up Large Language Models with Extra Heads

A new method speeds up LLM text generation using additional prediction heads.

2025-09-15T18:05:00+00:00 ― 4 min read

Computer Vision and Pattern Recognition Advancements in Eye Disease Detection Using AI

A new AI framework improves eye disease detection through enhanced imaging techniques.

2025-09-15T09:23:36+00:00 ― 5 min read

Computation and Language New Method for Efficient Language Model Training

A method improves efficiency in training and using large language models.

2025-09-15T04:55:00+00:00 ― 7 min read

Bioinformatics Improving Rare Disease Diagnosis through Standardized Vocabulary

Research shows promise in using fine-tuned models for better rare disease understanding.

2025-09-14T15:04:42+00:00 ― 7 min read

Cryptography and Security Addressing Multilingual Jailbreak Attacks on Language Models

Study reveals risks of multilingual jailbreak attacks on large language models.

2025-09-13T03:56:12+00:00 ― 5 min read

Software Engineering Automating Code Reviews with GPT-3.5: A Study

This article explores methods for using GPT-3.5 to automate code reviews effectively.

2025-09-12T19:30:36+00:00 ― 5 min read

Machine Learning Challenges and Strategies for Large Language Models

Analyzing the cost and efficiency of large language models in various tasks.

2025-09-12T16:52:36+00:00 ― 6 min read

Computation and Language Evaluating Language Models: In-Topic vs Cross-Topic Performance

This study analyzes how language models handle familiar and unfamiliar topics.

2025-09-12T01:52:00+00:00 ― 6 min read

Machine Learning Large Language Models in Time Series Analysis

Exploring the use of LLMs to analyze time series data across various fields.

2025-09-11T05:03:48+00:00 ― 8 min read

Machine Learning Decoding-Time Realignment: A New Approach to Language Model Training

DeRa offers a method to adjust language model alignment without retraining.

2025-09-11T02:33:42+00:00 ― 5 min read

Computation and Language Identifying Winning Tickets in Multilingual Language Models

A method for fine-tuning language models using fewer parameters.

2025-09-10T23:08:18+00:00 ― 6 min read

Machine Learning How Noise Affects Language Model Training

This article examines the impact of noise on language model performance.

2025-09-10T17:52:18+00:00 ― 7 min read

Machine Learning Advancements in Quantization Techniques for Machine Learning Models

Learn how new techniques improve the efficiency of large machine learning models.

2025-09-10T13:31:36+00:00 ― 4 min read

Machine Learning Improving Explainability in Machine Learning Models

New methods enhance the clarity of machine learning predictions.

2025-09-10T10:35:28+00:00 ― 7 min read

Machine Learning Improving Confidence in Vision-Language Models

New method enhances reliability of model predictions in real-world applications.

2025-09-10T01:09:00+00:00 ― 6 min read

Computation and Language Addressing Cultural Bias in Language Models

A new approach to integrate diverse cultural insights into language models.

2025-09-10T00:45:18+00:00 ― 7 min read

Human-Computer Interaction Innovative Idea Generation with AI Assistance

A new method to spark creativity in problem-solving through AI support.

2025-09-09T22:23:06+00:00 ― 8 min read

Artificial Intelligence Challenges in Developing Effective AI Agents

Exploring issues in creating decision-making AI models and solutions.

2025-09-09T21:19:54+00:00 ― 5 min read

Audio and Speech Processing Introducing AV-SUPERB: A New Benchmark for Audio-Visual Models

AV-SUPERB evaluates audio and visual models across various tasks for better performance.

2025-09-08T22:32:35+00:00 ― 5 min read

Information Retrieval Advances in Long Document Retrieval Models

New tools improve how systems retrieve information from long documents.

2025-09-08T20:26:48+00:00 ― 4 min read

Computation and Language The Role of Language Models in Hiring Decisions

Exploring how language models reflect personality traits in recruitment.

2025-09-08T12:17:00+00:00 ― 7 min read

Machine Learning Improving Neural Network Fine-Tuning with Active Learning

This study enhances fine-tuning efficiency in neural networks using transductive active learning.

2025-09-08T10:42:12+00:00 ― 7 min read

Audio and Speech Processing Improving Whisper for Low-Resource Languages

Enhancing Whisper's speech recognition for Vietnamese and other low-resource languages.

2025-09-08T03:55:10+00:00 ― 4 min read

Computation and Language Adapting Language Models Without Costly Data

A new method for adapting LLMs without extensive labeling.

2025-09-07T13:22:24+00:00 ― 8 min read

Machine Learning Adapting Language Models to User Feedback

This article discusses a method to improve LLMs using verbal feedback without overgeneralization.

2025-09-07T11:16:00+00:00 ― 10 min read

Computation and Language LoRETTA: A New Method for Fine-Tuning Language Models

LoRETTA improves fine-tuning efficiency for large language models with fewer parameters.

2025-09-07T03:29:54+00:00 ― 5 min read

Machine Learning Understanding Indiscriminate Data Poisoning Attacks in Machine Learning

Exploring the threats posed by indiscriminate data poisoning in self-supervised learning.

2025-09-06T18:01:06+00:00 ― 7 min read

Computation and Language Risks of Data Exposure in Language Models

Examining how fine-tuning increases the risk of revealing sensitive training data.

2025-09-06T12:37:12+00:00 ― 6 min read