Latest Articles for Fine-tuning

Materials Science Machine Learning and Potential Energy Surfaces in Materials Science

Exploring the role of machine learning in predicting material behaviors and challenges faced.

2025-08-01T12:29:27+00:00 ― 5 min read

Machine Learning Training Agents in Complex 3D Environments

A study on aligning agents in 3D games to improve behavior.

2025-08-01T10:54:42+00:00 ― 6 min read

Machine Learning Optimizing Text Embeddings with Efficient Training

Learn how to train models for text embeddings wisely and effectively.

2025-08-01T10:38:54+00:00 ― 5 min read

Computation and Language Advancements in Medical Language Models with UltraMedical Datasets

UltraMedical collections improve medical language models and address data shortages.

2025-08-01T07:05:36+00:00 ― 6 min read

Machine Learning Advancing Tabular Data Classification with LoCalPFN

Discover how LoCalPFN improves transformer performance on tabular data.

2025-08-01T00:46:24+00:00 ― 5 min read

Computation and Language Efficient Fine-Tuning Methods for Multimodal Models

Study reveals effective techniques to enhance multimodal large language models.

2025-08-01T00:14:48+00:00 ― 6 min read

Computer Vision and Pattern Recognition Evaluating Lightweight Backbones for Image Classification

A study on the effectiveness of various lightweight models in image classification.

2025-07-31T17:08:12+00:00 ― 7 min read

Computer Vision and Pattern Recognition Improving Vision-Language Models with Generated Datasets

This study explores methods to enhance vision-language models using generated images.

2025-07-31T14:38:06+00:00 ― 5 min read

Computation and Language Improving Language Models for Better Conversations

This article reviews methods to enhance dialogue generation in language models.

2025-07-31T00:09:06+00:00 ― 5 min read

Computation and Language Evaluating Safety in Fine-Tuning Large Language Models

Examining the risks and safety measures in fine-tuning language models.

2025-07-30T05:03:36+00:00 ― 5 min read

Computation and Language Assessing Large Language Models in Programming by Example Tasks

A look into how LLMs tackle programming by example challenges.

2025-07-29T21:25:24+00:00 ― 5 min read

Machine Learning Advancements in Tabular Data Classification with ICL-Transformers

A new approach to classifying tabular data using ICL-transformers shows promising results.

2025-07-29T04:32:32+00:00 ― 5 min read

Computation and Language The Challenge of Faithful Reasoning in LLMs

Examining the effectiveness of reasoning in large language models.

2025-07-28T12:30:24+00:00 ― 7 min read

Computation and Language The Geometry of Latent Space in Transformer Models

Investigating how latent space affects transformer model performance on language tasks.

2025-07-28T01:03:06+00:00 ― 7 min read

Computation and Language The Rise of Synthetic News and Detection Challenges

Examining the impact of synthetic news content and detection difficulties.

2025-07-28T00:23:36+00:00 ― 6 min read

Machine Learning Memorization Risks in Reinforcement Learning with Human Feedback

Examining memorization in code completion models and its privacy implications.

2025-07-27T19:07:36+00:00 ― 7 min read

Computation and Language Enhancing Planning Skills in Language Models

This article examines ways to improve planning abilities in large language models.

2025-07-27T08:35:36+00:00 ― 7 min read

Computation and Language Assessing Knowledge in Language Models Without Generated Responses

A method to evaluate model knowledge through internal processing.

2025-07-27T05:26:00+00:00 ― 7 min read

Computation and Language DetectBench: A New Standard for Evidence Detection in Language Models

DetectBench evaluates LLMs on their ability to detect hidden evidence in reasoning tasks.

2025-07-27T05:02:18+00:00 ― 5 min read

Computation and Language Stabilizing Fine-Tuning with Delayed Ensemble

A new method to improve model stability and performance in low-resource settings.

2025-07-27T02:00:36+00:00 ― 6 min read

Computation and Language The Impact of Fine-Tuning on Language Models' Factual Recall

How fine-tuning affects language models' ability to recall facts accurately.

2025-07-26T12:34:48+00:00 ― 6 min read

Machine Learning Enhancing Language Models with Prefix Learning and NTK-Attention

Advancements in fine-tuning language models using innovative techniques.

2025-07-26T01:47:00+00:00 ― 6 min read

Computation and Language RankAdaptor: A New Frontier in Model Compression

RankAdaptor optimizes fine-tuning for pruned AI models, enhancing performance efficiently.

2025-07-25T10:30:36+00:00 ― 8 min read

Machine Learning Optimizing Memory in Large Machine Learning Models

Methods to reduce memory usage during fine-tuning of large models.

2025-07-25T09:35:18+00:00 ― 5 min read

Computation and Language Improving Chinese Speech Recognition Through Pinyin Regularization

This study presents a dataset and method to enhance Chinese ASR accuracy using Pinyin.

2025-07-25T07:47:55+00:00 ― 7 min read

Machine Learning Improving Reasoning in Language Models with Preference Optimization

New methods refine reasoning skills in language models for better task performance.

2025-07-25T06:33:36+00:00 ― 7 min read

Machine Learning Improving Alignment in Language Models with WARP

A new method enhances how language models align with human values.

2025-07-24T22:47:30+00:00 ― 6 min read

Computation and Language Improving Instruction-Following Models with Length Instructions

This study focuses on enhancing model responses by targeting specific length requirements.

2025-07-24T13:10:48+00:00 ― 5 min read

Computer Vision and Pattern Recognition Efficient Knowledge Distillation for Smart Devices

Research on improving knowledge transfer in resource-limited smart devices.

2025-07-24T05:56:18+00:00 ― 6 min read

Computation and Language Assessing Retrieval Robustness in Language Models

This study evaluates how well large language models use external information.

2025-07-23T20:27:30+00:00 ― 6 min read

Sound Synthetic Music Dataset Aims to Improve Genre Classification

GTZAN-synth dataset leverages synthetic music for better music tagging systems.

2025-07-23T17:44:30+00:00 ― 5 min read

Neural and Evolutionary Computing Advancements in Spiking Neural Networks for Language Processing

New method enhances spiking neural networks' performance in language tasks.

2025-07-23T09:47:36+00:00 ― 6 min read

Machine Learning Advancing Molecular Design Through Uncertainty-Guided Techniques

New methods improve molecular design by measuring prediction uncertainty.

2025-07-22T13:59:52+00:00 ― 7 min read

Cryptography and Security Advancing Data Processing with Mobile Edge Computing

A new system enhances data processing while ensuring user privacy and efficient resource use.

2025-07-22T09:34:00+00:00 ― 6 min read

Computation and Language HyperLoader: A New Way to Train Models

HyperLoader improves multi-task model training using innovative techniques and hypernetworks.

2025-07-21T16:34:54+00:00 ― 6 min read

Machine Learning Threats to Language Model Safety Revealed

Research shows how easily safety features can be removed from Llama 3 models.

2025-07-21T15:23:48+00:00 ― 5 min read

Machine Learning Improving Model Capacity in Fine-Tuning

A new framework enhances large model performance efficiently during fine-tuning.

2025-07-21T14:04:48+00:00 ― 6 min read

Machine Learning Consistent Proxy Tuning: A New Way for Black-box Models

CPT improves black-box model performance without direct access to internal parameters.

2025-07-21T11:03:06+00:00 ― 6 min read

Machine Learning Advancing On-Device Fine-Tuning for Language Models

Fine-tuning large language models directly on smartphones while protecting user data.

2025-07-21T08:40:54+00:00 ― 6 min read

Software Engineering Improving Code Generation for Domain-Specific Languages

Examining methods to enhance code generation for specialized programming languages using LLMs.

2025-07-21T04:36:00+00:00 ― 6 min read