Latest Articles for Language Models

Computation and Language New Sentence Encoders for Portuguese Language

Introducing models designed to improve natural language processing in Portuguese.

2025-07-05T22:10:18+00:00 ― 6 min read

Computation and Language Enhancing Efficiency in Prompt Engineering

Learn how active prompt engineering improves language model performance on tasks.

2025-07-05T19:48:06+00:00 ― 5 min read

Computation and Language Optimizing Chunk Size for Better AI Responses

This article reviews how chunk size affects AI-generated answers.

2025-07-05T08:28:42+00:00 ― 6 min read

Computation and Language New Method for Detecting Pre-training Data in Language Models

A fresh approach highlights surprising tokens to assess language model training data.

2025-07-05T05:19:06+00:00 ― 6 min read

Computation and Language Improving Italian Language Models for Legal and Bureaucratic Contexts

This study examines methods to enhance Italian language models in specialized fields.

2025-07-04T18:55:00+00:00 ― 9 min read

Machine Learning Strengthening Safety in Open-Weight LLMs

A new method improves tamper resistance in open-weight language models.

2025-07-03T22:14:42+00:00 ― 7 min read

Computation and Language Improving Small Language Models with Fine-Tuning Techniques

Enhancing smaller language models like MiniCPM through effective fine-tuning practices.

2025-07-03T20:24:06+00:00 ― 6 min read

Computation and Language Evaluating Spatial Reasoning in Language Models

A benchmark assesses large language models' ability to understand spatial relationships.

2025-07-02T22:09:00+00:00 ― 4 min read

Cryptography and Security Identifying Large Language Models Through Unique Traits

A new method analyzes language models by examining their specific characteristics.

2025-07-02T06:36:48+00:00 ― 4 min read

Computation and Language Impact of Format Restrictions on LLM Performance

This article examines how structured generation affects language model reasoning and comprehension.

2025-07-02T00:09:42+00:00 ― 5 min read

Computation and Language OpenFactCheck: A New Tool for Fact-Checking LLMs

OpenFactCheck provides a framework for evaluating the accuracy of language model outputs.

2025-07-01T18:14:12+00:00 ― 5 min read

Computation and Language Addressing Bias in Language Models with BiasKE and FAST

Innovative methods to enhance fairness in large language models.

2025-07-01T07:42:12+00:00 ― 7 min read

Computation and Language Advancing Synthetic Data for Language Models

A new method enhances synthetic data quality for better language model alignment.

2025-06-30T13:24:06+00:00 ― 5 min read

Audio and Speech Processing Improving Speech Recognition with Contextual Keywords

A new system uses contextual keywords to improve speech recognition accuracy.

2025-06-29T22:53:15+00:00 ― 5 min read

Artificial Intelligence SAGE-RT: A New Method for Language Model Safety

SAGE-RT creates synthetic data to improve language model safety assessments.

2025-06-28T06:37:42+00:00 ― 5 min read

Computation and Language New Benchmark Evaluates Legal Knowledge in Arabic Language Models

ArabLegalEval assesses LLMs' performance in handling Arabic legal information.

2025-06-27T05:52:30+00:00 ― 6 min read

Computation and Language Evaluating Language Models with Multiple LLMs

A new method to assess language model outputs using multiple LLM judges.

2025-06-26T14:28:12+00:00 ― 7 min read

Computation and Language Evaluating Language Model Agents in Scientific Research

A new benchmark assesses how well language model agents handle scientific data analysis.

2025-06-26T10:47:00+00:00 ― 7 min read

Computation and Language Improving Small Language Models in Telecom

New methods enhance small models' accuracy in telecommunications question answering.

2025-06-25T02:31:30+00:00 ― 5 min read

Computation and Language Addressing Knowledge Conflicts in LLMs with ConflictBank

ConflictBank offers insights into knowledge conflicts in language models.

2025-06-24T17:42:12+00:00 ― 5 min read

Computation and Language The Impact of Memorization in In-Context Learning

This article explores the role of memorization in improving ICL performance.

2025-06-24T07:18:06+00:00 ― 5 min read

Computation and Language New Text Embedding Model for Russian Language

Introducing a new model and benchmark for Russian text processing.

2025-06-23T18:55:30+00:00 ― 5 min read

Artificial Intelligence Evaluating Language Model Metrics: A Deep Dive

Researchers examine the reliability of metrics for language model safety.

2025-06-23T14:50:36+00:00 ― 6 min read

Computation and Language The Impact of Next-Token Prediction on Language Models

A deep dive into how next-token prediction shapes language understanding in models.

2025-06-21T16:14:00+00:00 ― 6 min read

Distributed, Parallel, and Cluster Computing Efficient Training of Long-Context Language Models Using FPDT

FPDT offers a solution for training long-context LLMs more efficiently.

2025-06-20T12:35:00+00:00 ― 5 min read

Computation and Language MemLong: Transforming Language Models for Long Texts

MemLong improves language models' ability to handle lengthy texts effectively.

2025-06-20T12:19:12+00:00 ― 6 min read

Computers and Society Generating Social Networks Using Language Models

This article analyzes how language models generate realistic social networks and the biases they exhibit.

2025-06-20T06:31:36+00:00 ― 6 min read

Computation and Language Improving AI Reasoning with Self-Critique

This article discusses a new framework for enhancing reasoning in AI models.

2025-06-20T01:31:24+00:00 ― 5 min read

Computation and Language A New Way to Measure Creativity

Introducing a framework for generating creativity test items using language models.

2025-06-19T19:43:48+00:00 ― 5 min read

Computation and Language Improving Long-Text Handling in LLMs with YOURA

A new method enhances long-text processing in language models for better answers.

2025-06-18T05:17:00+00:00 ― 5 min read

Computation and Language Evaluating Long-Form Text Generation in LLMs

LongGenBench assesses how well large language models generate high-quality long text.

2025-06-17T21:54:36+00:00 ― 5 min read

Computation and Language The Continued Importance of Retrieval-Augmented Generation

RAG remains vital in optimizing language model responses, especially with long texts.

2025-06-17T14:40:06+00:00 ― 5 min read

Machine Learning Evaluating Sparse Autoencoders in Language Models

This article assesses how effectively sparse autoencoders represent knowledge about cities.

2025-06-16T21:25:12+00:00 ― 5 min read

Computation and Language How Learning Methods Shape Language Models

A study on the impact of in-context learning (ICL) and supervised fine-tuning (SFT) on language model structure.

2025-06-16T16:25:00+00:00 ― 6 min read

Computation and Language Improving Machine Translation with Fine-Tuning Techniques

A study shows that fine-tuning LLMs with translation memories (TMs) improves translation quality for organizations.

2025-06-16T11:48:30+00:00 ― 6 min read

Machine Learning Understanding Multi-layer Sparse Autoencoders in Language Models

This article discusses MLSAEs and their role in examining language model layers.

2025-06-15T23:57:30+00:00 ― 5 min read

Computation and Language ECHO: A New Approach in Reasoning Techniques

ECHO combines diverse reasoning patterns for better problem-solving in language models.

2025-06-15T21:43:12+00:00 ― 6 min read

Software Engineering Evaluating Language Models for Web App Coding

A study assesses language models on their ability to generate web application code.

2025-06-15T12:06:30+00:00 ― 6 min read

Cryptography and Security AdaPPA: A New Approach to Jailbreak Attacks on LLMs

AdaPPA enhances jailbreak attacks on language models by combining safe and harmful responses.

2025-06-14T16:05:42+00:00 ― 5 min read

Machine Learning Improving Code Generation with PF-PPO

PF-PPO enhances language models by filtering out unreliable rewards for better code responses.

2025-06-14T14:15:06+00:00 ― 5 min read