Latest Articles for Language Models

Computation and Language Using Large Language Models to Combat Misinformation

Automated tools like LLMs help in verifying claims efficiently.

2025-07-20T20:18:18+00:00 ― 6 min read

Machine Learning A Simple Method to Protect Language Models

This approach uses self-evaluation to guard against harmful outputs in language models.

2025-07-20T09:06:48+00:00 ― 2 min read

Computation and Language The Impact of Quantization on Multilingual Models

Studying how quantization affects performance in different languages.

2025-07-20T08:43:06+00:00 ― 5 min read

Computation and Language Divergent Chain of Thought (DCoT): A New Approach to Language Models

DCoT enhances language model performance through multiple reasoning paths.

2025-07-20T08:03:36+00:00 ― 7 min read

Computation and Language Evolving Meanings: Analyzing Word Changes Over Time and Context

Study reveals how word meanings shift with context and time using word embeddings.

2025-07-20T01:12:48+00:00 ― 5 min read

Computation and Language Introducing the HaF-RM Framework for Reward Models

A new approach to training reward models that aligns with human preferences.

2025-07-19T15:51:54+00:00 ― 5 min read

Computation and Language Optimizing Prompts for Language Models

Adapting prompts to specific models improves performance in language tasks.

2025-07-19T14:09:12+00:00 ― 7 min read

Computation and Language Semantic Graphs and Syntactic Simplification with LLMs

Examining the role of semantic graphs in simplifying sentences with large language models.

2025-07-19T12:18:36+00:00 ― 6 min read

Computation and Language Advancements in Citation Text Generation with LLMs

Research explores improving citation text generation using large language models.

2025-07-19T11:23:18+00:00 ― 5 min read

Computation and Language Counterfactual Generation in Natural Language Processing

A look into methods and challenges of generating counterfactuals in NLP.

2025-07-19T10:12:12+00:00 ― 5 min read

Public and Global Health Classifying Tweets on Childhood Disorders

A study classifies tweets from parents about childhood disorders.

2025-07-19T00:48:00+00:00 ― 5 min read

Computation and Language Aligning AI Evaluations with Human Preferences

The study reveals the bias in AI evaluation tools favoring longer responses.

2025-07-19T00:11:48+00:00 ― 4 min read

Human-Computer Interaction Human Interactions with AI: Toxicity and Communication

Examining how users shape toxic language in conversations with large language models.

2025-07-17T07:22:48+00:00 ― 5 min read

Computation and Language Advances in Low-Resource Text Summarization

A new method improves summarization with limited training data.

2025-07-16T22:49:18+00:00 ― 4 min read

Computers and Society Assessing Large Language Models in Theory of Computing

This paper evaluates LLM performance in a Theory of Computing course.

2025-07-15T21:16:42+00:00 ― 5 min read

Artificial Intelligence Assessing Credences in Large Language Models

Exploring how confidence levels are attributed to LLMs and their implications.

2025-07-15T01:00:06+00:00 ― 7 min read

Artificial Intelligence Assessing Reasoning Skills in Language Models Through Games

We test language models' reasoning skills using various games, revealing significant limitations.

2025-07-14T18:48:48+00:00 ― 8 min read

Computation and Language Innovative Framework for Automatic Science Journalism

A new method simplifies science communication using collaborative language models.

2025-07-14T18:33:00+00:00 ― 5 min read

Computation and Language Improving LLM Efficiency with Shared Attention

A new method enhances the efficiency of language models using shared attention weights.

2025-07-14T05:07:12+00:00 ― 5 min read

Physics and Society How Large Language Models Transform Information

This study examines how LLMs change information through interactions.

2025-07-13T03:14:51+00:00 ― 5 min read

Machine Learning Analyzing Learning Dynamics in Large Language Models

This paper studies how training influences the predictions of large language models.

2025-07-13T01:12:24+00:00 ― 6 min read

Computation and Language Improving Long Text Processing in Language Models

New methods enhance cache management for large language models.

2025-07-12T23:37:36+00:00 ― 5 min read

Artificial Intelligence MMAU Benchmark: Assessing Language Model Skills

A detailed look at the MMAU benchmark for language models.

2025-07-12T02:25:42+00:00 ― 5 min read

Computation and Language The Impact of Embedding Initialization in Transformers

This article examines how embedding initialization affects transformer model performance.

2025-07-11T16:49:00+00:00 ― 6 min read

Machine Learning Evaluating the Reliability of Steering Vectors in AI Models

This article analyzes the effectiveness and reliability of steering vectors in language models.

2025-07-11T13:31:30+00:00 ― 6 min read

Computation and Language Can AI Compete with Human Storytelling?

Analyzing the storytelling capabilities of large language models compared to human authors.

2025-07-10T18:49:42+00:00 ― 4 min read

Artificial Intelligence Evaluating Language Models in Scientific Coding

A new benchmark assesses language models on scientific coding challenges across multiple fields.

2025-07-10T17:22:48+00:00 ― 5 min read

Machine Learning New Attacks Expose Flaws in Text Watermarking

Research reveals vulnerabilities in watermarking methods for AI-generated text.

2025-07-10T05:08:06+00:00 ― 12 min read

Artificial Intelligence Challenges of Language Models in Abstract Reasoning

An examination of how LLMs perform on the Abstraction and Reasoning Corpus.

2025-07-09T23:52:06+00:00 ― 5 min read

Computation and Language Evaluating LLMs Through Grid Puzzles

An analysis of LLM performance on grid puzzles to assess reasoning abilities.

2025-07-09T18:51:54+00:00 ― 6 min read

Computation and Language Improving Text Generation with Multi-Prompt Decoding

This article examines multi-prompt decoding to enhance text generation quality.

2025-07-09T15:34:24+00:00 ― 6 min read

Computer Vision and Pattern Recognition Evaluating Multimodal Models with MIBench

MIBench tests multimodal models' performance on multiple images.

2025-07-09T14:23:18+00:00 ― 6 min read

Hardware Architecture Improving HDL Code Generation with Hierarchical Prompting

A new method enhances LLM efficiency in creating complex hardware designs.

2025-07-08T15:05:00+00:00 ― 5 min read

Computation and Language Comparing RAG and Long-Context Language Models

Analyzing the effectiveness of RAG and long-context LLMs in processing text.

2025-07-08T12:03:18+00:00 ― 6 min read

Computation and Language Analyzing Language Agents in Strategic Games

A study on language agents' behavior in a social deduction game.

2025-07-08T07:18:54+00:00 ― 4 min read

Computation and Language Improving Storytelling Accuracy with Fact Tracking

A new method to detect and fix factual errors in storytelling.

2025-07-08T03:45:36+00:00 ― 10 min read

Computation and Language Improving Math Reasoning in Smaller Language Models

A new method enhances math solving skills in smaller language models using DPO and self-training.

2025-07-07T04:11:30+00:00 ― 6 min read

Computation and Language The Need for Personalization in AI Models

New methods for personalizing AI language models are essential for user diversity.

2025-07-07T00:54:00+00:00 ― 6 min read

Machine Learning Challenges and Insights in Language Model Generalization

A look into how language models handle arithmetic tasks and their learning process.

2025-07-06T22:55:30+00:00 ― 6 min read

Computation and Language Evaluating Language Models: A New Toolkit

A toolkit designed for better evaluation of human-bot interactions.

2025-07-06T18:11:06+00:00 ― 5 min read