Latest Articles for Language Models

Computation and Language Improving Example Selection for Language Models

New method enhances performance of language models through better example selection.

2025-09-05T08:26:36+00:00 ― 6 min read

Computation and Language Improving LLM Reasoning with the Mirror Approach

A new method enhances language models' reasoning capabilities through structured feedback.

2025-09-04T23:53:06+00:00 ― 5 min read

Machine Learning Enhancing Language Model Safety with Self-Refine Method

A novel approach to improve the safety of language models without extensive retraining.

2025-09-04T16:46:30+00:00 ― 4 min read

Computation and Language MATHWELL: A New Tool for Generating Math Problems

MATHWELL helps teachers create engaging math problems for K-8 students quickly.

2025-09-04T13:21:06+00:00 ― 6 min read

Computation and Language Evaluating Large Language Models in Multi-Agent Environments

New benchmark assesses LLMs' skills in interacting with multiple agents.

2025-09-04T00:58:30+00:00 ― 12 min read

Computation and Language Reevaluating Language Model Assessments

Research challenges traditional methods of evaluating language model values and opinions.

2025-09-03T21:41:00+00:00 ― 6 min read

Computation and Language Improving Sentence Complexity Predictions with Linguistic Knowledge

Researchers enhance encoder-decoder models to better predict sentence complexity using linguistic features.

2025-09-03T16:48:42+00:00 ― 6 min read

Computation and Language Analyzing Instruction Fine-Tuning in Language Models

A study on the effects of instruction fine-tuning on model performance.

2025-09-03T05:29:18+00:00 ― 4 min read

Computation and Language Comparing Learning Methods for Multilingual NLP

This study evaluates the effectiveness of different learning approaches in multilingual natural language processing.

2025-09-01T15:18:18+00:00 ― 4 min read

Computation and Language How Language Models Reflect Gendered Emotions

This article examines gender biases in language models and their implications.

2025-09-01T07:55:54+00:00 ― 7 min read

Computation and Language Improving Fact Verification in RAG Systems

A new method enhances fact-checking in retrieval-augmented generation systems.

2025-08-31T22:19:12+00:00 ― 7 min read

Computation and Language Evaluating LLMs with the PPTC-R Benchmark

A new benchmark assesses LLM performance on complex PowerPoint tasks.

2025-08-31T21:08:06+00:00 ― 5 min read

Human-Computer Interaction iScore: A Tool for Evaluating Language Models in Education

iScore helps educators evaluate how well language models score written summaries.

2025-08-31T14:33:06+00:00 ― 7 min read

Computer Vision and Pattern Recognition Mipha: A New Efficient Multimodal Assistant

Mipha combines visual and text understanding with smaller models for greater efficiency.

2025-08-30T20:15:00+00:00 ― 6 min read

Human-Computer Interaction Introducing HILL: A Tool for Detecting LLM Errors

HILL helps users spot inaccuracies in language model responses.

2025-08-30T13:00:30+00:00 ― 5 min read

Computation and Language Introducing TaxoLLaMA: A New Approach to Language Tasks

TaxoLLaMA enhances understanding of word meanings and relationships for better language processing.

2025-08-29T06:27:42+00:00 ― 6 min read

Computation and Language Enhancing AI Collaboration Through Semantic Decoding

This paper explores how semantic decoding improves AI teamwork and outputs.

2025-08-27T04:57:18+00:00 ― 6 min read

Computation and Language Adaptive-RAG: A New Approach to Question Complexity

Adaptive-RAG improves answer accuracy by addressing question complexity.

2025-08-27T03:06:42+00:00 ― 6 min read

Computation and Language Evaluating Argument Quality with Language Models

Discover how language models can enhance our understanding of argument quality.

2025-08-26T06:02:42+00:00 ― 8 min read

Cryptography and Security Examining Jailbreak Prompts in AI Language Models

A study of techniques used to bypass safety measures in AI language models.

2025-08-26T04:35:48+00:00 ― 8 min read

Computation and Language Evaluating LLM Performance: MCQs vs. Long-Form Questions

This article discusses the effectiveness of MCQs in testing LLMs compared to long-form questions.

2025-08-25T12:16:12+00:00 ― 5 min read

Computation and Language Evaluating Language Models in Maze Navigation

MANGO benchmark tests language models for navigation and mapping in maze contexts.

2025-08-24T20:28:12+00:00 ― 6 min read

Computation and Language Assessing Multilingual Models in Low-Resource Languages

This study evaluates cross-lingual performance of multilingual models in named entity recognition.

2025-08-24T05:11:48+00:00 ― 6 min read

Computation and Language Assessing Collaboration Between Language Models and Humans

Study explores how language models work with humans and each other in task completion.

2025-08-23T23:55:48+00:00 ― 6 min read

Computation and Language Challenges of Pronoun Use in Language Models

This article examines how language models handle pronouns and the implications for identity.

2025-08-23T01:24:54+00:00 ― 4 min read

Computation and Language Mamba: A Fresh Take on Language Models

Exploring how Mamba recalls and edits facts differently than traditional models.

2025-08-22T14:29:12+00:00 ― 5 min read

Cryptography and Security Vulnerabilities in Language Models: The Sandwich Attack

Examining a new method to exploit language models' weaknesses using low-resource languages.

2025-08-21T04:38:54+00:00 ― 5 min read

Computation and Language Dynamic Personality Generation in Language Models

A new method to shape LLM personalities using the Big Five traits.

2025-08-20T18:06:54+00:00 ― 5 min read

Computation and Language The Impact of Language Imbalance on Multilingual Model Training

Discover how language imbalance can improve multilingual model performance.

2025-08-20T12:27:12+00:00 ― 5 min read

Computer Science and Game Theory Large Language Models and Human-Like Decision Making

This study examines whether language models make decisions like humans in strategic scenarios.

2025-08-20T08:06:30+00:00 ― 9 min read

Computer Vision and Pattern Recognition Evaluating Visual Perception in Language Models

A new benchmark reveals gaps in the visual understanding of large language models.

2025-08-18T12:23:42+00:00 ― 7 min read

Computation and Language Advancements in Unsupervised Constituency Parsing

A look into the span-overlap method for enhanced sentence parsing.

2025-08-18T08:18:48+00:00 ― 6 min read

Computers and Society The Impact of Names on Bias in Language Models

Examining how names influence biases in language models.

2025-08-17T17:02:24+00:00 ― 8 min read

Computation and Language Building a High-Quality Japanese Web Corpus

A robust Japanese corpus created from Common Crawl data improves LLM performance.

2025-08-16T05:53:06+00:00 ― 7 min read

Computation and Language Tracking Changes in Word Meanings Through Model Testing

This study investigates how language models respond to changing word meanings.

2025-08-15T10:39:42+00:00 ― 7 min read

Computation and Language Temperature's Role in Creative Storytelling with LLMs

Examining how temperature influences creativity in language model-generated narratives.

2025-08-14T19:54:54+00:00 ― 7 min read

Computation and Language Addressing Anchored Bias in GPT-2 Models

Investigating positional bias in language models and ways to reduce it.

2025-08-13T06:31:18+00:00 ― 5 min read

Machine Learning Examining GPT-2's Approach to Acronym Prediction

This study dissects how GPT-2 predicts three-letter acronyms.

2025-08-12T22:45:12+00:00 ― 7 min read

Computation and Language Balancing Safety and Utility in Language Models

Strategies to reduce overly cautious behavior in language models.

2025-08-12T18:40:18+00:00 ― 7 min read

Computation and Language A New Approach to Text Evaluation with LLMs

This framework improves text evaluation efficiency and accuracy using Large Language Models.

2025-08-12T05:06:36+00:00 ― 7 min read