Latest Articles for Language Models

Computation and Language Enhancing LLM Performance in Educational Text Difficulty Assessment

New metrics improve large language models' effectiveness in education.

2025-08-10T18:28:54+00:00 ― 6 min read

Computation and Language Memorization in Large Language Models Explained

This article examines how large language models recall information from training data.

2025-08-10T00:02:54+00:00 ― 6 min read

Computation and Language Enhancing Language Models for Uralic Languages

Adapting multilingual models can improve performance for less-used Uralic languages.

2025-08-09T22:43:54+00:00 ― 5 min read

Computation and Language Advancements in Ordinal Classification for NLP

Explore the role of ordinal classification and the impact of pretrained language models.

2025-08-09T17:20:00+00:00 ― 6 min read

Computation and Language Understanding In-Context Learning with DETAIL Method

Explore how DETAIL enhances understanding of in-context learning in language models.

2025-08-09T06:40:06+00:00 ― 6 min read

Computation and Language Introducing Triple Preference Optimization for LLMs

TPO offers a new method to align language models with human preferences efficiently.

2025-08-06T22:11:00+00:00 ― 6 min read

Computation and Language Introducing ThReaD: A New Approach for Language Models

ThReaD improves LLMs' performance on complex tasks through dynamic thread management.

2025-08-06T10:20:00+00:00 ― 5 min read

Machine Learning Ensuring Safety in Fine-Tuning Language Models

This article examines the risks of fine-tuning language models for safety.

2025-08-06T09:40:30+00:00 ― 3 min read

Computation and Language Improving Safety in Large Language Models

A new approach enhances prompt diversity for safer language models.

2025-08-05T22:44:48+00:00 ― 7 min read

Cryptography and Security Detecting Watermarks in Language Models

Research reveals the challenges of watermark detection in large language models.

2025-08-05T11:25:24+00:00 ― 7 min read

Computation and Language Improving Language Models with Adversarial Tuning

This study presents a system to enhance language model accuracy using adversarial challenges.

2025-08-05T10:14:18+00:00 ― 7 min read

Computation and Language Dynamic Team Building for Language Models

Learn how adaptive teams improve task performance with language model agents.

2025-08-05T06:56:48+00:00 ― 6 min read

Computation and Language Introducing MAP-Neo: A New Open-Source Bilingual Model

MAP-Neo aims for transparency and performance in AI language modeling.

2025-08-04T21:04:18+00:00 ― 5 min read

Cryptography and Security Addressing LLM Watermarking Vulnerabilities

Examining the challenges and solutions in LLM watermarking to prevent misuse.

2025-08-04T12:07:06+00:00 ― 6 min read

Computation and Language Advancements in Korean Language Model Evaluation

New resources enhance assessment of Korean language models.

2025-08-04T10:48:06+00:00 ― 4 min read

Computation and Language The Impact of Instruction Diversity on Language Models

Research shows diverse instructions improve language model performance in unseen tasks.

2025-08-04T06:11:36+00:00 ― 7 min read

Computation and Language Better Decisions with Language Model Agents

Research introduces a method to improve decision-making in language model agents.

2025-08-04T04:21:00+00:00 ― 9 min read

Computation and Language Assessing Reasoning Abilities of Language Models

This study examines how LLMs handle reasoning in abstract and contextual scenarios.

2025-08-02T16:24:18+00:00 ― 5 min read

Computation and Language Introducing the Block Transformer: A New Approach to Language Models

The Block Transformer improves text processing speed and efficiency in language models.

2025-08-02T14:49:30+00:00 ― 6 min read

Machine Learning LLMs Struggle with Basic Reasoning Tasks

Recent tests reveal LLMs' weaknesses in simple reasoning despite high benchmark scores.

2025-08-02T09:01:54+00:00 ― 5 min read

Software Engineering Refactoring Pythonic Code: A New Approach

A guide to transforming non-idiomatic Python code using modern techniques.

2025-08-02T04:01:42+00:00 ― 6 min read

Computation and Language Analyzing Reliability in Language Model Summarization

This study examines how LLMs handle changes in summarization tasks.

2025-08-01T07:37:12+00:00 ― 8 min read

Computation and Language Generating Meaningful Sentences with FrameNet

This study explores how to create sentences that maintain specific meanings using FrameNet.

2025-07-31T20:09:54+00:00 ― 9 min read

Computation and Language Evaluating GPT-4 for Scientific Information Extraction

This study assesses GPT-4's ability to extract data from materials science literature.

2025-07-31T13:11:12+00:00 ― 6 min read

Cryptography and Security The Threat of Jamming Attacks on RAG Systems

Jamming attacks can disrupt retrieval-augmented generation systems by blocking responses.

2025-07-31T10:09:30+00:00 ― 6 min read

Computation and Language Assessing Language Models as World Simulators

This article evaluates the capability of language models to simulate game environments.

2025-07-31T02:15:30+00:00 ― 5 min read

Computation and Language Evaluating Reasoning Strategies in Large Language Models

A new approach to assess reasoning strategies with a focus on computational costs.

2025-07-31T01:43:54+00:00 ― 7 min read

Computation and Language MedExQA: Advancing Medical Question-Answering Systems

MedExQA sets a new standard for evaluating medical language models with a focus on explanations.

2025-07-30T23:13:48+00:00 ― 6 min read

Computation and Language Assessing Out-of-Context Knowledge Reasoning in LLMs

Study evaluates how well LLMs reason beyond immediate context.

2025-07-30T11:54:24+00:00 ― 5 min read

Artificial Intelligence Challenges in Direct Preference Optimization for LLMs

Exploring the limitations of Direct Preference Optimization in language model training.

2025-07-30T10:59:06+00:00 ― 6 min read

Artificial Intelligence Assessing Language Models in Research Activities

Evaluating how well language models perform research surveys across various academic fields.

2025-07-30T05:11:30+00:00 ― 6 min read

Computation and Language StreamBench: Evaluating Language Models in Real-Time

A new tool to assess language models' continuous improvement through feedback.

2025-07-30T03:52:30+00:00 ― 6 min read

Computation and Language Evaluating Language Models Through Collaboration

A new framework assesses language models on emotional intelligence and creativity.

2025-07-30T00:50:48+00:00 ― 7 min read

Machine Learning Improving Learning with Effective Example Selection

New methods enhance language models' performance through better example selection.

2025-07-29T15:22:00+00:00 ― 7 min read

Computation and Language Improving Text Readability with ReadCtrl

ReadCtrl allows language models to better match text complexity to reader abilities.

2025-07-29T08:07:30+00:00 ― 5 min read

Sound GAMA: A New Model for Sound Understanding

GAMA improves audio processing by merging sound and language insights.

2025-07-29T04:55:00+00:00 ― 5 min read

Computation and Language Evaluating LLMs with the New SciEx Benchmark

SciEx reveals strengths and challenges of LLMs in scientific evaluation.

2025-07-29T00:53:00+00:00 ― 6 min read

Computation and Language Improving BERT's Knowledge on COVID-19

This study shows how BERT learns COVID-19 facts through continuous training.

2025-07-28T18:57:30+00:00 ― 4 min read

Computation and Language Evaluating Large Language Models with Structured Text

A new benchmark tests LLMs' abilities with structured data formats.

2025-07-28T12:22:30+00:00 ― 6 min read

Computation and Language Improving LLM Agents with Step-Level Guidance

A new framework enhances how LLM agents learn through detailed process guidance.

2025-07-28T09:20:48+00:00 ― 7 min read