A look at how o1 models plan actions and their performance across various tasks.
Kevin Wang, Junbo Li, Neel P. Bhatt
― 7 min read
A look into how word embeddings are analyzed using independent component analysis.
Momose Oyama, Hiroaki Yamagiwa, Hidetoshi Shimodaira
― 5 min read
A new method for assessing AI-generated medical explanations using Proxy Tasks.
Iker De la Iglesia, Iakes Goenaga, Johanna Ramirez-Romero
― 5 min read
Exploring how smaller models struggle with inaccuracies from larger counterparts.
Phil Wee, Riyadh Baghdadi
― 6 min read
LLM-Ref aids researchers in crafting clearer, well-structured papers effortlessly.
Kazi Ahmed Asif Fuad, Lizhong Chen
― 6 min read
Exploring how well AI understands human communication.
Mingyue Jian, Siddharth Narayanaswamy
― 6 min read
Research shows new methods to better align LLMs with human feedback.
Zichen Liu, Changyu Chen, Chao Du
― 6 min read
A study compares human and AI creativity in storytelling.
Mete Ismayilzada, Claire Stevenson, Lonneke van der Plas
― 6 min read
Assessing prompt engineering's relevance with new reasoning models.
Guoqing Wang, Zeyu Sun, Zhihao Gong
― 7 min read
A look at in-context databases and their potential with language models.
Yu Pan, Hongfeng Yu, Tianjiao Zhao
― 5 min read
Assessing the role of multilingual models in supporting bilingual students.
Anand Syamkumar, Nora Tseng, Kaycie Barron
― 6 min read
Examining vulnerabilities in watermarking methods against paraphrasing attacks.
Saksham Rastogi, Danish Pruthi
― 7 min read
Assessing language models' understanding of proverbs in low-resource languages.
Israel Abebe Azime, Atnafu Lambebo Tonja, Tadesse Destaw Belay
― 5 min read
Investigating how wealth influences language models in travel narratives.
Kirti Bhagat, Kinshuk Vasisht, Danish Pruthi
― 7 min read
SCAR enhances language models by reducing toxic language in text generation.
Ruben Härle, Felix Friedrich, Manuel Brack
― 5 min read
Research shows variation in speech improves language model training.
Akari Haga, Akiyo Fukatsu, Miyu Oba
― 5 min read
Explore the impact of question styles on AI model performance.
Jia He, Mukund Rungta, David Koleczek
― 5 min read
A new method to develop guardrails for large language models without real-world data.
Gabriel Chua, Shing Yee Chan, Shaun Khoo
― 6 min read
A new method enhances the safety of code generated by language models.
Xiangzhe Xu, Zian Su, Jinyao Guo
― 5 min read
SpecTool brings clarity to LLM errors when using tools.
Shirley Kokane, Ming Zhu, Tulika Awalgaonkar
― 4 min read
A study reveals how prompt injection can compromise language models.
Jiashuo Liang, Guancheng Li, Yang Yu
― 10 min read
This study examines how well LLMs assess creativity in the Alternative Uses Test.
Abdullah Al Rabeyah, Fabrício Góes, Marco Volpe
― 5 min read
PEFT methods enhance language models while safeguarding private data.
Olivia Ma, Jonathan Passerat-Palmbach, Dmitrii Usynin
― 7 min read
A study on how well language models connect facts without shortcuts.
Sohee Yang, Nora Kassner, Elena Gribovskaya
― 7 min read
A new method for language models to enhance their responses through self-generated critiques.
Yue Yu, Zhengxing Chen, Aston Zhang
― 6 min read
How low-bit quantization affects large language models during training.
Xu Ouyang, Tao Ge, Thomas Hartvigsen
― 6 min read
A new method automates news classification, saving time and resources for organizations.
Taja Kuzman, Nikola Ljubešić
― 4 min read
Evaluating if language models can understand spatial relationships effectively.
Anthony G Cohn, Robert E Blackwell
― 6 min read
Discover how to improve large language models in handling symmetric tasks.
Mohsen Dehghankar, Abolfazl Asudeh
― 7 min read
Evaluating language models' abilities in synthetic data creation using AgoraBench.
Seungone Kim, Juyoung Suk, Xiang Yue
― 5 min read
How language models improve their understanding of grammar and sentence structures.
Tian Qin, Naomi Saphra, David Alvarez-Melis
― 8 min read
Exploring how transformers can express uncertainty to improve AI reliability.
Greyson Brothers, Willa Mannering, Amber Tien
― 6 min read
Large language models excel in some areas but struggle with general tasks.
Basab Jha, Ujjwal Puri
― 7 min read
Discover how activation sparsity boosts AI efficiency and speed.
Vui Seng Chua, Yujie Pan, Nilesh Jain
― 5 min read
Explore the connections between language models and physical phenomena in an engaging way.
Yuma Toji, Jun Takahashi, Vwani Roychowdhury
― 9 min read
Researchers are improving AI's ability to tackle complex questions with AutoReason.
Arda Sevinc, Abdurrahman Gumus
― 5 min read
Researchers tackle biases in language models for Filipino, enhancing cultural relevance.
Lance Calvin Lim Gamboa, Mark Lee
― 5 min read
This article examines the complex role of English in multilingual evaluations.
Wessel Poelman, Miryam de Lhoneux
― 7 min read
Learn how Sloth is changing predictions for language model performance.
Felipe Maia Polo, Seamus Somerstep, Leshem Choshen
― 6 min read
BatchTopK sparse autoencoders improve language processing through smart data selection.
Bart Bussmann, Patrick Leask, Neel Nanda
― 5 min read