Latest Articles for LLMs

Computation and Language New Benchmark for Mandarin Language Models

Evaluating LLM performance in Mandarin Chinese through a new benchmark called CMMLU.

2025-10-30T23:08:54+00:00 ― 5 min read

Computation and Language The Role of Human-Labeled Data in LLM Growth

Exploring the balance between human input and machine learning capabilities.

2025-10-29T16:04:30+00:00 ― 6 min read

Computation and Language Evaluating LLMs with External Tools

A dataset designed to assess LLMs' use of external tools for answering questions.

2025-10-27T17:35:48+00:00 ― 5 min read

Materials Science Harnessing Large Language Models in Science

LLMs showcase potential to advance chemistry and materials science through innovative projects.

2025-10-27T15:12:27+00:00 ― 8 min read

Computation and Language Evaluating Language Models in Programming Tasks

This article explores how well language models understand programming challenges.

2025-10-26T21:42:54+00:00 ― 6 min read

Computation and Language Evaluating Language Models for Medical Named Entity Recognition

This study assesses language models in medical named entity recognition accuracy.

2025-10-25T12:24:12+00:00 ― 4 min read

Computation and Language The Impact of Personality in Large Language Models

Examining how personality traits influence language models and their communication.

2025-10-25T09:22:30+00:00 ― 7 min read

Computer Vision and Pattern Recognition Advancements in Language and Visual Models

A new model efficiently links language understanding with image processing.

2025-10-25T06:20:48+00:00 ― 5 min read

Image and Video Processing The Role of AI in Medical Imaging

AI chatbots are transforming medical imaging by enhancing efficiency and communication.

2025-10-24T14:05:50+00:00 ― 5 min read

Computation and Language Advancing Biomedical Concept Linking with Language Models

New method enhances biomedical concept linking using large language models.

2025-10-24T08:53:06+00:00 ― 7 min read

Computation and Language Assessing Large Language Models in Geometry Understanding

This study evaluates how well LLMs recognize and relate geometric shapes.

2025-10-24T02:41:48+00:00 ― 6 min read

Software Engineering Leveraging LLMs for Property-Based Testing

Exploring how LLMs can aid in property-based testing for software.

2025-10-21T16:22:06+00:00 ― 7 min read

Computation and Language LLM-Assisted Content Analysis: A New Approach

Integrating LLMs in deductive coding streamlines content analysis for researchers.

2025-10-21T09:34:00+00:00 ― 6 min read

Psychiatry and Clinical Psychology Evaluating AI's Role in Mental Health Care

A study assesses AI's effectiveness in recognizing mental health risks.

2025-10-21T01:36:00+00:00 ― 6 min read

Computation and Language Improving LLM Responses to Negated Questions

This article examines how LLMs handle negated questions and proposes improvements.

2025-10-20T07:42:54+00:00 ― 5 min read

Computation and Language Large Language Models in Healthcare: A Comprehensive Evaluation

Assessing the impact of LLMs on medical tasks and their potential applications.

2025-10-17T02:25:36+00:00 ― 5 min read

Computation and Language Evaluating the Trustworthiness of LLMs in Children's Story Creation

This study assesses LLMs' ability to create trustworthy children's stories.

2025-10-16T04:26:18+00:00 ― 4 min read

Computation and Language Evaluating Large Language Models in Radiology

A study assesses the effectiveness of LLMs in interpreting radiology reports.

2025-10-16T00:21:24+00:00 ― 6 min read

Materials Science Advancements in Crystal Structure Generation with AI

CrystaLLM leverages AI to speed up crystal structure creation using Crystallographic Information File (CIF) data.

2025-10-14T20:32:03+00:00 ― 6 min read

Computation and Language Assessing Personalities of Large Language Models

Investigating whether LLMs exhibit human-like personalities through MBTI analysis.

2025-10-14T06:37:06+00:00 ― 7 min read

Software Engineering Improving Automated Software Traceability with LLMs

This article discusses using prompts to enhance software traceability with large language models.

2025-10-14T02:08:30+00:00 ― 7 min read

Computation and Language The Growing Challenge of Detecting Machine-Generated Text

Researchers develop methods to identify text created by machines versus humans.

2025-10-13T05:36:06+00:00 ― 5 min read

Cryptography and Security Risks of Prompt-to-SQL Injection in Chatbots

A study of the vulnerability of LLM-integrated applications to SQL injection attacks.

2025-10-12T19:19:54+00:00 ― 7 min read

Computation and Language New Method for Evaluating Language Model Responses

A novel approach uses wider networks to improve evaluation quality of language models.

2025-10-12T17:13:30+00:00 ― 6 min read

Computation and Language Large Language Models Changing Medical Data Processing

Research highlights LLMs' role in improving medical data extraction and classification.

2025-10-12T15:15:00+00:00 ― 5 min read

Information Retrieval Assessing LLMs in GTFS Data Analysis

This research explores how LLMs can process and retrieve General Transit Feed Specification (GTFS) data.

2025-10-12T08:24:12+00:00 ― 5 min read

Software Engineering The Challenge of Non-Determinism in Code Generation

Examining the unpredictable nature of code generation with ChatGPT.

2025-10-11T23:50:42+00:00 ― 5 min read

Software Engineering Challenges and Solutions in Code Translation with LLMs

Examining limitations of LLMs in translating code and techniques for improvement.

2025-10-11T22:55:24+00:00 ― 5 min read

Computation and Language Integrating Language Models with Planning Systems

A new method combines language models and planners for complex tasks.

2025-10-09T19:50:12+00:00 ― 6 min read

Cryptography and Security Addressing Software Supply Chain Security Challenges

Exploring recent summit insights on software supply chain security.

2025-10-09T07:11:48+00:00 ― 5 min read

Cryptography and Security The Dark Side of AI: Malware Threats

Advanced AI tools can be misused for creating malware, raising cybersecurity concerns.

2025-10-07T15:33:54+00:00 ― 5 min read

Computation and Language Advancements in Visual Reasoning with LLMs

New methods combine fast and slow reasoning for improved visual problem-solving.

2025-10-07T05:33:30+00:00 ― 6 min read

Information Retrieval Improving News Recommendations with LKPNR

A new framework combines LLMs and knowledge graphs (KGs) for better personalized news suggestions.

2025-10-05T04:34:42+00:00 ― 5 min read

Cryptography and Security Securing Sensitive Software Against Side-Channel Attacks

Learn how to protect software from side-channel attacks using automated tools.

2025-10-04T23:26:36+00:00 ― 5 min read

Computation and Language Assessing the Risks of Medical Chatbots

An analysis of the dangers in using language models for medical queries.

2025-10-03T13:20:30+00:00 ― 6 min read

Programming Languages AskIt: A New Tool for Software Development with LLMs

AskIt simplifies LLM integration in software projects, improving efficiency and reducing code length.

2025-10-03T04:31:12+00:00 ― 7 min read

General Economics The Impact of Large Language Models on China's Labor Market

Examining how LLMs are altering job dynamics in China.

2025-10-02T02:32:00+00:00 ― 4 min read

Robotics How Language Models Can Guide Robot Learning

Exploring the role of language models in teaching robots to learn through interaction.

2025-10-01T11:58:00+00:00 ― 6 min read

Robotics Challenges and Insights in Human-Robot Interaction

Exploring key issues in how humans and robots communicate.

2025-09-29T14:00:54+00:00 ― 5 min read

Computation and Language AI Participation in Communication Games: The Werewolf Experiment

This study examines how LLMs engage in communication games like Werewolf.

2025-09-29T06:30:36+00:00 ― 6 min read