Latest Articles for Evaluation

Computation and Language LLMs Outperform Traditional Systems in Translation

Study shows LLMs provide more natural translations, especially for idiomatic phrases.

2025-11-08T23:12:48+00:00 ― 5 min read

Human-Computer Interaction The AMS Algorithm: A Tool for Job Placement

Examining the AMS algorithm's impact on job-seeker evaluations and public opinion.

2025-11-08T21:45:54+00:00 ― 6 min read

Computation and Language Analyzing Sentiment in Russian News Articles

A study on targeted sentiment analysis in Russian news reports.

2025-11-08T07:01:06+00:00 ― 4 min read

Computation and Language Advancements in Evaluating NLP Model Robustness

A new framework enhances evaluation of NLP models against adversarial attacks.

2025-11-07T23:30:48+00:00 ― 6 min read

Computation and Language Evaluating ChatGPT: Performance Across Tasks

An in-depth analysis of ChatGPT's capabilities across various tasks and challenges.

2025-11-07T23:07:06+00:00 ― 6 min read

Artificial Intelligence New Method for Evaluating Soccer Players

A fresh approach to assess both on-ball and off-ball player actions.

2025-11-07T16:32:06+00:00 ― 4 min read

Computation and Language Improving Dialogue Models with Expert Assistance

A new method enhances dialogue models for better interaction in mental health support.

2025-11-07T16:16:18+00:00 ― 5 min read

Human-Computer Interaction The Rise of Serious Games in Learning

Discover the impact of serious games on education and training.

2025-11-07T01:23:36+00:00 ― 4 min read

Computation and Language Improving Summarization Accuracy with Reinforcement Learning

A new method enhances summary accuracy while maintaining informative content.

2025-11-06T22:45:36+00:00 ― 8 min read

Computer Vision and Pattern Recognition Advances in Structured Text Extraction Techniques

A look at recent developments in extracting text from complex documents.

2025-11-04T13:13:18+00:00 ― 5 min read

Computation and Language Measuring Imageability with Text-to-Image Models

Research explores how words create mental images using advanced technology.

2025-11-04T10:43:12+00:00 ― 5 min read

Machine Learning Mild Constraints Enhance Offline Reinforcement Learning

New policy approach improves evaluation performance in offline RL applications.

2025-11-04T00:11:12+00:00 ― 6 min read

Computation and Language The Role of Instruction Tuning in Language Models

Explore how instruction tuning enhances language model performance across various tasks.

2025-11-03T06:32:36+00:00 ― 6 min read

Computation and Language PandaLM: A New Tool for Language Model Tuning

PandaLM automates evaluation processes to improve large language models' instruction following.

2025-11-02T19:44:48+00:00 ― 5 min read

Computer Vision and Pattern Recognition Video-ChatGPT: The Future of Video Understanding

A new model enables detailed conversations about video content.

2025-11-02T19:21:06+00:00 ― 5 min read

Computation and Language Evaluating Language Models with Xiezhi Benchmark

Xiezhi offers a new way to assess language models across diverse subjects.

2025-11-02T05:39:30+00:00 ― 5 min read

Computation and Language Detecting ChatGPT-Generated Text in French

Researchers develop methods to detect text generated by ChatGPT in French.

2025-11-01T23:20:18+00:00 ― 5 min read

Machine Learning Advancements in NetHack AI Research

New library enhances AI training and evaluation in NetHack.

2025-10-31T09:01:24+00:00 ― 8 min read

Software Engineering Evaluating Software Impact in Science

A look at the importance and challenges of software evaluation in the scientific community.

2025-10-31T05:46:33+00:00 ― 7 min read

Computation and Language Definition Modeling in Natural Language Processing

A look at how definition modeling generates word meanings in NLP.

2025-10-31T04:56:30+00:00 ― 6 min read

Applications Improving Fire Spread Predictions with Advanced Modeling

New modeling techniques enhance wildfire predictions and management efforts.

2025-10-30T21:22:12+00:00 ― 7 min read

Artificial Intelligence Advancing Behavioral Cloning with Search-Based Methods

New approach improves agent adaptability in complex environments.

2025-10-30T11:25:48+00:00 ― 7 min read

Biomolecules New Methods Transform Antibody Protein Design

Researchers innovate protein design with advanced sampling techniques for better antibodies.

2025-10-30T00:56:03+00:00 ― 5 min read

Computation and Language Building Effective Summarization Systems

A guide to selecting models and training data for summarization.

2025-10-29T15:01:18+00:00 ― 5 min read

Computer Vision and Pattern Recognition Improving HD Maps for Self-Driving Cars

A new method enhances map accuracy for safer autonomous driving.

2025-10-29T14:29:42+00:00 ― 7 min read

Machine Learning Evaluating Structural Embeddings in Network Science

A new framework to assess structural embeddings for enhanced data analysis.

2025-10-29T06:19:54+00:00 ― 6 min read

Computation and Language Evaluating Text Generation: New Methods for a Complex Task

A fresh approach to assess the quality of generated text in large language models.

2025-10-29T03:26:06+00:00 ― 6 min read

Machine Learning A New Tool for GPTs in Healthcare

This library simplifies healthcare data processing for predictive modeling using GPTs.

2025-10-28T21:22:42+00:00 ― 5 min read

Information Retrieval Improving POI Recommendation Systems with Context

A new framework enhances local recommendations by using contextual data.

2025-10-28T19:16:18+00:00 ― 6 min read

Computation and Language Assessing Document-Level Relation Extraction Models

A study on the reasoning behind document-level relation extraction model predictions.

2025-10-28T18:52:36+00:00 ― 5 min read

Computation and Language Evaluating Social Reasoning in Language Models

New benchmark assesses language models' understanding of human thoughts and feelings.

2025-10-28T15:19:18+00:00 ― 6 min read

Audio and Speech Processing Improving Speech Technology Evaluations through Detailed Reporting

Examining the impact of detailed evaluations on speech synthesis systems.

2025-10-28T07:58:35+00:00 ― 5 min read

Computation and Language The Need for Explainable Machine Translation Metrics

This paper highlights the importance of explainable evaluation metrics in machine translation.

2025-10-28T04:15:42+00:00 ― 7 min read

Computer Vision and Pattern Recognition Automating Labeling Instructions for Datasets

A new method for generating clear labeling instructions for image datasets.

2025-10-27T15:37:18+00:00 ― 7 min read

Machine Learning LM4HPC: Bridging Language Models and High-Performance Computing

Introducing LM4HPC, a framework to enhance language model application in HPC tasks.

2025-10-27T03:06:48+00:00 ― 6 min read

Digital Libraries Reproducibility Challenges at Interspeech Conferences

A look at reproducibility issues in speech processing research.

2025-10-26T16:18:00+00:00 ― 7 min read

Computation and Language Improving Chatbot Evaluation with C-PMI

A new method enhances chatbot interaction assessment at each dialogue turn.

2025-10-26T09:51:54+00:00 ― 6 min read

Computation and Language Enhancing User Experience in NLP Applications

A new method prioritizes user needs in developing NLP tools for industry.

2025-10-26T08:25:00+00:00 ― 7 min read

Computation and Language Evaluating Language Models with Elementary Math Problems

A study assessing AI language models in solving elementary math challenges.

2025-10-26T06:50:12+00:00 ― 6 min read

Genetics Advances in Plant Breeding: A Look Ahead

Discover how plant breeding is evolving with genomic data and advanced selection methods.

2025-10-25T21:54:39+00:00 ― 6 min read