GUICourse aims to improve interaction with digital interfaces through targeted datasets for GUI agents.
― 4 min read
Cutting-edge science explained simply
VideoVista offers a comprehensive evaluation for video question-answering models.
― 5 min read
This study reveals how language models change behavior during training.
― 6 min read
This study examines methods to enhance machine empathy through storytelling.
― 7 min read
A study on the decision-making processes of large language models.
― 4 min read
MMNeedle benchmark tests multimodal models on long context handling capabilities.
― 5 min read
This article examines the true meaning of democratization in AI.
― 6 min read
This study analyzes how language influences cultural values in large models.
― 8 min read
A method for identifying emotions and their causes in unlabeled data.
― 5 min read
L-ICV improves performance in visual question answering using fewer examples.
― 6 min read
This article examines how relational concepts shape knowledge retrieval in large language models.
― 6 min read
APPL streamlines development with large language models using an intuitive, Python-like syntax.
― 2 min read
Examining the roots and implications of bias in language technology.
― 6 min read
Long-context language models streamline complex tasks and improve interaction with AI.
― 7 min read
A new framework addresses challenges in knowledge distillation for long-tailed data.
― 7 min read
This article examines ways to improve planning abilities in large language models.
― 7 min read
A new dataset enhances story understanding across multiple languages.
― 6 min read
Exploring the safety challenges posed by adversarial attacks on multimodal agents.
― 6 min read
GLM-4 models show improved capabilities in language understanding and generation.
― 8 min read
This article examines how LLMs answer complex multi-hop questions.
― 7 min read
A new model combines LLMs and machine translation for better language processing.
― 6 min read
Examining the issues and potential improvements in academic peer review.
― 7 min read
Introducing a new scale for evaluating emotional depth in storytelling.
― 8 min read
A method to evaluate model knowledge through internal processing.
― 7 min read
Hierarchical Prompting Taxonomy improves evaluation methods for language models.
― 6 min read
DetectBench evaluates LLMs on their ability to detect hidden evidence in reasoning tasks.
― 5 min read
Introducing SeTAR, a training-free solution for detecting out-of-distribution data in neural networks.
― 7 min read
A study on using LLMs to judge other LLMs and its implications.
― 7 min read
Explore the impact of IA research on natural language processing.
― 6 min read
PromptDSI improves document retrieval by efficiently managing new and existing information.
― 6 min read
A new method improves machine translation for underrepresented languages.
― 5 min read
MultiSocial dataset aids in detecting machine-generated texts across 22 languages.
― 6 min read
P-Tailor customizes language models using the Big Five Personality Traits.
― 6 min read
This article discusses how deep neural networks learn language through next-token prediction.
― 7 min read
FuseGen combines multiple models for better quality synthetic data in machine learning.
― 7 min read
Synthetic data enhances the accuracy of stance detection in online discussions.
― 7 min read
A new method to improve model stability and performance in low-resource settings.
― 6 min read
IPEval assesses language models' understanding of intellectual property concepts.
― 5 min read
New methods are improving communication for the deaf community through enhanced sign language recognition.
― 6 min read
Snap helps large language models unlearn specific information while preserving their performance.
― 7 min read