A study on how context affects leaderboard generation in AI research.
― 5 min read
Cutting-edge science explained simply
Evaluating LLMs' strategic reasoning capabilities using diverse games.
― 7 min read
This study examines the use of AI in analyzing student answers in biology education.
― 6 min read
Examining how LLMs transform data accessibility and interaction.
― 5 min read
This article reviews how language models perform in spatial reasoning tasks.
― 7 min read
A new model replicates human-like understanding in AI systems.
― 7 min read
New methods like PromptFix help secure language models from hidden threats.
― 5 min read
Exploring multi-label classification to enhance discourse relation recognition.
― 8 min read
Evaluating methods for precise control of text features in LLM outputs.
― 13 min read
A new approach enhances language model alignment using limited human-annotated data.
― 4 min read
A new method enhances the alignment and safety of large language models.
― 6 min read
A look into how words relate within language systems over time.
― 5 min read
New method improves speech translation in noisy environments while preserving expressiveness.
― 4 min read
Circuit breakers offer a new method for effectively preventing harmful AI outputs.
― 3 min read
VISTA improves how we find information by integrating text and visuals.
― 7 min read
Explore the learning abilities of language models and their applications.
― 7 min read
ABEX uses Abstract-and-Expand to enhance training data for natural language understanding tasks.
― 8 min read
A new method to examine conversational tones in humans and AI.
― 6 min read
SPAC offers a new way to enhance language model responses.
― 6 min read
An analysis of Transformers' struggles with counting and copying tasks.
― 7 min read
MLVU benchmark aims to improve machine understanding of long videos.
― 5 min read
A new method to develop adaptable agents using diverse environments.
― 6 min read
A new method to assess commonsense reasoning in AI models through open-ended tasks.
― 8 min read
New systems enhance the classification of moral values in texts.
― 6 min read
PredEx offers predictions and explanations for legal judgments in India.
― 6 min read
Highlighting the need for fairness in mental health speech datasets.
― 6 min read
This study examines how LLMs handle changes in summarization tasks.
― 8 min read
UltraMedical collections improve medical language models and address data shortages.
― 6 min read
A look at the importance of culture in Natural Language Processing advancements.
― 6 min read
A dataset to identify propaganda in Arabic memes for better media literacy.
― 5 min read
A new approach improves activity recognition by combining various data types.
― 7 min read
This study evaluates how effectively LLMs answer medical questions.
― 6 min read
Research seeks to improve how LLMs handle misleading information.
― 6 min read
A new framework enhances self-training for large language models using guided reasoning.
― 8 min read
ETRASK improves relation extraction through innovative instance selection and pretrained models.
― 5 min read
New method improves large language models' performance in specialized fields.
― 7 min read
StreamSpeech improves real-time speech translation with efficiency and quality.
― 5 min read
A method to forecast non-factual answers from language models before they generate responses.
― 6 min read
This article examines how language models form concepts and relate them to understanding.
― 6 min read
Methods to create accurate timelines from event annotations in texts.
― 6 min read