New metrics improve large language models' effectiveness in education.
― 6 min read
Cutting-edge science explained simply
This article examines how large language models recall information from training data.
― 6 min read
Adapting multilingual models can improve performance for less-used Uralic languages.
― 5 min read
Explore the role of ordinal classification and the impact of pretrained language models.
― 6 min read
Explore how DETAIL enhances understanding of in-context learning in language models.
― 6 min read
TPO offers a new method to align language models with human preferences efficiently.
― 6 min read
ThReaD improves LLMs' performance on complex tasks through dynamic thread management.
― 5 min read
This article examines the risks of fine-tuning language models for safety.
― 3 min read
A new approach enhances prompt diversity for safer language models.
― 7 min read
Research reveals the challenges of watermark detection in large language models.
― 7 min read
This study presents a system to enhance language model accuracy using adversarial challenges.
― 7 min read
Learn how adaptive teams improve task performance with language model agents.
― 6 min read
MAP-Neo aims for transparency and performance in AI language modeling.
― 5 min read
Examining the challenges and solutions in LLM watermarking to prevent misuse.
― 6 min read
New resources enhance assessment of Korean language models.
― 4 min read
Research shows diverse instructions improve language model performance in unseen tasks.
― 7 min read
Research introduces a method to improve decision-making in language model agents.
― 9 min read
This study examines how LLMs handle reasoning in abstract and contextual scenarios.
― 5 min read
The Block Transformer improves text processing speed and efficiency in language models.
― 6 min read
Recent tests reveal LLMs' weaknesses in simple reasoning despite high benchmark scores.
― 5 min read
A guide to transforming non-idiomatic Python code using modern techniques.
― 6 min read
This study examines how LLMs handle changes in summarization tasks.
― 8 min read
This study explores how to create sentences that maintain specific meanings using FrameNet.
― 9 min read
This study assesses GPT-4's ability to extract data from materials science literature.
― 6 min read
Jamming attacks can disrupt retrieval-augmented generation systems by blocking responses.
― 6 min read
This article evaluates the capability of language models to simulate game environments.
― 5 min read
A new approach to assess reasoning strategies with a focus on computational costs.
― 7 min read
MedExQA sets a new standard for evaluating medical language models with a focus on explanations.
― 6 min read
Study evaluates how well LLMs reason beyond immediate context.
― 5 min read
Exploring the limitations of Direct Preference Optimization in language model training.
― 6 min read
Evaluating how well language models perform research surveys across various academic fields.
― 6 min read
A new tool to assess language models' continuous improvement through feedback.
― 6 min read
A new framework assesses language models on emotional intelligence and creativity.
― 7 min read
New methods enhance language models' performance through better example selection.
― 7 min read
ReadCtrl allows language models to better match text complexity to reader abilities.
― 5 min read
GAMA improves audio processing by merging sound and language insights.
― 5 min read
SciEx reveals strengths and challenges of LLMs in scientific evaluation.
― 6 min read
This study shows how BERT learns COVID-19 facts through continuous training.
― 4 min read
A new benchmark tests LLMs' abilities with structured data formats.
― 6 min read
A new framework enhances how LLM agents learn through detailed process guidance.
― 7 min read