Introducing models designed to improve natural language processing in Portuguese.
― 6 min read
Cutting edge science explained simply
Introducing models designed to improve natural language processing in Portuguese.
― 6 min read
Learn how active prompt engineering improves tasks for language models.
― 5 min read
This article reviews how chunk size affects AI-generated answers.
― 6 min read
A fresh approach highlights surprising tokens to assess language model training data.
― 6 min read
This study examines methods to enhance Italian language models in specialized fields.
― 9 min read
A new method improves tamper resistance in open-weight language models.
― 7 min read
Enhancing smaller language models like MiniCPM through effective fine-tuning practices.
― 6 min read
Benchmark assesses large language models' ability to understand spatial relationships.
― 4 min read
A new method analyzes language models by examining their specific characteristics.
― 4 min read
This article examines how structured generation affects language model reasoning and comprehension.
― 5 min read
OpenFactCheck provides a framework for evaluating the accuracy of language model outputs.
― 5 min read
Innovative methods to enhance fairness in large language models.
― 7 min read
A new method enhances synthetic data quality for better language model alignment.
― 5 min read
A new system enhances speech recognition by using contextual keywords for better accuracy.
― 5 min read
SAGE-RT creates synthetic data to improve language model safety assessments.
― 5 min read
ArabLegalEval assesses LLMs' performance in handling Arabic legal information.
― 6 min read
A new method to assess language model outputs using multiple LLM judges.
― 7 min read
A new benchmark assesses language model agents for handling scientific data analysis.
― 7 min read
New methods enhance small models' accuracy in telecommunications question answering.
― 5 min read
ConflictBank offers insights into knowledge conflicts in language models.
― 5 min read
This article explores the role of memorization in improving ICL performance.
― 5 min read
Introducing a new model and benchmark for Russian text processing.
― 5 min read
Researchers examine the reliability of metrics for language model safety.
― 6 min read
A deep dive into how next-token prediction shapes language understanding in models.
― 6 min read
FPDT offers a solution for training long-context LLMs more efficiently.
― 5 min read
MemLong improves language models' ability to handle lengthy texts effectively.
― 6 min read
This article analyzes how language models create realistic social networks and their biases.
― 6 min read
This article discusses a new framework for enhancing reasoning in AI models.
― 5 min read
Introducing a framework for generating creativity test items using language models.
― 5 min read
A new method enhances long-text processing in language models for better answers.
― 5 min read
LongGenBench assesses large language models in generating high-quality long text.
― 5 min read
RAG remains vital in optimizing language model responses, especially with long texts.
― 5 min read
This article assesses the effectiveness of sparse autoencoders in knowledge representation about cities.
― 5 min read
A study on the impact of ICL and SFT on language model structure.
― 6 min read
Study shows fine-tuning LLMs with TMs enhances translation quality for organizations.
― 6 min read
This article discusses MLSAEs and their role in examining language model layers.
― 5 min read
ECHO combines diverse reasoning patterns for better problem-solving in language models.
― 6 min read
Study assesses language models on their ability to generate web application code.
― 6 min read
AdaPPA enhances jailbreak attacks on language models by combining safe and harmful responses.
― 5 min read
PF-PPO enhances language models by filtering out unreliable rewards for better code responses.
― 5 min read