A look at uncertainty types and their importance in language models.
― 5 min read
Cutting-edge science explained simply
A look at models that operate without matrix multiplication for better efficiency.
― 6 min read
A new method improves translation quality through effective data augmentation.
― 6 min read
This article investigates how language models process verbal aspect in Russian.
― 9 min read
Discover how Extended Mind Transformers improve memory handling in language models.
― 6 min read
This study focuses on improving zero-shot learning through better entity and relation descriptions.
― 3 min read
A new method enhances event resolution by combining language models for better accuracy.
― 5 min read
Zyda, a dataset with 1.3 trillion tokens, enhances language model training.
― 5 min read
Improving methods for assessing meaning similarity between sentences in natural language.
― 6 min read
A new dataset evaluates Large Language Models' reasoning with complex queries.
― 8 min read
Assessing question difficulty enhances the effectiveness of information retrieval systems.
― 6 min read
A new method improves confidence scoring in language models using stable explanations.
― 9 min read
Introducing PlugIR for better image searches through interactive user dialogue.
― 7 min read
MIVPG improves how models interpret images and text together.
― 5 min read
A new framework improves pruning methods for large language models without retraining.
― 5 min read
A new method enhances image classification using detailed textual descriptions.
― 7 min read
Introducing a method to fine-tune LLMs on low-resource devices.
― 5 min read
A new dataset enhances research in linking events across documents with creative language.
― 6 min read
This study examines the use of AI in analyzing student answers in biology education.
― 6 min read
A new model replicates human-like understanding in AI systems.
― 7 min read
New methods like PromptFix help secure language models from hidden threats.
― 5 min read
Exploring multi-label classification to enhance discourse relation recognition.
― 8 min read
Evaluating methods for precise control of text features in LLM outputs.
― 13 min read
A new approach enhances language model alignment using limited human-annotated data.
― 4 min read
A new method enhances the alignment and safety of large language models.
― 6 min read
A new method sheds light on how language models remember training data.
― 8 min read
A new method enhances uncertainty estimation in language models, boosting user trust.
― 5 min read
Explore the learning abilities of language models and their applications.
― 7 min read
ABEX uses Abstract-and-Expand to enhance training data for natural language understanding tasks.
― 8 min read
This paper explores how MLLMs store and transfer information in answering visual questions.
― 6 min read
Learn how to train text embedding models effectively.
― 5 min read
New systems enhance the classification of moral values in texts.
― 6 min read
This study examines how LLMs handle changes in summarization tasks.
― 8 min read
A look at the importance of culture in Natural Language Processing advancements.
― 6 min read
This tool simplifies prompt creation and analysis for mixed content input.
― 7 min read
ETRASK improves relation extraction through innovative instance selection and pretrained models.
― 5 min read
A new method improves large language models' performance in specialized fields.
― 7 min read
FastGAS improves efficiency in selecting examples for in-context learning using a graph-based approach.
― 7 min read
A method to forecast non-factual answers from language models before they generate responses.
― 6 min read
The VTrans method significantly reduces transformer model sizes without sacrificing performance.
― 5 min read