An overview of how machines process text, images, and audio.
― 6 min read
Cutting edge science explained simply
An overview of how machines process text, images, and audio.
― 6 min read
Examining the reliability of human feedback for assessing language model outputs.
― 6 min read
A new framework enhances detection of harmful online language through continuous learning.
― 7 min read
New methods improve machine translation for low-resource languages.
― 4 min read
Examining how biases affect the quality of language model evaluations.
― 5 min read
A new method combines language models with reinforcement learning for AI training.
― 5 min read
New methods improve speech processing and generation in language models.
― 5 min read
SelfExtend offers a new approach to enhance LLMs' long text processing.
― 5 min read
This paper examines techniques to reduce hallucination in language models for better accuracy.
― 6 min read
A new framework enhances visual reasoning using language models as controllers.
― 5 min read
Exploring how language models recover and adapt after changes.
― 7 min read
New method improves learning new classes with less data.
― 4 min read
A new method for accurate Arabic text diacritization is introduced.
― 7 min read
New method enhances retrieval efficiency across languages without heavy translation.
― 7 min read
A new dataset for improving e-commerce image and text recognition.
― 7 min read
Examining the nature and capabilities of language models in generating meaningful text.
― 7 min read
A dataset tests language models on self-referential language tasks.
― 6 min read
QE-fusion enhances translation quality by combining multiple candidate outputs.
― 5 min read
Evaluating language models on their ability to grasp context in communication.
― 6 min read
A new approach using multi-agent systems to enhance smaller language models.
― 6 min read
A study reveals small language models struggle with multiple choice questions.
― 6 min read
This study focuses on enhancing retrieval-augmented generation methods for Brazilian Portuguese.
― 6 min read
A new dataset enhances the connection between language and 3D environments.
― 7 min read
A new method to improve response times in language models by separating processing phases.
― 6 min read
This study assesses the performance of language models on modified math problems.
― 5 min read
A new method enhances how we identify synonyms and antonyms.
― 5 min read
Investigating the risks of jailbreak attacks on Large Language Models.
― 6 min read
Microsoft's MuLanTTS offers natural and expressive French text-to-speech capabilities.
― 5 min read
MuMo speeds up language model performance for non-Roman scripts.
― 7 min read
The study investigates universal neurons in GPT-2 models and their roles.
― 4 min read
A study on MLLMs and their performance in nonverbal reasoning tasks.
― 7 min read
This article explores using game theory to enhance communication through language models.
― 8 min read
The CLAP model bridges audio and text processing for various applications.
― 4 min read
This study examines how language structure boosts layout predictions in machines.
― 4 min read
A new framework speeds up information retrieval for language models.
― 6 min read
Exploring ways to improve sequence labeling in language models.
― 6 min read
This article examines how transformer models deal with multiword expressions and their associated challenges.
― 7 min read
Gradient-Based Red Teaming improves safety in language models.
― 5 min read
Exploring new ways to categorize inaccuracies in language models for better understanding.
― 10 min read
A new dataset enhances the extraction of key entities in various English texts.
― 5 min read