HaloQuest addresses hallucination issues in vision-language models with a new dataset.
― 9 min read
Cutting-edge science explained simply
Michelangelo evaluates language models on their ability to reason through long contexts.
― 4 min read