Understanding and mitigating hallucination in AI for reliable performance.
― 7 min read
Cutting edge science explained simply
Understanding and mitigating hallucination in AI for reliable performance.
― 7 min read
Advancements in AI models enhance accuracy in medical image interpretation.
― 7 min read
MASSIVE-AMR dataset enhances multilingual understanding in AI systems.
― 5 min read
A new dataset analyzes misleading information in LLM responses.
― 7 min read
Research explores ways to improve trust in LLM outputs through factuality and sourcing.
― 6 min read
This study assesses the reliability of AI tools in legal practice.
― 6 min read
A new method to edit language models effectively while maintaining performance.
― 5 min read
This article examines how negation affects large language models and their accuracy.
― 6 min read
A new approach enhances accuracy and creativity in language model outputs.
― 5 min read
This article explains the phenomenon of hallucinations in image generation models.
― 5 min read
New dataset helps assess AI text accuracy and reliability.
― 6 min read
New techniques aim to fix errors in language models without complete retraining.
― 5 min read
New benchmark assesses how video-language models handle inaccuracies effectively.
― 6 min read
A new framework aims to improve accuracy in semantic parsing models.
― 6 min read
A new method to detect and correct inaccuracies in language models.
― 5 min read
Enhancing pharmacovigilance through reliable language model outputs.
― 6 min read
This study assesses how medical LVLMs perform amidst hallucinations using a new dataset.
― 6 min read
A new framework aims to detect and fix errors in LVLM outputs.
― 7 min read
This study examines how LLMs evaluate their own knowledge and risk of errors.
― 8 min read
A tool to identify misleading answers from large language models.
― 6 min read
TongGu simplifies the understanding of Classical Chinese with specialized techniques.
― 5 min read
A new method generates synthetic data to improve detection of false outputs.
― 6 min read
This study evaluates how well large models handle multiple objects in images.
― 6 min read
Research focuses on improving accuracy and reliability of language models.
― 6 min read
GenSco enhances QA systems by improving multi-hop question answering accuracy and coherence.
― 5 min read
A new method to assess accuracy in language model outputs.
― 4 min read
An overview of NLG progress, challenges, and future research directions.
― 6 min read
This paper studies how training influences the predictions of large language models.
― 6 min read
A critique-based model improves accuracy in spotting inaccuracies in AI-generated text.
― 5 min read
Research highlights methods to detect false information in automotive AI.
― 8 min read
A new benchmark sheds light on hallucination in vision language models.
― 5 min read
Generative AI is improving how data professionals write SQL queries.
― 4 min read
A new dataset enhances the accuracy of event factuality detection in texts.
― 7 min read
Introducing DOPRA, a cost-effective way to improve MLLM accuracy.
― 5 min read
This article evaluates web agents' effectiveness in managing complex online tasks.
― 6 min read
HaloQuest addresses hallucination issues in vision-language models with a new dataset.
― 9 min read
pRAGe helps simplify medical terms for better patient understanding.
― 6 min read
This article discusses challenges in detecting hallucinations in machine translation across various languages.
― 5 min read
This article presents a method to improve context understanding in language models.
― 5 min read
A new benchmark evaluates LLMs for factual accuracy.
― 6 min read