Discover how vocabulary size influences the performance of large language models.
― 6 min read
Cutting edge science explained simply
Discover how vocabulary size influences the performance of large language models.
― 6 min read
This study compares methods for stance detection on key topics in Arabic texts.
― 6 min read
A study on how well LLMs function as reliable knowledge bases.
― 5 min read
A new approach to analyzing unstructured data using advanced querying techniques.
― 6 min read
A new dataset to assess question-answering in Indian languages.
― 5 min read
A new model effectively identifies authorship by analyzing writing styles.
― 5 min read
LIAR offers a new way to prune models without retraining, enhancing efficiency and performance.
― 6 min read
RAG combines data retrieval and text generation for better language model performance.
― 8 min read
A new method enhances prompt tuning effectiveness and interpretability.
― 8 min read
This study explores methods to create smaller language models effectively and affordably.
― 5 min read
Research reveals how friendly prompts can mislead AI systems.
― 5 min read
A new method that enhances LLM performance while reducing resource use.
― 6 min read
A study on the reliability of LLM self-explanations in natural language tasks.
― 6 min read
ChatQA 2 enhances performance in processing long texts and retrieval tasks.
― 6 min read
This study assesses the reasoning skills of audio-language models with a new task.
― 7 min read
A robust dataset for training advanced chat-based AI systems.
― 5 min read
A new approach to state-space models enhances efficiency and performance in language tasks.
― 6 min read
New model improves visual reasoning by utilizing 3D reconstruction methods.
― 6 min read
New method RoE enhances multi-modal large language models' efficiency with dynamic routing.
― 7 min read
Examining the impact of model size on data-to-text generation performance.
― 6 min read
Learn how early exiting improves efficiency in Natural Language Processing models.
― 5 min read
This study focuses on generating citations with appropriate length for better quality.
― 5 min read
E-LLaGNN improves GNNs by selectively using language models for better performance.
― 5 min read
A modular approach enhances sentence encoders across various languages.
― 6 min read
A new approach to improve text-to-image model prompts for enhanced results.
― 5 min read
Learn how Text Style Transfer changes text style while preserving meaning.
― 8 min read
Exploring how transformers analyze sentiments in text, such as movie reviews.
― 4 min read
This article examines how user assistance can improve large language models' performance in generating SQL queries.
― 5 min read
A method to improve vision-language models by reducing overfitting.
― 7 min read
A new dataset enhances the accuracy of event factuality detection in texts.
― 7 min read
This article examines multi-prompt decoding to enhance text generation quality.
― 6 min read
TAGCOS optimizes instruction tuning by selecting effective data subsets for language models.
― 6 min read
This study analyzes methods for improving language model alignment with human preferences.
― 6 min read
A new knowledge base for chemical patent searches aims to enhance reaction extraction.
― 7 min read
A new method combines language models and databases for better data access.
― 7 min read
A new model improves efficiency in task-oriented dialog systems without heavy manual work.
― 6 min read
DDK enhances knowledge distillation, making smaller language models more efficient.
― 5 min read
A new framework enhances knowledge graph completion efficiency and accuracy using large language models.
― 7 min read
Research enhances language models' ability to process time-related information in tables.
― 4 min read
A new method improves how vision-language models adapt during testing.
― 7 min read