Learn how to reduce BERT's size while maintaining performance through knowledge distillation.
― 5 min read
Cutting-edge science explained simply
A new method enhances attribution and correctness in language models' outputs.
― 3 min read
A new approach to understanding dialogue agents through role-play and simulation.
― 18 min read
This article analyzes GPT-4's abilities on abstract reasoning tasks and the impact of object representation.
― 5 min read
A tool to assess large language models' multi-step reasoning capabilities.
― 5 min read
This research shows how entailment and self-training improve language models without needing human-labeled data.
― 6 min read
An in-depth analysis of ChatGPT's capabilities across various tasks and challenges.
― 6 min read
This article explores how language models enhance AI's strategic reasoning in games.
― 5 min read
Research examines syntax understanding in spoken language models using various methods.
― 6 min read
Introducing TopEx, a fresh approach to understanding language model differences.
― 6 min read
Introducing a French model that outperforms leading benchmarks with less data.
― 5 min read
Exploring methods to ensure personal information safety in language models.
― 5 min read
A study on Auto-GPT performance in decision-making tasks.
― 6 min read
LexGPT aims to assist legal professionals with understanding and generating legal text.
― 5 min read
This paper explores how language models streamline project planning and execution.
― 6 min read
This study highlights the need for better recognition of non-binary pronouns in language models.
― 6 min read
A new method enhances reasoning accuracy in language models using structured prompts.
― 7 min read
WOGLI focuses on the impact of word order on German language inference.
― 6 min read
PandaLM automates evaluation processes to improve large language models' instruction following.
― 5 min read
ToolAlpaca aims to help smaller models effectively learn to use real-world tools.
― 5 min read
Learn how RETA-LLM combines language models and retrieval systems for better answers.
― 6 min read
This article discusses using SVG to improve how language models interpret images.
― 5 min read
TrojLLM creates hidden prompts to manipulate large language model outputs.
― 4 min read
A new model designed to analyze Romanian tweets using advanced technology.
― 5 min read
Investigating prompt-based methods for improving language models in research data retrieval.
― 7 min read
Larger language models may perform poorly on certain tasks, raising critical questions in AI research.
― 5 min read
A new method enhances control over text generation in language models.
― 5 min read
Strategies to boost ChatGPT's efficiency across various language tasks.
― 5 min read
New dataset highlights AI performance in creative tasks with distractions.
― 5 min read
A fresh approach to assess the quality of generated text in large language models.
― 6 min read
Examining how AI handles human-like reasoning and its biases.
― 5 min read
A new method enhances speech recognition models using only text data for adaptation.
― 5 min read
A study on the effectiveness of language models for grammar correction in Brazilian Portuguese.
― 5 min read
This article evaluates how language models reflect diverse global opinions.
― 7 min read
A study on how well advanced models perform in Arabic language tasks.
― 7 min read
Assessing large language models' performance in answering biomedical questions through BioASQ.
― 7 min read
A study on assessing text generation quality from large language models.
― 6 min read
Study shows how well models handle paraphrasing in textual entailment tasks.
― 6 min read
A new benchmark aims to improve language models for social media communication.
― 7 min read
BLUEX offers a rich resource to evaluate language models in Portuguese using entrance exam questions.
― 6 min read