Examining changes in social biases in language models over time.
― 7 min read
Cutting edge science explained simply
Discover how large language models are transforming simultaneous translation.
― 6 min read
Examining the impact of data contamination on language model performance and evaluation.
― 6 min read
A look at controlling language model behavior with the KL-then-steer technique.
― 5 min read
Examining how language models handle ambiguous Spanish words through a new dataset.
― 5 min read
This article discusses the security risks and defense strategies for large language models.
― 8 min read
This article discusses the adaptation of language models for improved support across various languages.
― 5 min read
A study on how language models can ignore instructions and the implications of that behavior.
― 7 min read
This research improves language models' planning through cognitive maps.
― 5 min read
A study evaluates how machines create varied and creative poetry compared to humans.
― 6 min read
A study on how machines adapt to phonological changes in speech.
― 7 min read
A new method improves how we assess counter narratives to hate speech.
― 6 min read
Research shows untrained models connect with human brain responses in language processing.
― 8 min read
Research highlights in-context learning abilities in large language models.
― 6 min read
A new framework improves language models' representation of diverse human values.
― 7 min read
Research assesses language models' claim verification abilities using a new dataset.
― 5 min read
This article examines how certain neurons affect uncertainty in language model predictions.
― 6 min read
This study presents a dataset and method to enhance Chinese ASR accuracy using Pinyin.
― 7 min read
New methods refine reasoning skills in language models for better task performance.
― 7 min read
A new method enhances how language models align with human values.
― 6 min read
An analysis of language models and their role in healthcare.
― 6 min read
A new model merges Mamba and Transformer for improved language processing.
― 5 min read
A novel method combines vision and language for unseen object pose estimation.
― 5 min read
Exploring methods to enhance LLMs for practical applications.
― 9 min read
A study assesses how well MLLMs interpret visual data and how their performance compares to that of humans.
― 5 min read
Evaluating how LLMs create persuasive text across various topics.
― 6 min read
A fresh method addresses data contamination in testing language models.
― 5 min read
FineWeb offers 15 trillion tokens to improve language model training.
― 7 min read
This study benchmarks language models' performance using Italian INVALSI tests.
― 7 min read
A study on translating Nigerian English for better accessibility in Nollywood films.
― 6 min read
Can self-play enhance language models' performance in cooperative settings?
― 6 min read
Assessing strategies to manage copyright issues in language models.
― 6 min read
NeBuLa improves action prediction from conversations in collaborative gaming.
― 6 min read
This article examines whether large language models possess beliefs and intentions.
― 5 min read
An overview of automata, their types, and practical uses in computer science.
― 6 min read
New method enhances spiking neural networks' performance in language tasks.
― 6 min read
New model improves speech-to-text translation using large language models.
― 6 min read
A new approach to enhance accuracy in verifying information generated by language models.
― 5 min read
This article examines how wording affects language model performance.
― 6 min read
A new method measures how language models adapt their beliefs with new evidence.
― 9 min read