A study on LLM performance using instruction tuning and in-context learning.
Taihang Wang, Xiaoman Xu, Yimin Wang
― 5 min read
Cutting edge science explained simply
A study on LLM performance using instruction tuning and in-context learning.
Taihang Wang, Xiaoman Xu, Yimin Wang
― 5 min read
A new model enhances efficiency in collecting language data during fieldwork.
Aso Mahmudi, Borja Herce, Demian Inostroza Amestica
― 6 min read
This study examines how language models create effective research paper titles from abstracts.
Tohida Rehman, Debarshi Kumar Sanyal, Samiran Chattopadhyay
― 5 min read
This study examines the effectiveness of Sparse Autoencoders in understanding language model features.
David Chanin, James Wilken-Smith, Tomáš Dulka
― 6 min read
PODA enhances AI's ability to understand texts and reason logically.
Chenxu Wang, Ping Jian, Zhen Yang
― 6 min read
A new framework streamlines microstructure design using natural language commands.
Nikita Kartashov, Nikolaos N. Vlassis
― 7 min read
This research explores LLM effectiveness in various languages beyond English.
Daoyang Li, Mingyu Jin, Qingcheng Zeng
― 6 min read
Research shows AI can predict user stances from indirect social media posts.
Siyuan Brandon Loh, Liang Ze Wong, Prasanta Bhattacharya
― 6 min read
This article examines how different layers affect LLM performance.
Yang Zhang, Yanfei Dong, Kenji Kawaguchi
― 5 min read
A study on how AI agents follow user-defined rules using the ACS dataset.
Lior Madmoni, Amir Zait, Ilia Labzovsky
― 9 min read
CADA-GAN enhances ASR systems' performance across various recording environments.
Chien-Chun Wang, Li-Wei Chen, Cheng-Kang Chou
― 6 min read
New methods boost LLM performance by compressing token input.
Runsong Zhao, Pengcheng Huang, Xinyu Liu
― 5 min read
MQM-APE enhances the quality of machine translation evaluations through advanced error analysis.
Qingyu Lu, Liang Ding, Kanjian Zhang
― 7 min read
This study evaluates how well LLMs understand narrative tropes in movie summaries.
Hung-Ting Su, Ya-Ching Hsu, Xudong Lin
― 4 min read
FLEX method offers a new approach for evaluating text-to-SQL systems accurately.
Heegyu Kim, Taeyang Jeon, Seunghwan Choi
― 6 min read
New features enhance user experience in screen understanding and multilingual interactions.
Naman Goyal
― 6 min read
Using technology to gather plant trait information efficiently from the web.
Diego Marcos, Robert van de Vlasakker, Ioannis N. Athanasiadis
― 4 min read
EVA combines audio and visual signals for better speech recognition accuracy.
Yihan Wu, Yifan Peng, Yichen Lu
― 4 min read
A novel model enhances text embeddings through in-context learning strategies.
Chaofan Li, MingHao Qin, Shitao Xiao
― 5 min read
A new method aims to reduce semantic leakage in cross-lingual sentence embeddings.
Dayeon Ki, Cheonbok Park, Hyunjoong Kim
― 5 min read
New models aim to combat harmful language online through advanced detection techniques.
Tonmoy Roy, Md Robiul Islam, Asif Ahammad Miazee
― 6 min read
QualIT enhances text analysis by combining language models and clustering techniques.
Satya Kapoor, Alex Gil, Sreyoshi Bhaduri
― 5 min read
This study investigates AI's role in salary negotiation advice and potential biases.
R. Stuart Geiger, Flynn O'Sullivan, Elsie Wang
― 4 min read
A new framework improves dialogue quality in educational chatbots for effective learning.
Haoyu Huang, Tong Niu, Rui Yang
― 6 min read
This research investigates LLMs' performance in cognitive tasks similar to infant behavior.
Pengrui Han, Peiyang Song, Haofei Yu
― 6 min read
A new tool evaluates large language models' performance across multiple data types.
Yizhi Li, Ge Zhang, Yinghao Ma
― 5 min read
This article presents a new framework to enhance inference-time techniques for language models.
Jon Saad-Falcon, Adrian Gamarra Lafuente, Shlok Natarajan
― 5 min read
A new method enhances aspect-sentiment triplet extraction accuracy.
Iwo Naglik, Mateusz Lango
― 6 min read
A new framework improves prompt creation for large language models.
Mingqi Li, Karan Aggarwal, Yong Xie
― 6 min read
This study assesses various models for retrieving clinical information effectively.
Skatje Myers, Timothy A. Miller, Yanjun Gao
― 7 min read
A new method enhances Flash Attention performance for sparse attention masks.
Agniv Sharma, Jonas Geiping
― 5 min read
A new metric enhancing the assessment of factual consistency in automatic summaries.
Yuxuan Ye, Edwin Simpson, Raul Santos Rodriguez
― 5 min read
Assessing the effectiveness of LLMs for threat analysis.
Sanchana Srikanth, Mohammad Hasanuzzaman, Farah Tasnur Meem
― 10 min read
Examining the advantages of decoder-only models for machine translation tasks.
Gaëtan Caillaut, Raheel Qader, Mariam Nakhlé
― 6 min read
A new AI tool helps assess COVID-19 risk through patient conversations.
Mohammad Amin Roshani, Xiangyu Zhou, Yao Qiang
― 4 min read
This study enhances key information extraction using a new model for unstructured documents.
Furkan Pala, Mehmet Yasin Akpınar, Onur Deniz
― 9 min read
This study highlights methods to enhance large language models in medical settings.
Clément Christophe, Tathagata Raha, Svetlana Maslenkova
― 6 min read
Examining how AI can identify and measure uncertainty in human beliefs.
Anthony Sicilia, Malihe Alikhani
― 7 min read
This approach simplifies choosing effective pretraining datasets for language models.
Tristan Thrush, Christopher Potts, Tatsunori Hashimoto
― 8 min read
A new approach enhances mental health session summaries through a planning engine.
Aseem Srivastava, Smriti Joshi, Tanmoy Chakraborty
― 7 min read