Combining SAM and MLLMs for better object localization in images.
Yi-Chia Chen, Wei-Hua Li, Cheng Sun
― 8 min read
Cutting edge science explained simply
A new model improves depression detection in social media posts with clear explanations.
Sumit Dalal, Sarika Jain, Mayank Dave
― 5 min read
This article presents a new method for realistic dialogue systems using user-specific traits.
Atsushi Otsuka, Kazuya Matsuo, Ryo Ishii
― 4 min read
A new method improves the reliability of large language models' answers.
Derian Boer, Fabian Koch, Stefan Kramer
― 5 min read
Examining LLMs for generating audio programming code using visual languages.
William Zhang, Maria Leon, Ryan Xu
― 5 min read
This article compares discrete and continuous speech representations for effective speech recognition.
Yaoxun Xu, Shi-Xiong Zhang, Jianwei Yu
― 5 min read
Introducing HTLA, a model improving text classification accuracy through better label alignment.
Ashish Kumar, Durga Toshniwal
― 7 min read
This article discusses a new rating system for evaluating language models more fairly.
Jasper Dekoninck, Maximilian Baader, Martin Vechev
― 5 min read
Updates enhance the FLORES dataset for Hausa, Northern Sotho, Xitsonga, and isiZulu.
Idris Abdulmumin, Sthembiso Mkhwanazi, Mahlatse S. Mbooi
― 5 min read
A model improving sentence parsing by focusing on entity structure.
Xinyi Bai
― 5 min read
Streamlined systems for improved interactions between humans and machines.
Thierry Petit, Arnault Pachot, Claire Conan-Vrinat
― 5 min read
This study evaluates how well VLMs can understand visual perspectives.
Gracjan Góral, Alicja Ziarko, Michal Nauman
― 5 min read
New methods enhance how tools are selected for language models.
Suhong Moon, Siddharth Jha, Lutfi Eren Erdogan
― 8 min read
Examining LVLMs' effectiveness in generating multilingual art explanations.
Shintaro Ozaki, Kazuki Hayashi, Yusuke Sakai
― 7 min read
A hybrid agent for the Werewolf game enhances interaction and gameplay.
Takehiro Sato, Shintaro Ozaki, Daisaku Yokoyama
― 6 min read
The study evaluates how well language models handle Hakka cultural understanding.
Chen-Chi Chang, Ching-Yuan Chen, Hung-Shin Lee
― 5 min read
Addressing biases in chatbots through regular audits to align with societal values.
Yanchen Wang, Lisa Singh
― 4 min read
A new method enables language models to correct their own mistakes in math.
Yuchen Yan, Jin Jiang, Yang Liu
― 5 min read
A new benchmark evaluates biases in language models used for medical diagnoses.
Rajat Rawat, Hudson McBride, Dhiyaan Nirmal
― 5 min read
A new design to improve long-term memory in language models.
Yuan Yang, Siheng Xiong, Ehsan Shareghi
― 5 min read
ERFSL streamlines reward function creation using large language models.
Guanwen Xie, Jingzehua Xu, Yiyuan Yang
― 5 min read
Examining the efficiency and latency challenges of SMoE models in language processing.
Soumajyoti Sarkar, Leonard Lausen, Volkan Cevher
― 6 min read
A new model improves language processing by focusing on input representation.
Benjamin L. Badger
― 6 min read
CHESS improves efficiency of language models while maintaining performance on resource-limited devices.
Junhui He, Shangyu Wu, Weidong Wen
― 6 min read
A new framework combines humor theories with machine learning for effective humor detection.
Victor De Marez, Thomas Winters, Ayla Rigouts Terryn
― 8 min read
New method enhances efficiency of large language models by focusing on relevant information.
Barys Liskavets, Maxim Ushakov, Shuvendu Roy
― 6 min read
This study investigates the performance of entity linking models in conversational contexts.
Mohanna Hoveyda, Arjen P. de Vries, Maarten de Rijke
― 6 min read
Learn how keyphrase prediction enhances content organization and retrieval.
Muhammad Umair, Tangina Sultana, Young-Koo Lee
― 5 min read
New framework enhances sign language recognition through context and visual inputs.
Yuqi Liu, Wenqian Zhang, Sihan Ren
― 6 min read
A framework using memory tokens improves video understanding and interaction.
Yuxuan Wang, Cihang Xie, Yang Liu
― 7 min read
Researchers develop a dataset to teach machines about metaphor and sarcasm.
Ke Chang, Hao Li, Junzhao Zhang
― 6 min read
A new approach to tokenization enhances analysis of ancient scripts.
Yingfa Chen, Chenlong Hu, Cong Feng
― 6 min read
A look at the complexities and improvements in speech-to-speech translation technology.
Vincent Wilmet, Johnson Du
― 6 min read
A new dataset enhances sign language recognition from multiple viewpoints.
Oline Ranum, David R. Wessels, Gomer Otterspeer
― 7 min read
A new method enhances long-text processing in language models for better answers.
Yun Joon Soh, Hanxian Huang, Yuandong Tian
― 5 min read
AIvril enhances RTL code generation through automated syntax checking and functional verification.
Mubashir ul Islam, Humza Sami, Pierre-Emmanuel Gaillardon
― 5 min read
Research focuses on enhancing language learning through visually grounded speech models.
Leanne Nortje
― 8 min read
Examining the relationship between LLMs and human cognition.
Qian Niu, Junyu Liu, Ziqian Bi
― 6 min read
Assessing large language models' ability to handle privacy regulations.
Xichou Zhu, Yang Liu, Zhou Shen
― 5 min read
A new method for enhancing k-NN retrieval accuracy and efficiency.
Sepanta Zeighami, Zac Wellmer, Aditya Parameswaran
― 5 min read