New models improve efficiency in retrieving information across various languages.
Rohan Jha, Bo Wang, Michael Günther
― 6 min read
Cutting-edge science explained simply
Latest Articles
Rena Gao, Jingxuan Wu, Carsten Roever
― 6 min read
Florian Atzenhofer-Baumgartner, Tamás Kovács
― 5 min read
Leandro Carísio Fernandes, Gustavo Bartz Guedes, Thiago Soares Laitz
― 5 min read
Examining memorization in language models and sampling techniques.
Luka Borec, Philipp Sadler, David Schlangen
― 4 min read
This article discusses a new framework for enhancing reasoning in AI models.
Xin Zheng, Jie Lou, Boxi Cao
― 5 min read
A new benchmark aids in assessing speech tokenizers for better performance.
Shikhar Vashishth, Harman Singh, Shikhar Bharadwaj
― 6 min read
Improving how machines assist users through better interaction and response measures.
Dan Bohus, Sean Andrist, Yuwei Bao
― 5 min read
Exploring how LLMs aid in movement analysis and their challenges.
Yuhan Ji, Song Gao
― 5 min read
Learn how causal knowledge graphs analyze relationships and inform decisions.
Oktie Hassanzadeh
― 5 min read
OnlySportsLM offers a tailored solution for effective sports language processing.
Zexin Chen, Chengxi Li, Xiangyu Xie
― 5 min read
A new approach to valuing data emphasizes its uniqueness for machine learning.
Mohamad Rida Rammal, Ruida Zhou, Suhas Diggavi
― 6 min read
New metric improves understanding of AI text and human writing.
Tyler Malloy, Maria José Ferreira, Fei Fang
― 6 min read
Clear AI explanations build trust and ensure responsible use in various fields.
Melkamu Mersha, Khang Lam, Joseph Wood
― 6 min read
MAPWise dataset challenges models on map-based questions and evaluates their reasoning skills.
Srija Mukhopadhyay, Abhishek Rajgaria, Prerana Khatiwada
― 6 min read
A new method for detecting hallucinations in language models using corrupted data.
Spencer Whitehead, Jacob Phillips, Sean Hendryx
― 8 min read
This study examines stance detection model performance without prior topic knowledge.
Abu Ubaida Akash, Ahmed Fahmy, Amine Trabelsi
― 7 min read
A new approach enhances event detection in LLMs using Semantic Causal Graphs.
Mazal Bethany, Emet Bethany, Brandon Wherry
― 8 min read
A new method improves automatic speech recognition by preserving sound order in knowledge transfer.
Xugang Lu, Peng Shen, Yu Tsao
― 4 min read
Introducing a framework for generating creativity test items using language models.
Antonio Laverghetta, Simone Luchini, Averie Linell
― 5 min read
A new method leverages speech data to improve autism assessments.
Jihyun Mun, Sunhee Kim, Minhwa Chung
― 6 min read
A novel method enhances how we process long videos.
Gueter Josmy Faure, Jia-Fong Yeh, Min-Hung Chen
― 5 min read
This study examines how language models enhance OCR outputs for historical newspapers.
Jonathan Bourne
― 6 min read
Comparing language models' effectiveness in classifying texts on climate change and ecology.
Francesca Grasso, Stefano Locci
― 5 min read
A new model improves speech recognition in multilingual conversations.
Hukai Huang, Jiayan Lin, Kaidi Wang
― 5 min read
Examining the organization and specialization of neurons in transformer models.
Nicholas Pochinkov, Thomas Jones, Mohammed Rashidur Rahman
― 6 min read
This paper explores mechanisms behind neuron activations and their impact on model performance.
Nicholas Pochinkov, Ben Pasero, Skylar Shibayama
― 6 min read
A new framework improves process models using expert insights and language models.
Ali Norouzifar, Humam Kourani, Marcus Dees
― 5 min read
This study examines the effectiveness of LLMs in musicology and their reliability.
Pedro Ramoneda, Emilia Parada-Cabaleiro, Benno Weck
― 5 min read
A new method aims to enhance word diversity while preserving style in literary translations.
Esther Ploeger, Huiyuan Lai, Rik van Noord
― 6 min read
Combine trained models to improve performance and reduce costs.
Rhui Dih Lee, Laura Wynter, Raghu Kiran Ganti
― 5 min read
This study compares BERT and Bi-LSTM for classifying electronic health records.
Shubham Agarwal, Thomas Searle, Mart Ratas
― 5 min read
Researchers enhance language models' learning with fresh data and innovative methods.
Maxime Méloux, Christophe Cerisara
― 6 min read
InkubaLM aims to improve language processing for underrepresented African languages.
Atnafu Lambebo Tonja, Bonaventure F. P. Dossou, Jessica Ojo
― 7 min read
CoRA enhances efficiency in training large language models using shared knowledge.
Xiaojun Xiao, Sen Shen, Qiming Bao
― 5 min read
TSO enhances language models by focusing on diversity, validity, and adaptability in preference data.
Kaihui Chen, Hao Yi, Qingyang Li
― 7 min read
A study on false refusals in language models and their impact on user experience.
Bang An, Sicheng Zhu, Ruiyi Zhang
― 6 min read
This project enhances text correction in Bulgarian historical documents using OCR technology.
Angel Beshirov, Milena Dobreva, Dimitar Dimitrov
― 5 min read
LongRecipe improves language models' understanding of long texts efficiently.
Zhiyuan Hu, Yuliang Liu, Jinman Zhao
― 5 min read
Innovative lightweight transducer enhances speech recognition efficiency and accuracy.
Genshun Wan, Mengzhi Wang, Tingzhi Mao
― 6 min read
A look into the effectiveness of pipeline versus end-to-end systems in summarizing across languages.
Daniel Varab, Christian Hardmeier
― 6 min read
A new approach to enhance decoder models for different dialects.
Dipankar Srirag, Aditya Joshi, Jacob Eisenstein
― 5 min read
YA-TA offers personalized support for students and instructors in large classrooms.
Dongil Yang, Suyeon Lee, Minjin Kim
― 7 min read
Using AI to simulate data for analyzing emotional connections in mental health.
Paulo Soares, Sean McCurdy, Andrew J. Gerber
― 10 min read