Using advanced models to better assess research ideas in academia.
Yi Xu, Bo Xue, Shuqian Sheng
― 6 min read
Cutting edge science explained simply
Using advanced models to better assess research ideas in academia.
Yi Xu, Bo Xue, Shuqian Sheng
― 6 min read
New methods connect metadata to knowledge graphs for better data interpretation.
Margherita Martorana, Xueli Pan, Benno Kruit
― 5 min read
A system to improve IT support using retrieval augmented generation.
Paulina Toro Isaza, Michael Nidd, Noah Zheutlin
― 7 min read
CAST offers a precise approach to managing language model responses.
Bruce W. Lee, Inkit Padhi, Karthikeyan Natesan Ramamurthy
― 7 min read
This paper presents late chunking for better text retrieval by preserving context.
Michael Günther, Isabelle Mohr, Daniel James Williams
― 5 min read
Paper Copilot helps researchers navigate scientific literature efficiently.
Guanyu Lin, Tao Feng, Pengrui Han
― 5 min read
Research shows how coding influences language models' abilities in various tasks.
Jackson Petty, Sjoerd van Steenkiste, Tal Linzen
― 5 min read
RLPF enhances user data summarization for better predictions.
Jiaxing Wu, Lin Ning, Luyang Liu
― 5 min read
Enhancing spoken word identification through visual cues in under-resourced languages.
Leanne Nortje, Dan Oneata, Herman Kamper
― 7 min read
This study examines how language models learn from examples and past knowledge.
Aliakbar Nafar, Kristen Brent Venable, Parisa Kordjamshidi
― 8 min read
KnoWoGen offers a novel system for creating datasets in knowledge work, addressing key limitations.
Desiree Heim, Christian Jilek, Adrian Ulges
― 6 min read
Enhancing machine translation for the low-resource Karakalpak language through new datasets.
Mukhammadsaid Mamasaidov, Abror Shopulatov
― 4 min read
Fast Forward enhances low-rank training efficiency for language models.
Adir Rahamim, Naomi Saphra, Sara Kangaslahti
― 6 min read
This article discusses MLSAEs and their role in examining language model layers.
Tim Lawson, Lucy Farnik, Conor Houghton
― 5 min read
This study assesses large language models as judges in math reasoning tasks.
Andreas Stephan, Dawei Zhu, Matthias Aßenmacher
― 5 min read
A new method for predicting personality traits from online posts using filtered data.
Jan Hofmann, Cornelia Sindermann, Roman Klinger
― 7 min read
This study examines the role of confidence scores in enhancing OCR performance.
Arthur Hemmer, Mickaël Coustaty, Nicola Bartolo
― 6 min read
A new method improves code generation by using multiple programming languages.
Tengfei Xue, Xuefeng Li, Tahir Azim
― 5 min read
UI-JEPA enhances how systems predict user actions from screen interactions.
Yicheng Fu, Raviteja Anantha, Prabal Vashisht
― 5 min read
ECHO combines diverse reasoning patterns for better problem-solving in language models.
Ziqi Jin, Wei Lu
― 6 min read
Researchers create a method to test interventions for eating disorders without real risks.
Louis Penafiel, Hsien-Te Kao, Isabel Erickson
― 6 min read
MeMo dataset sheds light on how group conversations are remembered.
Maria Tsfasman, Bernd Dudzik, Kristian Fenech
― 5 min read
A new dataset enhances multilingual speech technology in India.
Ashwin Sankar, Srija Anand, Praveen Srinivasa Varadhan
― 5 min read
This article examines red-teaming risks in large language models used in business.
George Kour, Naama Zwerdling, Marcel Zalmanovici
― 3 min read
SSR improves language models' performance while maintaining their general abilities.
Sonam Gupta, Yatin Nandwani, Asaf Yehudai
― 6 min read
Untie the Knots method improves handling of long texts in language models.
Junfeng Tian, Da Zheng, Yang Cheng
― 6 min read
Research reveals how layers in LLMs equally contribute to predictions.
Hangfeng He, Weijie J. Su
― 6 min read
This article discusses the benefits of simplifying transformer models for speech tasks.
Teresa Dorszewski, Albert Kjøller Jacobsen, Lenka Tětková
― 4 min read
This research enhances how models answer questions using tables.
Ruya Jiang, Chun Wang, Weihong Deng
― 6 min read
Examining the link between truthfulness and political bias in language models.
Suyash Fulay, William Brannon, Shrestha Mohanty
― 6 min read
Exploring the social and ethical issues in gathering language data from diverse communities.
Andrew Smart, Ben Hutchinson, Lameck Mbangula Amugongo
― 8 min read
A new interactive framework improves data labeling efficiency using expert feedback.
Giannis Karamanolakis, Daniel Hsu, Luis Gravano
― 9 min read
A study highlights reasoning challenges in modern language models amidst misleading information.
Neeladri Bhuiya, Viktor Schlegel, Stefan Winkler
― 6 min read
WaterSeeker improves detection methods for watermarked text in large documents.
Leyi Pan, Aiwei Liu, Yijian Lu
― 5 min read
Two innovative methods enhance Chinese spelling correction performance and accuracy.
Lei Sheng, Shuai-Shuai Xu
― 5 min read
A new dataset aims to tackle harmful speech in Chinese videos.
Hongbo Wang, Junyu Lu, Yan Han
― 6 min read
Sortformer integrates speaker diarization and ASR for improved audio processing.
Taejin Park, Ivan Medennikov, Kunal Dhawan
― 5 min read
This study reviews Google Translate's effectiveness in translating Mandarin texts to English.
Xuechun Wang, Rodney Beard, Rohitash Chandra
― 5 min read
VisScience tests large models on scientific reasoning using text and images.
Zhihuan Jiang, Zhen Yang, Jinhao Chen
― 5 min read
New models improve math problem-solving by incorporating visual context alongside text.
Zhen Yang, Jinhao Chen, Zhengxiao Du
― 5 min read