This study evaluates LLM performance on Indonesian professional exams across multiple fields.
Fajri Koto
― 4 min read
Cutting edge science explained simply
This study evaluates LLM performance on Indonesian professional exams across multiple fields.
Fajri Koto
― 4 min read
This article examines key factors in preference dataset quality for better reward model training.
Judy Hanwen Shen, Archit Sharma, Jun Qin
― 6 min read
Examining how language processing tools influence the richness of communication.
Josef Jon
― 8 min read
A comparison of methods to personalize large language models for better user responses.
Alireza Salemi, Hamed Zamani
― 5 min read
A framework to analyze Bangla social media content through text and images.
Fatema Tuj Johora Faria, Mukaffi Bin Moin, Md. Mahfuzur Rahman
― 5 min read
Study explores ASR development for Amis and Seediq, focusing on data use.
Yao-Fei Cheng, Li-Wei Chen, Hung-Shin Lee
― 7 min read
This project generates synthetic clinical letters to protect patient privacy in research.
Libo Ren, Samuel Belkadi, Lifeng Han
― 5 min read
A study assessing AI's effectiveness in social media text annotation.
Nicholas Pangakis, Samuel Wolken
― 9 min read
Study reveals how faulty code affects test case quality from LLMs.
Dong Huang, Jie M. Zhang, Mingzhe Du
― 5 min read
DAC model improves audio captioning with speed and diversity.
Manjie Xu, Chenxing Li, Xinyi Tu
― 5 min read
This research shows how metadata can boost political stance detection accuracy.
Stanley Cao, Felix Drinkall
― 6 min read
A new model for better relation extraction using syntax and context.
Xin Wang, Xinyi Bai
― 5 min read
This article examines methods for detecting data contamination in large language models.
Vinay Samuel, Yue Zhou, Henry Peng Zou
― 6 min read
A new model improves language understanding and reduces misinformation.
Xuan-Phi Nguyen, Shrey Pandit, Senthil Purushwalkam
― 6 min read
Research reveals how language models reflect human personality traits.
Joseph Suh, Suhong Moon, Minwoo Kang
― 5 min read
A new dataset aims to improve QA systems for the Quran and Ahadith.
Faiza Qamar, Seemab Latif, Rabia Latif
― 8 min read
A system creates fake medical records while ensuring patient privacy.
Samuel Belkadi, Libo Ren, Nicolo Micheletti
― 7 min read
This paper evaluates VLMs' ability to reason about sizes and distances.
Yuan-Hong Liao, Rafid Mahmood, Sanja Fidler
― 6 min read
This study assesses language models in classifying political content toxicity.
Bastián González-Bustamante
― 5 min read
AI agents enhance air traffic management by handling conflicts and learning from experiences.
Justas Andriuškevičius, Junzi Sun
― 8 min read
New methods streamline PICO extraction from clinical trials for efficient research.
Madhusudan Ghosh, Shrimon Mukherjee, Asmit Ganguly
― 7 min read
New technologies aim to improve care for patients with rare gynecological cancers.
Jacqueline Lammert, Nicole Pfarr, Leonid Kuligin
― 6 min read
A system empowers users to control their reflective writing process.
Inhwa Song, SoHyun Park, Sachin R. Pendse
― 5 min read
This study examines gender bias in teacher evaluations generated by AI models.
Yuanning Huang
― 9 min read
This study focuses on improving dialogue systems' reliability by assessing confidence in responses.
Yi-Jyun Sun, Suvodip Dey, Dilek Hakkani-Tur
― 6 min read
Automating text annotation improves accuracy and efficiency in machine learning.
Jianfei Wu, Xubin Wang, Weijia Jia
― 5 min read
A new approach combines two KenLM models for better data filtering.
Yungi Kim, Hyunsoo Ha, Sukyung Lee
― 5 min read
A new technique reveals weaknesses in AI safety measures for language models.
Emet Bethany, Mazal Bethany, Juan Arturo Nolazco Flores
― 6 min read
Transforming text into lifelike digital movements using innovative models.
S. Rohollah Hosseyni, Ali Ahmad Rahmani, S. Jamal Seyedmohammadi
― 4 min read
Visualize how keywords and topics evolve over time with TimeLink.
Daniel Palamarchuk, Lemara Williams, Brian Mayer
― 6 min read
This study investigates generative models for effective keyphrase creation in scientific papers.
Anna Glazkova, Dmitry Morozov
― 6 min read
A new technique cuts memory needs for large language models while keeping performance.
Luning Wang, Shiyao Li, Xuefei Ning
― 5 min read
A new method enhances efficiency in processing long inputs for language models.
Di Liu, Meng Chen, Baotong Lu
― 5 min read
Causal language models show promise in solving Sudoku and Zebra puzzles.
Kulin Shah, Nishanth Dikkala, Xin Wang
― 4 min read
Exploring how memory functions in LLMs and its comparison to human memory.
Wei Wang, Qing Li
― 7 min read
A framework to enhance cooperative behavior using advanced AI technology.
Qiliang Chen, Sepehr Ilami, Nunzio Lore
― 7 min read
ReflectDiffu improves chatbot interactions by better understanding emotions.
Jiahao Yuan, Zixiang Di, Zhiqing Cui
― 5 min read
Research highlights working memory constraints in Transformer models during complex tasks.
Dongyu Gong, Hantao Zhang
― 5 min read
A new method enhances language model communication by adjusting personality traits.
Navya Jain, Zekun Wu, Cristian Munoz
― 7 min read
A study evaluates GPT-4 and clinalytix Medical AI for predicting delirium risk.
Mohamed Rezk, Patricia Cabanillas Silva, Fried-Michael Dahlweid
― 7 min read