A method combining VMD and linear models boosts forecasting accuracy.
― 5 min read
Cutting edge science explained simply
A method combining VMD and linear models boosts forecasting accuracy.
― 5 min read
The PoEM framework assesses language models without needing precise labels.
― 5 min read
This study evaluates how slight changes impact language model responses.
― 4 min read
A new method helps identify test data contamination in LLMs using token probabilities.
― 8 min read
FSDEM offers a fresh approach to assessing feature selection techniques for data analysis.
― 5 min read
MAPWise dataset challenges models on map-based questions and evaluates their reasoning skills.
― 6 min read
This article discusses a new rating system for evaluating language models more fairly.
― 5 min read
Logit Scaling enhances out-of-distribution data detection without training data.
― 6 min read
This study evaluates machine learning models for detecting trash in rivers.
― 5 min read
A new method for assessing robustness in ML classifiers using adversarial distance.
― 6 min read
A closer look at how well large language models perform basic tasks.
― 7 min read
A new method improves AI explanations through collaboration between two language models.
― 5 min read
This research explores how topological degree assesses the effectiveness of VAEs in capturing data structure.
― 5 min read
Study reveals how language models utilize context for accurate responses.
― 6 min read
New methods help understand how models react to data changes.
― 6 min read
This article examines methods for detecting data contamination in large language models.
― 6 min read
This paper explores how bootstrap methods enhance stability and robustness in SGD models.
― 5 min read
A new benchmark aims to improve uncertainty assessment in language models.
― 5 min read
A new method improves model reasoning through structured programming traces.
― 8 min read
Examining how fine-tuning affects safety in language models across various tasks.
― 5 min read
A fresh approach to evaluating ML models using Item Response Theory for better insights.
― 5 min read
Strong baseline models enhance the evaluation of ML systems in healthcare.
― 6 min read
A look at confidence intervals in few-shot learning and their impact on model evaluation.
― 6 min read
Examining the understanding and output accuracy of language models.
― 5 min read
Research highlights using influence functions to enhance PINN performance in physics problems.
― 6 min read
A look into effective dimension and its impact on model training.
― 6 min read
This paper evaluates how well language models explain scientific concepts.
― 4 min read
This article examines GAMs as a solution for predictive performance and interpretability.
― 7 min read
Examining how hard samples affect model performance and the reliability of test accuracy.
― 9 min read
This article examines how different layers affect LLM performance.
― 5 min read
Soft labels can improve machine learning model performance in uncertain data scenarios.
― 6 min read
RepairBench sets benchmarks for comparing AI models in fixing software bugs.
― 5 min read
This method enhances the reliability of language model confidence scores.
― 5 min read
Learn how the applicability domain affects predictive model accuracy in various fields.
― 9 min read
A method to estimate reliability of responses from large language models.
― 4 min read
A new method for testing language models using randomized text.
― 6 min read
A method to improve steering vector effectiveness in language models.
― 5 min read
Explore the impact of shortcut learning on language models and their real-world applications.
― 4 min read
This paper examines methods to compare generative models through embedding-based representations.
― 6 min read
A framework to balance pseudo-label learning in machine learning.
― 5 min read