A novel method improves decision tree aggregation while maintaining interpretability and privacy.
― 6 min read
A new approach for clearer GNN predictions using edge-focused subgraph explanations.
― 6 min read
This study analyzes neural retrieval models using causal methods for better relevance insights.
― 6 min read
This paper discusses a white-box model for effective unsupervised learning.
― 6 min read
Sparse autoencoders enhance the interpretability of AI systems and their decision-making processes.
― 18 min read
This study assesses saliency methods in NLP through human evaluation.
― 8 min read
A new method enhances the clarity and performance of GNN predictions.
― 7 min read
This article explores circuit analysis techniques in Transformer models for improved language processing.
― 5 min read
A new method offers clearer insights into deep learning model decisions.
― 6 min read
FreeShap improves instance attribution for language models, boosting reliability and efficiency.
― 6 min read
Bilinear MLPs offer simpler, more interpretable models in machine learning.
― 8 min read
A new method improves model transparency and trust in critical areas like healthcare.
― 6 min read
Explaining GNN decisions using activation rules improves trust and understanding.
― 8 min read
A new method for understanding how audio models make predictions.
― 5 min read
A unified framework to assess explanation types for better model understanding.
― 5 min read
This article presents a new method for better understanding machine learning models.
― 6 min read
Missing data affects model performance and insights derived from machine learning.
― 5 min read
An overview of mechanistic interpretability in transformer-based language models.
― 7 min read
Examining how language models encode and relate concepts.
― 6 min read
A new framework minimizes human effort while addressing model biases.
― 6 min read
TokenSHAP reveals how words impact language model responses.
― 7 min read
A study on the reliability of LLM self-explanations in natural language tasks.
― 6 min read
CEViT enhances image similarity measurement and offers clear explanations.
― 5 min read
A new method combining concept learning and disentangled representations for better model understanding.
― 7 min read
Examining how class outliers affect explainability in machine learning models.
― 6 min read
Learn how Shapley compositions improve the understanding of multiclass predictions.
― 6 min read
This study investigates DCLS's impact on model interpretability and accuracy.
― 6 min read
GLEAMS efficiently delivers clear local and global explanations for machine learning predictions.
― 6 min read
New models improve performance by using class labels and concepts extracted from data.
― 6 min read
A look at the key differences between Explainable AI and Interpretable AI.
― 7 min read
New methods improve understanding of deep learning decisions in time series analysis.
― 5 min read
A new tool helps users make sense of complex tree models.
― 7 min read
A method improving CNN focus on key image areas for better decision-making.
― 4 min read
This study assesses the IDGI framework for explaining deep learning model predictions.
― 5 min read
GAProtoNet enhances text classification by improving interpretability while maintaining high accuracy.
― 5 min read
EQ-CBM makes AI models easier to understand through improved concept encoding and greater flexibility.
― 6 min read
A new method enhances the grouping of neural networks for better understanding.
― 5 min read
New methods enhance the accuracy of influence functions in large models.
― 6 min read
A new approach for clearer visualization and understanding of deep learning models.
― 4 min read
A new method enhances understanding of CNN features and decision-making.
― 8 min read