Cutting-edge science explained simply
A new method enhances the safety of language models against harmful prompts.
― 5 min read
Examining how robust machine learning models impact explanation effectiveness.
― 7 min read
Examining the challenges of self-explanations in large language models.
― 5 min read
Examining the effectiveness of reasoning in large language models.
― 7 min read
This article discusses protecting our personal data from language models.
― 5 min read
Exploring how fine-tuning affects reasoning in language models.
― 8 min read
LVLMs struggle to distinguish reality from fabrication, risking serious consequences.
― 5 min read