Cutting-edge science explained simply
A look into how transformers use attention layers for better language processing.
― 4 min read
Introducing CAP to improve fairness and efficiency in machine learning models.
― 6 min read
Examining self-attention and gradient descent in transformer models.
― 4 min read
Examining biases in next-token prediction and their impact on model performance.
― 7 min read
A deep dive into how next-token prediction shapes language understanding in models.
― 6 min read