Jan Kautz

New method tackles overexposure issues in everyday video recording using deep learning.

2025-10-03T02:32:42+00:00 ― 6 min read

A novel approach enhances downscaling of weather data for better local forecasts.

2025-09-11T03:04:06+00:00 ― 7 min read

SpatialRGPT enhances object arrangement understanding in Vision Language Models.

2025-08-03T05:10:36+00:00 ― 6 min read

New adaptable models can meet diverse needs without retraining.

2025-07-31T06:44:06+00:00 ― 7 min read

MambaVision combines Mamba and Transformers for better image recognition.

2025-07-16T02:56:24+00:00 ― 4 min read

This study explores methods to create smaller language models effectively and affordably.

2025-07-10T13:17:54+00:00 ― 5 min read

This article analyzes model performance across various tasks and datasets.

2025-07-08T02:42:24+00:00 ― 5 min read

A new method improves data quality for visual language models using augmentation techniques.

2025-07-07T17:53:06+00:00 ― 7 min read

A method to shrink language models without sacrificing effectiveness through pruning and distillation.

2025-06-24T13:29:24+00:00 ― 4 min read

A new method enhances LLM performance while reducing complexity.

2025-06-06T07:41:54+00:00 ― 7 min read

NaVILA helps robots navigate using language and vision.

2025-04-12T05:32:06+00:00 ― 6 min read

A look at Gated DeltaNet and its impact on language models.

2025-03-28T17:15:00+00:00 ― 5 min read

Discover emerging techniques revolutionizing how machines see and understand images.

2025-03-25T13:00:45+00:00 ― 6 min read

StreamChat transforms how we engage with streaming video in real-time.

2025-03-21T16:43:30+00:00 ― 7 min read