Computer Science - Computer Vision and Pattern Recognition

RSS

Computer Vision and Pattern Recognition OmniCorpus Dataset: A New Resource for Multimodal Learning

A comprehensive dataset merging images and text to aid machine learning.

2025-07-29T22:44:24+00:00 ― 6 min read

Computer Vision and Pattern Recognition Evaluating Video Comprehension in Multimodal Language Models

A new benchmark aims to assess MLLMs in video understanding across multiple topics.

2025-07-29T22:20:42+00:00 ― 6 min read

Computer Vision and Pattern Recognition Innovative Model for Artistic Font Generation

A new model generates unique font effects for multiple languages.

2025-07-29T21:57:00+00:00 ― 5 min read

Image and Video Processing New Dataset Advances Confocal Fluorescence Microscopy Research

A new dataset enhances image quality evaluation in microscopy.

2025-07-29T21:55:15+00:00 ― 7 min read

Computer Vision and Pattern Recognition New Method Improves Recognition of Social Relationships

ConSoR enhances the understanding of social connections through visual context analysis.

2025-07-29T21:49:06+00:00 ― 7 min read

Computer Vision and Pattern Recognition Improving Vision Transformers with Adaptor NCA

A new approach enhances the robustness of Vision Transformers against adversarial attacks.

2025-07-29T21:09:36+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancing Depth Estimation with Self-Supervised Learning

A new model enhances depth estimation accuracy using self-supervised learning techniques.

2025-07-29T21:06:40+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancing 3D Scene Generation with hGCA

hGCA automates realistic 3D scene creation using sparse LiDAR data.

2025-07-29T21:01:42+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancements in Image Dataset Augmentation

New methods improve image datasets while ensuring privacy and performance.

2025-07-29T20:53:48+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancements in Document Understanding Efficiency

Research focuses on improving efficiency in document understanding models.

2025-07-29T20:45:54+00:00 ― 7 min read

Computer Vision and Pattern Recognition Challenging the Limits of Vision-Language Models

A new benchmark tests compositional reasoning in advanced models.

2025-07-29T19:42:42+00:00 ― 7 min read

Computer Vision and Pattern Recognition Improving Image Generation with CFG++

CFG++ enhances image generation and editing, offering better alignment with text prompts.

2025-07-29T18:31:36+00:00 ― 6 min read

Computer Vision and Pattern Recognition ABTrack: A New Approach to Visual Tracking

ABTrack enhances visual tracking speed and efficiency across various devices.

2025-07-29T18:23:42+00:00 ― 5 min read

Computer Vision and Pattern Recognition New Benchmark for Long Video Understanding

A benchmark created to improve comprehension of long video content.

2025-07-29T18:15:48+00:00 ― 7 min read

Computer Vision and Pattern Recognition Mapping Urban Slums: A Technological Approach

Utilizing satellite imagery and deep learning to improve slum mapping and living conditions.

2025-07-29T18:07:54+00:00 ― 6 min read

Audio and Speech Processing Advancing Foley Audio with the MINT Dataset

A new dataset improves the creation of foley audio for multimedia content.

2025-07-29T17:03:45+00:00 ― 6 min read

Computer Vision and Pattern Recognition Improving Band Selection in Hyperspectral Imaging

New method enhances band selection for hyperspectral imaging without retraining.

2025-07-29T16:56:48+00:00 ― 5 min read

Computer Vision and Pattern Recognition Enhancing Model Performance with Optimal Transport-guided Visual Prompting

A new method improves machine learning models' accuracy on unseen data.

2025-07-29T15:37:48+00:00 ― 6 min read

Computer Vision and Pattern Recognition The Muharaf Dataset: A Key to Arabic Handwriting Recognition

A comprehensive dataset for Arabic handwritten text recognition and research.

2025-07-29T14:34:36+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancing 3D Object Recognition with ImageNet3D

ImageNet3D enhances machine understanding of 3D objects in images.

2025-07-29T14:26:42+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancing Color Recognition in Neural Networks

A new neural network improves color recognition for better image classification.

2025-07-29T14:10:54+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancements in Language-Driven Grasp Detection for Robots

New dataset enhances robots' grasping skills using natural language commands.

2025-07-29T13:15:36+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancing Offline Reinforcement Learning with SeMOPO

SeMOPO improves learning from low-quality data by separating useful information from noise.

2025-07-29T13:07:42+00:00 ― 4 min read

Computer Vision and Pattern Recognition Risks of Diffusion Models in Image Processing

Exploring privacy threats in image processing using diffusion models and leaked gradients.

2025-07-29T12:59:48+00:00 ― 7 min read

Computer Vision and Pattern Recognition Advancements in Video Understanding Technology

A new model enhances video comprehension by merging image and video encoders.

2025-07-29T12:28:12+00:00 ― 7 min read

Computer Vision and Pattern Recognition Reimagining Score Distillation Sampling Techniques

A new perspective on improving image creation through score distillation sampling.

2025-07-29T12:20:18+00:00 ― 7 min read

Computer Vision and Pattern Recognition Rethinking Image Processing: The Pixel Transformer Approach

A shift from patches to pixels in computer vision is changing image analysis.

2025-07-29T12:12:24+00:00 ― 6 min read

Computer Vision and Pattern Recognition Personalizing Generative Models with Weight Space

Customizing generative models to reflect unique identities through weight space.

2025-07-29T12:04:30+00:00 ― 7 min read

Computer Vision and Pattern Recognition Attributing Influence in Text-to-Image Models

This study presents a new method for identifying key training images in AI-generated visuals.

2025-07-29T11:56:36+00:00 ― 7 min read

Computer Vision and Pattern Recognition Assessing the Robustness of Visual State Space Models

This article examines how Visual State Space Models handle visual challenges.

2025-07-29T11:48:42+00:00 ― 6 min read

Computer Vision and Pattern Recognition Integrating Visual Sketching into Language Models

A new framework enhances reasoning in language models through visual sketches.

2025-07-29T11:40:48+00:00 ― 3 min read

Computer Vision and Pattern Recognition Introducing MMScan: A New Dataset for 3D Scene Understanding

MMScan enhances AI’s ability to comprehend complex 3D environments with extensive annotations.

2025-07-29T11:32:54+00:00 ― 7 min read

Computer Vision and Pattern Recognition Personalizing AI: Making Connections with Users

A new method helps AI engage in personal conversations about specific subjects.

2025-07-29T11:25:00+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancements in Video Analysis for Daily Living Activities

Researchers aim to improve machine understanding of daily activities through video analysis.

2025-07-29T11:09:12+00:00 ― 6 min read

Computer Vision and Pattern Recognition SimGen: A New Approach to Synthetic Data for Self-Driving Cars

SimGen improves self-driving car training with realistic synthetic data.

2025-07-29T11:01:18+00:00 ― 7 min read

Computer Vision and Pattern Recognition Advancements in Vision-Language Geo-Foundation Models

Exploring the role of VLGFMs in geospatial data analysis.

2025-07-29T10:53:24+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancing 3D Head Modeling with GGHead

A new method rapidly creates detailed 3D head models from 2D images.

2025-07-29T10:45:30+00:00 ― 7 min read

Computer Vision and Pattern Recognition Advancements in Monocular Depth Estimation Techniques

New method improves depth estimation accuracy using single images.

2025-07-29T10:37:36+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancing Video Understanding with VideoNIAH

A new framework improves video comprehension and evaluation methods.

2025-07-29T10:21:48+00:00 ― 5 min read

Machine Learning Advancements in Unsupervised Domain Adaptation

A new method improves model adaptability across domains using prompt learning and gradient alignment.

2025-07-29T09:58:06+00:00 ― 6 min read