SOOD-ImageNet addresses challenges in computer vision related to changing image meanings.
Alberto Bacchin, Davide Allegro, Stefano Ghidoni
― 6 min read
Cutting edge science explained simply
SOOD-ImageNet addresses challenges in computer vision related to changing image meanings.
Alberto Bacchin, Davide Allegro, Stefano Ghidoni
― 6 min read
Introducing a model that improves image retrieval by incorporating uncertainty.
Danilo Dordevic, Suryansh Kumar
― 6 min read
Generative models create diverse training data, enhancing robot adaptability.
Zoey Chen, Zhao Mandi, Homanga Bharadhwaj
― 7 min read
Examining how regularization techniques affect machine learning's ability to handle unknown inputs.
Zachary Rabin, Jim Davis, Benjamin Lewis
― 6 min read
A look at efficient methods for tracking objects in video through semi-parametric models.
Jianqiao Wangni
― 5 min read
A new method improves how models learn from images and text.
Dominykas Seputis, Serghei Mihailov, Soham Chatterjee
― 5 min read
ViDiDi enhances video learning through efficient use of unlabeled data.
Siyi Chen, Minkyu Choi, Zesen Zhao
― 6 min read
YoloTag improves drone navigation using real-time fiducial marker detection.
Sourav Raxit, Simant Bahadur Singh, Abdullah Al Redwan Newaz
― 5 min read
A new method enhances feature matching by combining geometry and color information.
Gonglin Chen, Jinsen Wu, Haiwei Chen
― 5 min read
Exploring shadow detection, removal, and generation in computer vision.
Xiaowei Hu, Zhenghao Xing, Tianyu Wang
― 7 min read
A new method improves object tracking in videos with just one camera.
Jenny Seidenschwarz, Qunjie Zhou, Bardienus Duisterhof
― 7 min read
A new method enhances image quality during adverse weather using language and vision models.
Jiaqi Xu, Mengyang Wu, Xiaowei Hu
― 5 min read
New methods improve robots' ability to grasp objects using 3D representations.
Mazeyu Ji, Ri-Zhao Qiu, Xueyan Zou
― 8 min read
F2former combines deep learning and fractional transforms for clearer images.
Subhajit Paul, Sahil Kumawat, Ashutosh Gupta
― 6 min read
Exploring quantum computing's role in improving robust fitting techniques for computer vision.
Frances Fengyi Yang, Michele Sasdelli, Tat-Jun Chin
― 6 min read
Innovative methods enhance object detection performance on resource-limited devices.
Francesco Pasti, Marina Ceccon, Davide Dalle Pezze
― 5 min read
New dataset and model improve object detection in indoor environments.
Salah Eddine Laidoudi, Madjid Maidi, Samir Otmane
― 7 min read
New methods enhance image classification with fewer examples.
Soumitri Chattopadhyay, Sanket Biswas, Emanuele Vivoli
― 6 min read
A look into methods recognizing object affordances for machines.
Tommaso Apicella, Alessio Xompero, Paolo Gastaldo
― 7 min read
New technique improves detection of camouflaged objects in various fields.
Yanguang Sun, Chunyan Xu, Jian Yang
― 5 min read
New models enhance performance in data classification tasks using concepts from sleep.
Mingze Ni, Wei Liu
― 7 min read
A new method improves object movement analysis in challenging environments.
Tanner D. Harms, Steven L. Brunton, Beverley J. McKeon
― 7 min read
A new method improves sensor data combining accuracy and efficiency.
Hersh Vakharia, Xiaoxiao Du
― 6 min read
iSeg improves image segmentation accuracy with less training data.
Lin Sun, Jiale Cao, Jin Xie
― 4 min read
MM-DPCNs improve video analysis efficiency by learning features without labels.
Wenqian Xue, Chi Ding, Jose Principe
― 4 min read
A new method enhances 3D image quality using dense metric depth.
Arkadeep Narayan Chaudhury, Igor Vasiljevic, Sergey Zakharov
― 6 min read
A method to improve action recognition with fewer labeled videos and more unlabeled data.
Owais Iqbal, Omprakash Chakraborty, Aftab Hussain
― 6 min read
This article examines how combining real and synthetic images boosts face recognition accuracy and fairness.
Andrea Atzori, Pietro Cosseddu, Gianni Fenu
― 5 min read
New method generates realistic 3D human models from single images using advanced video techniques.
Zhibin Liu, Haoye Dong, Aviral Chharia
― 5 min read
Introducing new metrics for assessing handwritten text generation systems.
Konstantina Nikolaidou, George Retsinas, Giorgos Sfikas
― 6 min read
This approach combines autoencoders and diffusion techniques for clearer images.
Vighnesh Birodkar, Gabriel Barcik, James Lyon
― 6 min read
Plane2Depth improves depth estimation in complex scenes, addressing challenges of low texture.
Li Liu, Ruijie Zhu, Jiacheng Deng
― 6 min read
This research enhances depth estimation in robots using meta-learning for better performance in varied environments.
Cho-Ying Wu, Yiqi Zhong, Junying Wang
― 5 min read
A new method and dataset for automated cell analysis in brain research.
Valentina Vadori, Jean-Marie Graïc, Antonella Peruffo
― 4 min read
A new approach to create synthetic images efficiently for dataset distillation.
Zhe Li, Weitong Zhang, Sarah Cechnicka
― 8 min read
MVTN improves hand gesture recognition through innovative multiscale techniques.
Mallika Garg, Debashis Ghosh, Pyari Mohan Pradhan
― 5 min read
This study assesses various visual models for understanding complex 3D scenes.
Yunze Man, Shuhong Zheng, Zhipeng Bao
― 8 min read
This study evaluates machine learning models for detecting trash in rivers.
Marga Don, Stijn Pinson, Blanca Guillen Cebrian
― 5 min read
A new method improves surface reconstruction from sparse images, ensuring detail and efficiency.
Rui Peng, Shihe Shen, Kaiqiang Xiong
― 6 min read
Exploring the benefits of Organized Grouped Discrete Representation in image processing.
Rongzhen Zhao, Vivienne Wang, Juho Kannala
― 7 min read