New multi-mask technique improves machine understanding of 3D data.
Jiaming Liu, Linghe Kong, Yue Wu
― 5 min read
Cutting edge science explained simply
CAMOT improves multi-object tracking by estimating camera angles and depths.
Felix Limanta, Kuniaki Uto, Koichi Shinoda
― 6 min read
SimVG improves visual grounding by linking text to specific image areas more effectively.
Ming Dai, Lingfeng Yang, Yihao Xu
― 6 min read
EAGLE model and dataset enhance understanding of egocentric videos.
Jing Bi, Yunlong Tang, Luchuan Song
― 5 min read
XNet leverages the Cauchy Activation Function for improved accuracy in complex data tasks.
Xin Li, Zhihong Xia, Hongkun Zhang
― 7 min read
New methods improve the detection of distant exoplanets using advanced algorithms.
Théo Bodrito, Olivier Flasseur, Julien Mairal
― 5 min read
New methods improve analysis and visualization of scientific data through better flow estimation.
Hamid Gadirov, Jos B. T. M. Roerdink, Steffen Frey
― 6 min read
This article discusses safety issues in text-to-image models and proposes solutions.
Tong Liu, Zhixin Lai, Gengyuan Zhang
― 6 min read
New methods aid robots in safely navigating hiking trails amidst obstacles.
Camndon Reed, Christopher Tatsch, Jason N. Gross
― 5 min read
New method improves crowd counting accuracy and model reliability.
Qiming Wu
― 5 min read
A new method enhances image creation of specific individuals and emotions.
Salaheldin Mohamed, Dong Han, Yong Li
― 4 min read
Examining how SSL models memorize data points and what that implies.
Wenhao Wang, Adam Dziedzic, Michael Backes
― 7 min read
New methods improve efficiency and accuracy in SSM-based vision models.
Zheng Zhan, Zhenglun Kong, Yifan Gong
― 5 min read
A new method improves object segmentation in images without manual labels.
Dylan Li, Gyungin Shin
― 6 min read
A dataset designed to improve collaboration between humans and robots in assembly tasks.
Samuel Adebayo, Seán McLoone, Joost C. Dessing
― 8 min read
SurfaceAI uses street images to assess road surface quality for safer travel.
Alexandra Kapp, Edith Hoffmann, Esther Weigmann
― 4 min read
A new approach to creating interactive 3D models from static meshes.
Denys Iliash, Hanxiao Jiang, Yiming Zhang
― 6 min read
A novel approach uses real-time MRI to visualize speech production movements.
Hong Nguyen, Sean Foley, Kevin Huang
― 5 min read
Recent models enhance AI's ability to generate and understand various media.
Xinlong Wang, Xiaosong Zhang, Zhengxiong Luo
― 5 min read
A new method improves 3D shape accuracy in dynamic scenes.
Shuo Wang, Binbin Huang, Ruoyu Wang
― 5 min read
A novel AI approach improves adenocarcinoma diagnosis across different imaging conditions.
Abdul Qayyum, Moona Mazher, Imran Razzak, Steven A Niederer
― 5 min read
New model improves predictions from brain activity data.
Zijian Dong, Ruilin Li, Yilei Wu
― 6 min read
A study on using images for trajectory classification and prediction.
Mariaclaudia Nicolai, Raffaella Fiamma Cabini, Diego Ulisse Pizzagalli
― 5 min read
A new method improves knowledge transfer in machine learning models.
Chaomin Shen, Yaomin Huang, Haokun Zhu
― 5 min read
This study uses Visual Question Answering for assessing charts created by AI models.
James Ford, Xingmeng Zhao, Dan Schumacher
― 7 min read
Using ontology can boost MLLMs' ability to identify plant diseases accurately.
Jihen Amara, Birgitta König-Ries, Sheeba Samuel
― 6 min read
Introducing a method for AI to generate images without large labeled datasets.
Zhiqiang Chen, Guofan Fan, Jinying Gao
― 7 min read
GeCo improves object counting with fewer examples, enhancing accuracy and reliability.
Jer Pelhan, Alan Lukežič, Vitjan Zavrtanik
― 5 min read
A new method enhances image privacy classification with clear, user-friendly explanations.
Alina Elena Baia, Andrea Cavallaro
― 7 min read
New method enhances CT images for better cancer treatment planning.
Belén Serrano-Antón, Mubashara Rehman, Niki Martinel
― 6 min read
Enhancements in LiDAR perception improve performance in multi-sensor environments.
Marc Uecker, J. Marius Zöllner
― 6 min read
A comprehensive dataset aims to improve flood prediction and response efforts globally.
Brandon Victor, Mathilde Letard, Peter Naylor
― 6 min read
A method for clearer satellite images directly from unprocessed data.
Michael Sprintson, Rama Chellappa, Cheng Peng
― 5 min read
CION advances person re-identification by focusing on identity correlations across videos.
Jialong Zuo, Ying Nie, Hanyu Zhou
― 6 min read
A framework merging different knowledge types to improve model performance.
Yaomin Huang, Zaomin Yan, Chaomin Shen
― 5 min read
A new method improves gaze target detection with less labeled data.
Francesco Tonini, Nicola Dall'Asen, Lorenzo Vaquero
― 6 min read
A new approach enhances deep learning model performance amidst noise.
Seyedarmin Azizi, Mohammad Erfan Sadeghi, Mehdi Kamal
― 5 min read
A new framework improves pixel labeling by addressing uncertainty in semantic segmentation.
Xiaoke Hao, Shiyu Liu, Chuanbo Feng
― 6 min read
This study assesses the effectiveness of pre-trained models in Earth Observation applications.
Jose Sosa, Mohamed Aloulou, Danila Rukhovich
― 6 min read
Temporal2Seq framework streamlines multiple video understanding tasks into one model.
Min Yang, Zichen Zhang, Limin Wang
― 8 min read