This study evaluates machine learning models for detecting trash in rivers.
Marga Don, Stijn Pinson, Blanca Guillen Cebrian
― 5 min read
Cutting edge science explained simply
This study evaluates machine learning models for detecting trash in rivers.
Marga Don, Stijn Pinson, Blanca Guillen Cebrian
― 5 min read
A new method improves surface reconstruction from sparse images, ensuring detail and efficiency.
Rui Peng, Shihe Shen, Kaiqiang Xiong
― 6 min read
Exploring the benefits of Organized Grouped Discrete Representation in image processing.
Rongzhen Zhao, Vivienne Wang, Juho Kannala
― 7 min read
A new method enhances segmentation accuracy using SAM and CLIP models.
Xi Chen, Haosen Yang, Sheng Jin
― 5 min read
New model LowFormer improves speed and accuracy for visual tasks.
Moritz Nottebaum, Matteo Dunnhofer, Christian Micheloni
― 6 min read
New method LM-Gaussian generates detailed 3D models using limited input images.
Hanyang Yu, Xiaoxiao Long, Ping Tan
― 6 min read
A new method improves clarity in dark images using innovative neural networks.
Aoxiang Ning, Minglong Xue, Jinhong He
― 5 min read
A new method allows for easier conversion of ANNs to SNNs with less energy use.
Tong Bu, Maohua Li, Zhaofei Yu
― 7 min read
New dataset enhances tracking of multiple objects in challenging video conditions.
Friedhelm Hamann, Hanxiong Li, Paul Mieske
― 5 min read
VILA-U integrates video, image, and language tasks into a single framework.
Yecheng Wu, Zhuoyang Zhang, Junyu Chen
― 5 min read
A new approach to enhance action detection in videos using a novel TAG layer.
Aglind Reka, Diana Laura Borza, Dominick Reilly
― 5 min read
A new method enhances accuracy in locating objects from images.
Ting-Ru Liu, Hsuan-Kung Yang, Jou-Min Liu
― 4 min read
A new framework enhancing the understanding of images and text together.
Yi Zhu, Yanpeng Zhou, Chunwei Wang
― 9 min read
Using IRT for deeper evaluation of computer vision model performance.
Rahul Ramachandran, Tejal Kulkarni, Charchit Sharma
― 5 min read
HOGraspNet offers valuable data for studying hand-object interactions in robotics and computer vision.
Woojin Cho, Jihyun Lee, Minjae Yi
― 6 min read
This work enhances vision-language models through improved data strategies and innovative techniques.
Yuan Liu, Zhongyin Zhao, Ziyuan Zhuang
― 7 min read
A method improving CNN focus on key image areas for better decision-making.
Lars Nieradzik, Henrike Stephani, Janis Keuper
― 4 min read
A model distinguishing real images from computer-generated ones.
Preetu Mehta, Aman Sagar, Suchi Kumari
― 5 min read
A new method improves video classification by optimizing frame selection.
Junho Lee, Jeongwoo Shin, Seung Woo Ko
― 8 min read
A structured method for accurately labeling images and data using the sigma flow model.
Jonas Cassel, Bastian Boll, Stefania Petra
― 5 min read
Introducing PIP, a tool to detect adversarial attacks in LVLMs.
Yudong Zhang, Ruobing Xie, Jiansheng Chen
― 5 min read
A new method improves object identification in images through tailored visual and text integration.
Ruilin Yao, Shengwu Xiong, Yichen Zhao
― 5 min read
A new method improves road detection using diverse data sources.
Tao Ni, Xin Zhan, Tao Luo
― 6 min read
New methods improve depth estimation using single images through enhanced data augmentation.
Nischal Khanal, Shivanand Venkanna Sheshappanavar
― 6 min read
New method improves point cloud quality with weighted loss functions.
Fangzhou Lin, Haotian Liu, Haoying Zhou
― 6 min read
RPP improves fitting and generalization in Vision-Language Models using refined prompts.
Zhenyuan Chen, Lingfeng Yang, Shuo Chen
― 7 min read
This method improves training datasets for better image segmentation performance.
Quang-Huy Che, Duc-Tri Le, Vinh-Tiep Nguyen
― 6 min read
Study compares human and AI abilities in recognizing 3D shapes from different views.
Tyler Bonnen, Stephanie Fu, Yutong Bai
― 6 min read
Examining how computer vision models can align with human visual understanding.
Mohammad-Javad Darvishi-Bayazi, Md Rifat Arefin, Jocelyn Faubert
― 5 min read
New method improves continual learning in object detection.
Riccardo De Monte, Davide Dalle Pezze, Marina Ceccon
― 7 min read
A new dataset aims to enhance face morph detection methods.
Haoyu Zhang, Raghavendra Ramachandra, Kiran Raja
― 6 min read
A look into improvements and challenges in machine navigation using vision and language.
Xuesong Zhang, Jia Li, Yunbo Xu
― 4 min read
A new framework enhances object detection by identifying out-of-distribution instances using prototypes.
Junkun Chen, Jilin Mei, Liang Chen
― 6 min read
KRONC offers a fast method for estimating camera positions using key points on vehicles.
Davide Di Nucci, Alessandro Simoni, Matteo Tomei
― 5 min read
Competition showcases efforts for safer driving models in adverse conditions.
Furqan Ahmed Shaik, Sandeep Nagar, Aiswarya Maturi
― 5 min read
EMBA enhances panoramic imaging using event camera technology.
Shuang Guo, Guillermo Gallego
― 4 min read
DetailCLIP improves image understanding by focusing on details and context.
Amin Karimi Monsefi, Kishore Prakash Sailaja, Ali Alilooee
― 6 min read
gsplat simplifies Gaussian Splatting for efficient 3D image creation.
Vickie Ye, Ruilong Li, Justin Kerr
― 6 min read
A new method enables machines to accurately model moving and changing shapes.
Archana Swaminathan, Anubhav Gupta, Kamal Gupta
― 7 min read
This article discusses methods for comparing images using nonlinear elasticity models.
John M. Ball, Christopher L. Horner
― 5 min read