A new framework enhances the connection between images and text.
Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali
― 7 min read
Cutting-edge science explained simply
Learn how machine learning models can improve when facing new and unseen data.
Zongbo Han, Jialong Yang, Junfan Li
― 7 min read
A look at the role and methods of diffusion models in image creation.
Zheyuan Zhan, Defang Chen, Jian-Ping Mei
― 7 min read
Exploring methods to help multimodal models break down visual questions.
Haowei Zhang, Jianzhe Liu, Zhen Han
― 6 min read
A new model generates reports from 3D CT scans efficiently and accurately.
Hao Chen, Wei Zhao, Yingli Li
― 8 min read
A new pipeline for generating 3D models from 2D images efficiently.
Potito Aghilar, Vito Walter Anelli, Michelantonio Trizio
― 5 min read
TrojVLM exposes vulnerabilities in Vision Language Models to backdoor attacks.
Weimin Lyu, Lu Pang, Tengfei Ma
― 7 min read
This study reveals effective methods for recognizing hand gestures through ultrasound imaging.
Keshav Bimbraw, Ankit Talele, Haichong K. Zhang
― 5 min read
A new framework improves data generation across multiple sources using energy-based models.
Shiyu Yuan, Jiali Cui, Hanao Li
― 5 min read
SATA improves the robustness and efficiency of Vision Transformers for image classification tasks.
Nick Nikzad, Yi Liao, Yongsheng Gao
― 4 min read
A new method improves object recognition using masks without detailed labels.
Heeseong Shin, Chaehyun Kim, Sunghwan Hong
― 5 min read
A new method simplifies the removal of unwanted content in visual datasets.
Saehyung Lee, Jisoo Mok, Sangha Park
― 6 min read
Exploring Federated Learning's role in enhancing medical imaging while protecting patient privacy.
Nikolas Koutsoubis, Asim Waqas, Yasin Yilmaz
― 5 min read
A new method helps robots learn tasks using online human videos, reducing training needs.
Homanga Bharadhwaj, Debidatta Dwibedi, Abhinav Gupta
― 6 min read
PPLNs enhance event camera data processing for improved machine vision capabilities.
Chen Song, Zhenxiao Liang, Bo Sun
― 6 min read
A novel approach improves the ability to distinguish genuine signatures from forgeries.
Hansong Zhang, Jiangjian Guo, Kun Li
― 5 min read
Analyzing the effects of pruning methods on GoogLeNet's performance and interpretability.
Jonathan von Rad, Florian Seuffert
― 5 min read
Innovative methods for enhancing depth maps vital for augmented and virtual reality.
Marcos V. Conde, Florin-Alexandru Vasluianu, Jinhui Xiong
― 6 min read
FAST improves disease classification using whole slide images with minimal expert input.
Kexue Fu, Xiaoyuan Luo, Linhao Qu
― 5 min read
A method to enhance model performance despite incorrect data labels.
Tong Wei, Hao-Tian Li, Chun-Shu Li
― 7 min read
MedViLaM integrates multiple medical data types for improved analysis and decision-making.
Lijian Xu, Hao Sun, Ziyu Ni
― 5 min read
A new method to speed up diffusion model output without losing quality.
Zhenyu Zhou, Defang Chen, Can Wang
― 7 min read
New model streamlines report generation from brain CT scans.
Chengxin Zheng, Junzhong Ji, Yanzhao Shi
― 5 min read
FlipClass offers a new method for better learning in Generalized Category Discovery.
Haonan Lin, Wenbin An, Jiahao Wang
― 5 min read
A new method enhances identification of oriented objects in remote sensing images.
Jiaqi Zhao, Zeyu Ding, Yong Zhou
― 5 min read
A novel method for adapting time-series data without needing source information.
Yucheng Wang, Peiliang Gong, Min Wu
― 7 min read
CIAI system improves detection of noise in images, enhancing AI model accuracy.
Anubhooti Jain, Susim Roy, Kwanit Gupta
― 5 min read
Combining global and local prompts enhances federated learning models while preserving data privacy.
Bikang Pan, Wei Huang, Ye Shi
― 6 min read
VideoLISA uses language to segment and track objects in videos effectively.
Zechen Bai, Tong He, Haiyang Mei
― 6 min read
A new method improves realism in human image animations for various applications.
Zhongcong Xu, Chaoyue Song, Guoxian Song
― 6 min read
A new method improves MRI imaging by correcting motion during scans.
Constantin Slioussarenko, Pierre-Yves Baudin, Marc Lapert
― 7 min read
A new method enhances person identification across cameras with reduced supervision.
Xuan Tan, Xun Gong, Yang Xiang
― 5 min read
New method creates detailed 3D models from single video inputs.
Jeff Tan, Donglai Xiang, Shubham Tulsiani
― 4 min read
This method helps machines plan actions effectively from instructional videos.
Md Mohaiminul Islam, Tushar Nagarajan, Huiyu Wang
― 8 min read
Explore the painting process with innovative time-lapse technology.
Bowei Chen, Yifan Wang, Brian Curless
― 6 min read
New algorithms improve efficiency in object detection by optimizing NMS processes.
King-Siong Si, Lu Sun, Weizhan Zhang
― 5 min read
NutriVision helps users manage diets through technology and personalized recommendations.
Madhumita Veeramreddy, Ashok Kumar Pradhan, Swetha Ghanta
― 5 min read
A new method improves human pose estimation by enabling continuous learning of keypoints.
Muhammad Saif Ullah Khan, Muhammad Ahmed Ullah Khan, Muhammad Zeshan Afzal
― 5 min read
POMONAG improves architecture search with a focus on multiple objectives for better efficiency.
Eugenio Lomurno, Samuele Mariani, Matteo Monti
― 7 min read
AUCSeg improves segmentation by addressing class imbalance in image processing.
Boyu Han, Qianqian Xu, Zhiyong Yang
― 7 min read