A benchmark for generalized few-shot segmentation in remote sensing is introduced.
Clifford Broni-Bediako, Junshi Xia, Jian Song
― 5 min read
Cutting edge science explained simply
A new method combines video, audio, and algorithms for better anomaly detection.
Yuta Kaneko, Abu Saleh Musa Miah, Najmul Hassan
― 7 min read
A look at Score Forgetting Distillation and its impact on generative AI.
Tianqi Chen, Shujian Zhang, Mingyuan Zhou
― 5 min read
SplatFields improves 3D imaging from limited camera views, boosting detail and quality.
Marko Mihajlovic, Sergey Prokudin, Siyu Tang
― 7 min read
Using synthetic data to enhance mobility tools for blind and low-vision individuals.
Hochul Hwang, Krisha Adhikari, Satya Shodhaka
― 6 min read
This article reviews the reliability of MIL models in clinical applications.
Hassan Keshvarikhojasteh
― 5 min read
A new method enhances pose estimation using RGB images informed by depth data.
Alessandro Simoni, Francesco Marchetti, Guido Borghi
― 6 min read
OneEncoder efficiently connects images, text, audio, and video for better information processing.
Bilal Faye, Hanane Azzag, Mustapha Lebbah
― 7 min read
New methods improve accuracy and efficiency in recognizing similar objects.
Edwin Arkel Rios, Femiloye Oyerinde, Min-Chun Hu
― 5 min read
Learn how to evaluate and compare images effectively.
Gautier Dagan, Olga Loginova, Anil Batra
― 4 min read
This model improves AI learning while retaining past knowledge.
Min-Yeong Park, Jae-Ho Lee, Gyeong-Moon Park
― 6 min read
A new system enhances safety predictions for autonomous vehicles in challenging environments.
Manthan Patel, Jonas Frey, Deegan Atha
― 6 min read
KALE uses metadata to generate insightful captions for artworks.
Yanbei Jiang, Krista A. Ehinger, Jey Han Lau
― 6 min read
TrajSSL uses motion forecasting to improve 3D object detection with less labeled data.
Philip Jacobson, Yichen Xie, Mingyu Ding
― 6 min read
Exploring how LLMs improve reasoning across various data types.
Shengsheng Qian, Zuyi Zhou, Dizhan Xue
― 7 min read
Discover how FlexiTex improves 3D texture generation through visual guidance.
DaDong Jiang, Xianghui Yang, Zibo Zhao
― 6 min read
New model improves skin lesion classification accuracy using multiple data types.
Yuan Zhang, Yutong Xie, Hu Wang
― 5 min read
A new framework accurately estimates depth from single defocused images.
Jinchang Zhang, Ningning Xu, Hao Zhang
― 6 min read
Study reveals performance gaps in remote identity verification (RIdV) systems across different demographics.
Kaniz Fatima, Michael Schuckers, Gerardo Cruz-Ortiz
― 5 min read
Transformers improve classification accuracy for Autism Spectrum Disorder through advanced brain imaging analysis.
Yinchi Zhou, Peiyu Duan, Yuexi Du
― 7 min read
GCA-SUN enhances object counting in images without labeled examples.
Yuzhe Wu, Yipeng Xu, Tianyu Xu
― 5 min read
A new method reduces data needs for training robots with visual demonstrations.
Zichen Jeff Cui, Hengkai Pan, Aadhithya Iyer
― 5 min read
A new framework integrates bundle adjustment with PyTorch for improved 3D modeling.
Zitong Zhan, Huan Xu, Zihang Fang
― 6 min read
New techniques improve predictions of solar energy availability using sky images.
Leron Julian, Aswin C. Sankaranarayanan
― 6 min read
A new method blends audio and facial expressions for realistic video generation.
Sai Tanmay Reddy Chakkera, Aggelina Chatziagapi, Dimitris Samaras
― 6 min read
MoRAG enhances human motion generation from text descriptions using part-specific retrieval.
Kalakonda Sai Shashank, Shubh Maheshwari, Ravi Kiran Sarvadevabhatla
― 5 min read
Improving model efficiency in remote sensing through knowledge distillation techniques.
Yassine Himeur, Nour Aburaed, Omar Elharrouss
― 6 min read
New methods improve the separation of sea surface height measurements for better ocean dynamics analysis.
Jingwen Lyu, Yue Wang, Christian Pedersen
― 6 min read
WaveMixSR-V2 transforms low-resolution images into high-quality outputs efficiently.
Pranav Jeevan, Neeraj Nixon, Amit Sethi
― 5 min read
Introducing PAD-FT, a lightweight method to fight backdoor attacks without clean data.
Yukai Xu, Yujie Gu, Kouichi Sakurai
― 6 min read
This paper compares Vision Transformers and CNNs for classifying side-scan sonar images.
BW Sheffield, Jeffrey Ellen, Ben Whitmore
― 6 min read
LEMON allows efficient editing of 3D meshes through user input and advanced techniques.
Furkan Mert Algan, Umut Yazgan, Driton Salihu
― 5 min read
A new method enhances 3D modeling of natural surfaces using limited satellite images.
Lulin Zhang, Ewelina Rupnik, Tri Dung Nguyen
― 7 min read
ChefFusion combines multiple food-related tasks through advanced technology.
Peiyu Li, Xiaobao Huang, Yijun Tian
― 5 min read
A new method improves how robots predict future scenes and object interactions.
Juana Valeria Hurtado, Riya Mohan, Abhinav Valada
― 6 min read
A new dual-path approach enhances object recognition for robots in challenging environments.
Aneesh Chavan, Vaibhav Agrawal, Vineeth Bhat
― 5 min read
A new method improves image registration during neurosurgery.
Maximilian Fehrentz, Mohammad Farid Azampour, Reuben Dorent
― 5 min read
A new method improves 3D head models for realism and performance.
Kartik Teotia, Hyeongwoo Kim, Pablo Garrido
― 7 min read
StableMamba enhances image and video processing with improved robustness and performance.
Hamid Suleman, Syed Talal Wasim, Muzammal Naseer
― 5 min read
A new method improves camera location estimation in challenging lighting and surface conditions.
Lei Cheng, Junpeng Hu, Haodong Yan
― 4 min read