FriendsQA dataset improves video understanding by answering complex questions from Friends episodes.
Zhengqian Wu, Ruizhe Li, Zijun Xu
― 6 min read
Cutting edge science explained simply
FriendsQA dataset improves video understanding by answering complex questions from Friends episodes.
Zhengqian Wu, Ruizhe Li, Zijun Xu
― 6 min read
Experience clothing virtually without fitting rooms or hassles.
Jeongho Kim, Hoiyeong Jin, Sunghyun Park
― 6 min read
Learn how new methods enhance the accuracy of self-driving car localization.
Vishnu Teja Kunde, Jean-Francois Chamberland, Siddharth Agarwal
― 8 min read
A new method called SHIP improves AI’s image tasks efficiently.
Haowei Zhu, Fangyuan Zhang, Rui Qin
― 6 min read
New technology converts spoken words into sign language for better communication.
Xu Wang, Shengeng Tang, Peipei Song
― 5 min read
A new tool boosts image sampling speed and accuracy in machine learning.
Prajwal Singh, Gautam Vashishtha, Indra Deep Mastan
― 6 min read
A breakthrough method improving image generation in generative modeling.
Quan Dao, Hao Phung, Trung Dao
― 7 min read
Prototypical Outlier Proxy enhances AI models' ability to detect unseen data.
Mingrong Gong, Chaoqi Chen, Qingqiang Sun
― 5 min read
Learn how to enhance image classifiers' reliability against distortions.
Dang Nguyen, Sunil Gupta, Kien Do
― 7 min read
Learn how PPN is changing autonomous car racing through real-time scene understanding.
Suwesh Prasad Sah
― 8 min read
A new benchmark enhances evaluation of text-to-image generation models.
Shuhao Han, Haotian Fan, Jiachen Fu
― 5 min read
Create unique faces from text with Dense-Face technology.
Xiao Guo, Manh Tran, Jiaxin Cheng
― 7 min read
UniPLV combines data types for smarter machine scene recognition.
Yuru Wang, Songtao Wang, Zehan Zhang
― 6 min read
A framework that simplifies visual task solutions for everyone.
Wan-Cyuan Fan, Tanzila Rahman, Leonid Sigal
― 7 min read
Innovative technology enhances understanding of retinal images for better healthcare decisions.
Teja Krishna Cherukuri, Nagur Shareef Shaik, Jyostna Devi Bodapati
― 6 min read
Combining real and synthetic data to improve pedestrian movement predictions.
Mirko Zaffaroni, Federico Signoretta, Marco Grangetto
― 7 min read
A new method that speeds up deep learning training without major changes.
Evgeny Hershkovitch Neiterman, Gil Ben-Artzi
― 6 min read
Advancements in prosthetics allow amputees to control limbs more naturally using muscle signals.
Joseph L. Betthauser, Rebecca Greene, Ananya Dhawan
― 6 min read
Learn how to blend images with artistic styles for stunning results.
Victor Kitov, Valentin Abramov, Mikhail Akhtyrchenko
― 7 min read
Teams innovate in character recognition through the DAGECC competition.
Sofia Marino, Jennifer Vandoni, Emanuel Aldea
― 7 min read
New technology improves sickle cell disease classification and diagnosis.
Victor Júnio Alcântara Cardoso, Rodrigo Moreira, João Fernando Mari
― 5 min read
ArchComplete simplifies 3D modeling, making design faster and easier for architects.
S. Rasoulzadeh, M. Bank, M. Wimmer
― 6 min read
Revolutionizing point cloud completion using Hyperbolic Chamfer Distance.
Fangzhou Lin, Songlin Hou, Haotian Liu
― 8 min read
A breakthrough in 3D imaging through self-supervised learning and OpenMind's massive dataset.
Tassilo Wald, Constantin Ulrich, Jonathan Suprijadi
― 6 min read
Discover how a single photo can create a detailed 3D face model.
Weijie Lyu, Yi Zhou, Ming-Hsuan Yang
― 7 min read
A new system tracks objects using multiple views and descriptions.
Sijia Chen, En Yu, Wenbing Tao
― 7 min read
Explore how 3D reconstruction captures human interactions in digital spaces.
Lea Müller, Hongsuk Choi, Anthony Zhang
― 6 min read
New tech combines sound and visuals for better drone detection.
Zhenyuan Xiao, Yizhuo Yang, Guili Xu
― 6 min read
Combining data types for better AI understanding and performance.
Priyaranjan Pattnayak, Hitesh Laxmichand Patel, Bhargava Kumar
― 7 min read
Exploring new technology that detects sounds from invisible sources.
Yuhang He, Sangyun Shin, Anoop Cherian
― 5 min read
HVQ enables accurate action segmentation in long videos without labeled data.
Federico Spurio, Emad Bahrami, Gianpiero Francesca
― 6 min read
A breakthrough method links language with 3D scene recognition for smarter machines.
Hao Li, Roy Qin, Zhengyu Zou
― 6 min read
A two-stage approach tackles shadow removal in images, enhancing object recognition.
Jiamin Xu, Yuxin Zheng, Zelong Li
― 6 min read
Discover the advancements in radiance field editing and its applications in various fields.
Arthur Hubert, Gamal Elghazaly, Raphael Frank
― 7 min read
Diffusion models create lifelike images, boosting medical training and protecting patient privacy.
Abdullah al Nomaan Nafi, Md. Alamgir Hossain, Rakib Hossain Rifat
― 7 min read
CoSurfGS offers a new approach to 3D reconstruction using teamwork across devices.
Yuanyuan Gao, Yalun Dai, Hao Li
― 7 min read
A new method improves realism in 3D indoor scenes.
Zixi Liang, Guowei Xu, Haifeng Wu
― 6 min read
Face anti-spoofing technology needs clearer explanations and user trust.
Haoyuan Zhang, Xiangyu Zhu, Li Gao
― 5 min read
Discover how NVS technologies are reshaping cinematography.
Adrian Azzarelli, Nantheera Anantrasirichai, David R Bull
― 8 min read
New radar tech watches movements while respecting privacy, aiding older adults.
Dylan jayabahu, Parthipan Siva
― 6 min read