Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models CVPR2023
CVPR 2023: Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models by Paul Micaelli (Edin, NVIDIA), Arash Vahdat (NVIDIA), Hongxu Yin (NVIDIA), Jan Kautz (NVIDIA), Pavlo Molchanov (NVIDIA).
We apply and extend a deep equilibrium model (DEQ) to the task of keypoints estimation. This network conducts an infinite number of iterations with self-conditioning, where the input is the output of the previous iteration, until a stopping criterion is met. It learns to iteratively enhance initial estimates. In our work, we further extend this network architecture to enable landmark estimation in videos for free (training on single images). We observe two key outcomes: (i) a significant reduction in computation as frames exhibit high similarity, and (ii) an improvement in landmark stability.
Project page: https://github.com/NVlabs/LDEQ_RwR/
Видео Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models CVPR2023 канала NVIDIA Developer
We apply and extend a deep equilibrium model (DEQ) to the task of keypoints estimation. This network conducts an infinite number of iterations with self-conditioning, where the input is the output of the previous iteration, until a stopping criterion is met. It learns to iteratively enhance initial estimates. In our work, we further extend this network architecture to enable landmark estimation in videos for free (training on single images). We observe two key outcomes: (i) a significant reduction in computation as frames exhibit high similarity, and (ii) an improvement in landmark stability.
Project page: https://github.com/NVlabs/LDEQ_RwR/
Видео Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models CVPR2023 канала NVIDIA Developer
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
TIME Best Invention of 2023: NVIDIA NeuralangeloLearning Physically Simulated Tennis Skills from Broadcast Videos | NVIDIA Research at #SIGGRAPH2023NVIDIA #GenAI Theater at #SIGGRAPH2023Microfacet Theory for Non-Uniform Heightfields | NVIDIAResearchData Free Learning of Reduced-Order KinematicsRandom-Access Neural Compression of Material Textures | NVIDIA Research PaperNVIDIA Research: Synthesizing Physical Character-Scene InteractionsCUDA Toolkit 12.2: New Accelerated Computing and Security Enhancements RevealedUniversal Scene Description (OpenUSD): Composition and LayeringNVIDIA Omniverse Administration: Tips and Techniques to Manage Your Nucleus Enterprise ServerNVIDIA Omniverse Administration: How to Deploy the Enterprise LauncherNVIDIA Omniverse Administration: Getting StartedVSLAM for Robotic Applications with NVIDIA Jetson Orin NanoUniversal Scene Description (OpenUSD): 4 Superpowers to Get You StartedATT3D: Amortized Text-to-3D Image Generation | NVIDIA ResearchZero-Shot Pose Transfer for Unrigged Stylized 3D Characters | CVPR 2023Digital Renaissance: Neuralangelo Reconstructs 3D Scenes from 2D Video Clips | from NVIDIA ResearchGlobal Vision Transformer Pruning with Hessian-Aware Saliency | CVPR 2023Adversarial Augmentation against Adversarial Attacks | CVPR 2023Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | CVPR 2023