ATT3D: Amortized Text-to-3D Image Generation | NVIDIA Research
NVIDIA Research Paper
Text-to-3D modeling has seen exciting progress by combining generative text-to-image models with image-to-3D methods like Neural Radiance Fields. DreamFusion recently achieved high-quality results but requires a lengthy, per-prompt optimization to create 3D objects. To address this, we amortize optimization over text prompts by training on many prompts simultaneously with a unified model, instead of separately.
With this, we share computation across a prompt set, training in less time than per-prompt optimization. Our framework -- Amortized text-to-3D (ATT3D) -- enables sharing of knowledge between prompts to generalize to unseen setups and smooth interpolations between text for novel assets and simple animations
For more - visit our project website: https://research.nvidia.com/labs/toronto-ai/ATT3D/
Видео ATT3D: Amortized Text-to-3D Image Generation | NVIDIA Research канала NVIDIA Developer
Text-to-3D modeling has seen exciting progress by combining generative text-to-image models with image-to-3D methods like Neural Radiance Fields. DreamFusion recently achieved high-quality results but requires a lengthy, per-prompt optimization to create 3D objects. To address this, we amortize optimization over text prompts by training on many prompts simultaneously with a unified model, instead of separately.
With this, we share computation across a prompt set, training in less time than per-prompt optimization. Our framework -- Amortized text-to-3D (ATT3D) -- enables sharing of knowledge between prompts to generalize to unseen setups and smooth interpolations between text for novel assets and simple animations
For more - visit our project website: https://research.nvidia.com/labs/toronto-ai/ATT3D/
Видео ATT3D: Amortized Text-to-3D Image Generation | NVIDIA Research канала NVIDIA Developer
Показать
Комментарии отсутствуют
Информация о видео
Другие видео канала
![TIME Best Invention of 2023: NVIDIA Neuralangelo](https://i.ytimg.com/vi/AULi-qPW9PM/default.jpg)
![Learning Physically Simulated Tennis Skills from Broadcast Videos | NVIDIA Research at #SIGGRAPH2023](https://i.ytimg.com/vi/1wF7RkPi8J8/default.jpg)
![NVIDIA #GenAI Theater at #SIGGRAPH2023](https://i.ytimg.com/vi/JhHV9PkTOzw/default.jpg)
![Microfacet Theory for Non-Uniform Heightfields | NVIDIAResearch](https://i.ytimg.com/vi/rIX4WBZUmWY/default.jpg)
![Data Free Learning of Reduced-Order Kinematics](https://i.ytimg.com/vi/2N4tNiiBDWw/default.jpg)
![Random-Access Neural Compression of Material Textures | NVIDIA Research Paper](https://i.ytimg.com/vi/DDBI0EZgh_g/default.jpg)
![NVIDIA Research: Synthesizing Physical Character-Scene Interactions](https://i.ytimg.com/vi/KyDWrnZQu0A/default.jpg)
![CUDA Toolkit 12.2: New Accelerated Computing and Security Enhancements Revealed](https://i.ytimg.com/vi/tVd4-xL6EBU/default.jpg)
![Universal Scene Description (OpenUSD): Composition and Layering](https://i.ytimg.com/vi/zRyKI55qEZY/default.jpg)
![NVIDIA Omniverse Administration: Tips and Techniques to Manage Your Nucleus Enterprise Server](https://i.ytimg.com/vi/-oKHGeMj7UE/default.jpg)
![NVIDIA Omniverse Administration: How to Deploy the Enterprise Launcher](https://i.ytimg.com/vi/4sHSIV-67DM/default.jpg)
![NVIDIA Omniverse Administration: Getting Started](https://i.ytimg.com/vi/wAInhset83M/default.jpg)
![VSLAM for Robotic Applications with NVIDIA Jetson Orin Nano](https://i.ytimg.com/vi/mNTi1osPSIA/default.jpg)
![Universal Scene Description (OpenUSD): 4 Superpowers to Get You Started](https://i.ytimg.com/vi/1RnTSZK9SwM/default.jpg)
![Zero-Shot Pose Transfer for Unrigged Stylized 3D Characters | CVPR 2023](https://i.ytimg.com/vi/ynT65hp92SE/default.jpg)
![Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models CVPR2023](https://i.ytimg.com/vi/6n95iEFZS_k/default.jpg)
![Digital Renaissance: Neuralangelo Reconstructs 3D Scenes from 2D Video Clips | from NVIDIA Research](https://i.ytimg.com/vi/R8DXpBw8hMg/default.jpg)
![Global Vision Transformer Pruning with Hessian-Aware Saliency | CVPR 2023](https://i.ytimg.com/vi/0D8O7yBw4h4/default.jpg)
![Adversarial Augmentation against Adversarial Attacks | CVPR 2023](https://i.ytimg.com/vi/pUasda84YPE/default.jpg)
![Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | CVPR 2023](https://i.ytimg.com/vi/eW2vF8o_7p0/default.jpg)