PointRend: Image Segmentation As Rendering
Authors: Alexander Kirillov, Yuxin Wu, Kaiming He, Ross Girshick Description: We present a new method for efficient high-quality image segmentation of objects and scenes. By analogizing classical computer graphics methods for efficient rendering with over- and undersampling challenges faced in pixel labeling tasks, we develop a unique perspective of image segmentation as a rendering problem. From this vantage, we present the PointRend (Point-based Rendering) neural network module: a module that performs point-based segmentation predictions at adaptively selected locations based on an iterative subdivision algorithm. PointRend can be flexibly applied to both instance and semantic segmentation tasks by building on top of existing state-of-the-art models. While many concrete implementations of the general idea are possible, we show that a simple design already achieves excellent results. Qualitatively, PointRend outputs crisp object boundaries in regions that are over-smoothed by previous methods. Quantitatively, PointRend yields significant gains on COCO and Cityscapes, for both instance and semantic segmentation. PointRend's efficiency enables output resolutions that are otherwise impractical in terms of memory or computation compared to existing approaches. Code has been made available at https://github.com/facebookresearch/detectron2/tree/master/projects/PointRend.
Видео PointRend: Image Segmentation As Rendering канала ComputerVisionFoundation Videos
Видео PointRend: Image Segmentation As Rendering канала ComputerVisionFoundation Videos
Показать
Комментарии отсутствуют
Информация о видео
17 июля 2020 г. 14:17:39
00:04:59
Другие видео канала
![Unsupervised Representation Learning for Gaze Estimation](https://i.ytimg.com/vi/qQNU5WEt3Es/default.jpg)
![Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation](https://i.ytimg.com/vi/KnslISMQBlQ/default.jpg)
![Learning to Dress 3D People in Generative Clothing](https://i.ytimg.com/vi/NOEA-Rtq6vM/default.jpg)
![Learning Physics-Guided Face Relighting Under Directional Light](https://i.ytimg.com/vi/cYwsaUQFMU8/default.jpg)
![WACV20: Keynote Talk: Maja Pantic, Imperial College London and SAIC](https://i.ytimg.com/vi/m3rFFlRx4LE/default.jpg)
![Orthogonal Convolutional Neural Networks](https://i.ytimg.com/vi/xq4udlgu6Z4/default.jpg)
![232 - Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding](https://i.ytimg.com/vi/dW9FQnwrg_0/default.jpg)
![1276 - ClassMix: Segmentation-Based Data Augmentation for Semi-Supervised Learning](https://i.ytimg.com/vi/DmRKRHxsQfo/default.jpg)
![Match or No Match: Keypoint Filtering Based on Matching Probability](https://i.ytimg.com/vi/4jV3S04ejFc/default.jpg)
![HandVoxNet: Deep Voxel-Based Network for 3D Hand Shape and Pose Estimation From a Single Depth Map](https://i.ytimg.com/vi/MFhNBTUkxvY/default.jpg)
![DSGN: Deep Stereo Geometry Network for 3D Object Detection](https://i.ytimg.com/vi/HYoPzVEWu9A/default.jpg)
![DeepLPF: Deep Local Parametric Filters for Image Enhancement](https://i.ytimg.com/vi/Sxach3FM6FY/default.jpg)
![324 - Weakly Supervised Deep Reinforcement Learning for Video Summarization With Semantically Meani](https://i.ytimg.com/vi/gaq868XeWn8/default.jpg)
![Inverse Rendering for Complex Indoor Scenes: Shape, Spatially-Varying Lighting and SVBRDF From a...](https://i.ytimg.com/vi/RvWlDWtTozw/default.jpg)
![368 - DB-GAN: Boosting Object Recognition Under Strong Lighting Conditions](https://i.ytimg.com/vi/4SJLWWRkcKo/default.jpg)
![Neural Pose Transfer by Spatially Adaptive Instance Normalization](https://i.ytimg.com/vi/6EpmGLBzT1s/default.jpg)
![ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks](https://i.ytimg.com/vi/_Qamhop2JDU/default.jpg)
![High-Frequency Component Helps Explain the Generalization of Convolutional Neural Networks](https://i.ytimg.com/vi/8H0QQbMFb-k/default.jpg)
![1257 - Multimodal Prototypical Networks for Few-shot Learning](https://i.ytimg.com/vi/nq2yYbGIRwc/default.jpg)
![1369 - CenterFusion:Center-based Radar and Camera Fusionfor 3D Object Detection](https://i.ytimg.com/vi/tr5jyfO55U8/default.jpg)
![515 - Cinematic-L1 Video Stabilization with a Log-Homography Model](https://i.ytimg.com/vi/IPchmdyc6wg/default.jpg)