Video Panoptic Segmentation
Authors: Dahun Kim, Sanghyun Woo, Joon-Young Lee, In So Kweon

Description: Panoptic segmentation has become a new standard visual recognition task by unifying the previously separate semantic segmentation and instance segmentation tasks. In this paper, we propose and explore a new video extension of this task, called video panoptic segmentation. The task requires generating consistent panoptic segmentation as well as an association of instance ids across video frames. To invigorate research on this new task, we present two types of video panoptic datasets. The first is a re-organization of the synthetic VIPER dataset into the video panoptic format to exploit its large-scale pixel annotations. The second is a temporal extension of the Cityscapes validation set, providing new video panoptic annotations (Cityscapes-VPS). Moreover, we propose a novel video panoptic segmentation network (VPSNet) which jointly predicts object classes, bounding boxes, masks, instance id tracking, and semantic segmentation in video frames. To provide appropriate metrics for this task, we propose a video panoptic quality (VPQ) metric and evaluate our method and several other baselines. Experimental results demonstrate the effectiveness of the two presented datasets. We achieve state-of-the-art results in image PQ on Cityscapes and also in VPQ on Cityscapes-VPS and VIPER datasets.
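The description names the image PQ and video VPQ metrics without spelling them out. As a rough illustration only, the sketch below computes the standard per-class panoptic quality ratio: the sum of IoUs over matched segments divided by TP + 0.5 FP + 0.5 FN. The function name and its arguments are hypothetical and are not taken from the authors' code. It assumes the matching step (keeping only predicted/ground-truth pairs with IoU above 0.5) has already been done. As far as the paper describes it, VPQ applies the same ratio to tube matches, where each segment is stacked across a short window of frames under one instance id, and then averages over window sizes.

```python
def panoptic_quality(tp_ious, num_fp, num_fn):
    """Per-class PQ from already-matched segments (hypothetical helper).

    tp_ious: IoU values of true-positive matches (each assumed > 0.5).
    num_fp:  number of unmatched predicted segments.
    num_fn:  number of unmatched ground-truth segments.
    """
    denominator = len(tp_ious) + 0.5 * num_fp + 0.5 * num_fn
    if denominator == 0:
        # No segments of this class in either prediction or ground truth.
        return 0.0
    return sum(tp_ious) / denominator


# Two matched segments, one false positive, one false negative.
score = panoptic_quality([0.8, 0.6], num_fp=1, num_fn=1)
print(round(score, 4))
```

For a video variant under the same assumptions, the IoU values passed in would be computed on tubes rather than on single-frame segments, so a prediction that switches instance id mid-window lowers the score.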
Video "Video Panoptic Segmentation" from the ComputerVisionFoundation Videos channel
Video information
July 17, 2020, 14:20:21
00:05:01