ECCV 2020 Paper Summaries
Seeing the Un-Scene: Learning Amodal Semantic Maps for Room Navigation
by: Shintaro Yamamoto
Vision and language
Connecting Vision and Language with Localized Narratives
by: Keisuke Kamahori
Multi modal
Vision and language
Table Structure Recognition using Top-Down and Bottom-Up Cues
by: Shintaro Yamamoto
Document recognition
SODA: Story Oriented Dense Video Captioning Evaluation Framework
by: Keisuke Kamahori
Video
Vision and language
Learning Joint Spatial-Temporal Transformations for Video Inpainting
by: Yukitaka Tsuchiya
GAN
Video
Inpainting
Generating Handwriting via Decoupled Style Descriptors
by: Keisuke Kamahori
Dataset
Vision and language
Comprehensive Image Captioning via Scene Graph Decomposition
by: Keisuke Kamahori
Vision and language
Adaptive Text Recognition through Visual Matching
by: Keisuke Kamahori
N-shot learning
Vision and language
Learning to Scale Multilingual Representations for Vision-Language Tasks
by: Shintaro Yamamoto
Vision and language
Sound2Sight: Generating Visual Dynamics from Sound and Context
by: Yukitaka Tsuchiya
GAN
Multi modal
Video
Sound
Transformer
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
by: Keisuke Kamahori
Dataset
Video
Vision and language
Image-based table recognition: data, model, and evaluation
by: Keisuke Kamahori
Dataset
Vision and language
Single-Shot Neural Relighting and SVBRDF Estimation
by: Teppei Kurita
Relighting
SVBRDF
Inverse Rendering