CVPR2020論文サマリ
CVPR2020論文サマリ
Boosting Semantic Human Matting With Coarse Annotations
by: Masaki Taniguchi
human matting
coarse annotated data
Guided Variational Autoencoder for Disentanglement Learning
by: hiroki tsujimoto
disentanglement
VAE
adversarial
Fashion Editing With Adversarial Parsing Learning
by: Seitaro Shinagawa
fashion
editing
GAN
in-paining
Neural Network Pruning With Residual-Connections and Limited-Data
by: Tomoro Tokusumi
pruning
枝刈り
小規模データセット
Ego-Topo: Environment Affordances From Egocentric Video
by: Katsuyuki Nakamura
Egocentric vision
Graph convolution
BDD100K: A Diverse Driving Dataset for Heterogeneous Multitask Learning
by: Hiroki Yamamoto
Dataset
Driving
Camouflaged Object Detection
by: Teppei Kurita
Object Detection
Camouflaged Object Detection
Dataset
PQ-NET: A Generative Part Seq2Seq Network for 3D Shapes
by: Hiroaki Aizawa
3D Shape Generation
Seq2Seq
Footprints and Free Space From a Single Color Image
by: Shoji Sonoyama
segmentation
depth estimation
Learning Event-Based Motion Deblurring
by: Teppei Kurita
Dynamic Vision Sensor
Event Based Camera
Deblur
A Self-supervised Approach for Adversarial Robustness
by: 福原吉博 (Yoshihiro Fukuhara)
adversarial examples
robustness
self-supervised learning
Webly Supervised Knowledge Embedding Model for Visual Reasoning
by: Shintaro Yamamoto
Visual Reasoning
Knowledge Base
Towards Achieving Adversarial Robustness by Enforcing Feature Consistency Across Bit Planes
by: 福原吉博 (Yoshihiro Fukuhara)
adversarial examples
robustness
Articulation-Aware Canonical Surface Mapping
by: Hiroaki Aizawa
Canonical Surface Mapping
Articulation
TA-Student VQA: Multi-Agents Training by Self-Questioning
by: Shintaro Yamamoto
VQA
Reinforcement Learning
Efficient Adversarial Training With Transferable Adversarial Examples
by: 福原吉博 (Yoshihiro Fukuhara)
adversarial training
adversarial example
robustness
One Man's Trash Is Another Man's Treasure: Resisting Adversarial Examples by Adversarial Examples
by: 福原吉博 (Yoshihiro Fukuhara)
adversarial examples
robustness
Auxiliary Training: Towards Accurate and Robust Models
by: 福原吉博 (Yoshihiro Fukuhara)
robustness
distillation
trade-off
STAViS: Spatio-Temporal AudioVisual Saliency Network
by: Masuyama Yoshiki
audio-visual
multi modal
saliency map
Few-Shot Class-Incremental Learning
by: Shuhei M Yoshida
few-shot learning
class-incremental learning
Progressive Mirror Detection
by: Teppei Kurita
Mirror
Mirror Detection
Object Detection
Edge Detection
Cascade EF-GAN: Progressive Facial Expression Editing With Local Focuses
by: Ho Ching Chiu
Graph Structured Network for Image-Text Matching
by: Shintaro Yamamoto
Image-Text Matching
Graph Matching
Music Gesture for Visual Sound Separation
by: Masuyama Yoshiki
audio-visual
multi modal
sound source separation
Semantic Image Manipulation Using Scene Graphs
by: Seitaro Shinagawa
image manipulation
image editing
scene graph
image generation
GAN
Adversarial Latent Autoencoders
by: Shunsuke Nakatsuka
GANs
Generative Models
Autoencoder
Representation Learning
Single-View View Synthesis With Multiplane Images
by: Hiroaki Aizawa
View Synthesis
Multiplane Image
Plug-and-Play Algorithms for Large-Scale Snapshot Compressive Imaging
by: Higaki Yoshinari
Compressive Imaging
ADMM
Deep Generative Model for Robust Imbalance Classification
by: Shunsuke Nakatsuka
Imbalanced Data
Classification
Generative Models
Learning Rank-1 Diffractive Optics for Single-Shot High Dynamic Range Imaging
by: Higaki Yoshinari
HDR
DOE
PSF
Glare
Orthogonal Convolutional Neural Networks
by: Anonymous
Violin: A Large-Scale Dataset for Video-and-Language Inference
by: Shintaro Yamamoto
Vision-and-Language
Dataset
Few-Shot Pill Recognition
by: Masanori YANO
Few-Shot Learning
Recognition
Image Classification
Segmentation
Dataset
ClusterFit: Improving Generalization of Visual Representations
by: Hirokatsu Kataoka
Self-supervised Learning
ClusterFit
In Defense of Grid Features for Visual Question Answering
by: Shintaro Yamamoto
Vision-and-Language
Attention
Visual Grounding in Video for Unsupervised Word Translation
by: Masuyama Yoshiki
multi-modal
machine translation
Attention-Guided Hierarchical Structure Aggregation for Image Matting
by: Masaki Taniguchi
alpha matting
attention
End-to-End Camera Calibration for Broadcast Videos
by: Hirokatsu Kataoka
Camera Calibration
Sports Scene
Basketball
Category-Level Articulated Object Pose Estimation
by: Hirokatsu Kataoka
Point Cloud
Depth Image
3D Object Recognition
TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model
by: pacifinapacific
MOT
Tracking
3DCNN
Robust Learning Through Cross-Task Consistency
by: Shun.ishizaka
consistency
multi-task
3D
robust
surface normals
depth
Real-Time Panoptic Segmentation From Dense Detections
by: Masaki Taniguchi
panoptic segmentation
single-shot
real-time
Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text
by: Shintaro Yamamoto
VQA
GNN
Context Aware Graph Convolution for Skeleton-Based Action Recognition
by: Mariko Nakano
ActBERT: Learning Global-Local Video-Text Representations
by: Shintaro Yamamoto
Vision-and-Language
Pre-training
HVNet: Hybrid Voxel Network for LiDAR Based 3D Object Detection
by: Higaki Yoshinari
Object detection
LiDAR
BEV
Multi-scale
MLCVNet: Multi-Level Context VoteNet for 3D Object Detection
by: Higaki Yoshinari
3D
point cloud
object detection
context
Detecting Adversarial Samples Using Influence Functions and Nearest Neighbors
by: 福原吉博 (Yoshihiro Fukuhara)
adversarial examples
robustness
adversarial detection
Generating 3D People in Scenes Without People
by: Katsuyuki Nakamura
Optical Flow in Dense Foggy Scenes Using Semi-Supervised Learning
by: Higaki Yoshinari
Optical Flow
fog
Improved Few-Shot Visual Classification
by: Shuhei M Yoshida
few-shot learning
deep metric learning
Mahalanobis distance
Seeing the World in a Bag of Chips
by: Teppei Kurita
Surface Light Fields
RGBD
Illumination Estimation
Gate-Shift Networks for Video Action Recognition
by: Anonymous
Understanding Human Hands in Contact at Internet Scale
by: Katsuyuki Nakamura
Human-object interaction
Hand detection
Dataset
From Paris to Berlin: Discovering Fashion Style Influences Around the World
by: Hirokatsu Kataoka
Fashion Trend
SNS
SAM: The Sensitivity of Attribution Methods to Hyperparameters
by: Ryo Takahashi
Future Video Synthesis With Object Motion Prediction
by: Yukitaka Tsuchiya
Video Synthesis
Spatial Transformer
GAN
Inpainting
Distortion Agnostic Deep Watermarking
by: Tomoki Tanimura
Watermarking
Robustness
Adversarial Training
Noise
Encoding
Dynamic Neural Relational Inference
by: Takehiko Ohkawa
Look-Into-Object: Self-Supervised Structure Modeling for Object Recognition
by: Hiroaki Aizawa
Domain Balancing: Face Recognition on Long-Tailed Domains
by: yamada ryosuke
face recognition
fairness
Local-Global Video-Text Interactions for Temporal Grounding
by: Shintaro Yamamoto
Temporal Grounding
Vision and Language
Exploring Unlabeled Faces for Novel Attribute Discovery
by: Shoma Iwai
Learning to Simulate Dynamic Environments With GameGAN
by: Shunsuke Nakatsuka
GANs
Generative Models
Reinforcement Learning
Height and Uprightness Invariance for 3D Prediction From a Single View
by: Hiroaki Aizawa
GhostNet: More Features From Cheap Operations
by: Teppei Kurita
Convolution
Ghost Net
Redundancy
Feature Map
Hierarchically Robust Representation Learning
by: Shuhei M Yoshida
robust optimization
representation learning
Learning User Representations for Open Vocabulary Image Hashtag Prediction
by: Shintaro Yamamoto
Image Recognition
SCATTER: Selective Context Attentional Scene Text Recognizer
by: Shintaro Yamamoto
Scene Text Recognition
Multi-Modal Domain Adaptation for Fine-Grained Action Recognition
by: Yasuhiko Tajiri
action recognition
Symmetry and Group in Attribute-Object Compositions
by: Anonymous
Learning to Cartoonize Using White-Box Cartoon Representations
by: Yukitaka Tsuchiya
GAN
Cartoon
white-box
Learning Geocentric Object Pose in Oblique Monocular Images
by: Katsuyuki Nakamura
geocentric pose
depth estimation
rectification
Learning Generative Models of Shape Handles
by: Hiroaki Aizawa
Can Deep Learning Recognize Subtle Human Activities?
by: Katsuyuki Nakamura
deep learning
activity recognition
human performance
X-Linear Attention Networks for Image Captioning
by: Seitaro Shinagawa
image-captioning
bilinear pooling
attention
Detail-recovery Image Deraining via Context Aggregation Networks
by: Seitaro Shinagawa
deraining
image-inpainting
Defending Against Universal Attacks Through Selective Feature Regeneration
by: 福原吉博 (Yoshihiro Fukuhara)
adversarial examples
robustness
universal perturbation
Attention-Based Context Aware Reasoning for Situation Recognition
by: Seitaro Shinagawa
situation recognition
DoveNet: Deep Image Harmonization via Domain Verification
by: Yukitaka Tsuchiya
image harmonization
U-Net
dataset
On the Detection of Digital Face Manipulation
by: Yukitaka Tsuchiya
deep fake
detection
dataset
attention map
Deep Semantic Clustering by Partition Confidence Maximisation
by: Asato Matsumoto
deep clustering
unsupervised
BEDSR-Net: A Deep Shadow Removal Network From a Single Document Image
by: Shintaro Yamamoto
Document Recognition
Learning From Noisy Anchors for One-Stage Object Detection
by: Hirokatsu Kataoka
Object Detection
MSCOCO
Noisy Anchor
M2m: Imbalanced Classification via Major-to-Minor Translation
by: Shintaro Yamamoto
Classification
Class-Imbalance
Moving in the Right Direction: A Regularization for Deep Metric Learning
by: Shintaro Yamamoto
Metric Learning
Copy and Paste GAN: Face Hallucination From Shaded Thumbnails
by: Hirokatsu Kataoka
GAN
Face Generation
3DV: 3D Dynamic Voxel for Action Recognition in Depth Video
by: Hiroaki Aizawa
RGBD-Dog: Predicting Canine Pose from RGBD Sensors
by: yamada ryosuke
3DSSD: Point-Based 3D Single Stage Object Detector
by: Hirokatsu Kataoka
Point Cloud
Object Detection
Interactive Multi-Label CNN Learning With Partial Labels
by: Asato Matsumoto
multi-label
partial label
smoothing
Meta-Learning of Neural Architectures for Few-Shot Learning
by: Asato Matsumoto
NSA
meta-learning
few-shot
Towards Unified INT8 Training for Convolutional Neural Network
by: Shuhei M Yoshida
quantization
quantized training
Generalized Zero-Shot Learning via Over-Complete Distribution
by: Asato Matsumoto
zero-shot learning
CVAE
Single-Step Adversarial Training With Dropout Scheduling
by: 福原吉博 (Yoshihiro Fukuhara)
adversarial training
adversarial robustness
adversarial examples
From Two Rolling Shutters to One Global Shutter
by: Teppei Kurita
Rolling Shutter
Global Shutter
CMOS
RANSAC
Joint Demosaicing and Denoising With Self Guidance
by: Teppei Kurita
JDD
Demosaic
NR
Noise Reduction
Bayer
GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping
by: 福原吉博 (Yoshihiro Fukuhara)
dataset
grasping
grasp pose prediction
PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition
by: 福原吉博 (Yoshihiro Fukuhara)
unsupervised learning
action recognition
Perceptual Quality Assessment of Smartphone Photography
by: Teppei Kurita
Perceptual Quality
Dataset
Multi-Scale Progressive Fusion Network for Single Image Deraining
by: Hiroki.Yamamoto
Deraining
Segmentation
Lightweight Photometric Stereo for Facial Details Recovery
by: Hirokatsu Kataoka
CNN
Photometric Stereo
Normal Assisted Stereo Depth Estimation
by: Teppei Kurita
Surface Normal
Depth
Multi-View
Cost Volume
3D CNN
Hit-Detector: Hierarchical Trinity Architecture Search for Object Detection
by: Hirokatsu Kataoka
Object Detection
NAS
Geometrically Principled Connections in Graph Neural Networks
by: Hirokatsu Kataoka
Graph Convolution
GCN
Geometry
Exemplar Normalization for Learning Deep Representation
by: Hirokatsu Kataoka
Exemplar Normalization
Image Recognition
Sparse Layered Graphs for Multi-Object Segmentation
by: Hirokatsu Kataoka
Segmentation
Ishikawa Layered Technique
Neural Head Reenactment with Latent Pose Descriptors
by: Anonymous
Explainable Object-Induced Action Decision for Autonomous Vehicles
by: Ryo Takahashi
Learning Fast and Robust Target Models for Video Object Segmentation
by: Yukitaka Tsuchiya
VOS
segmentation
Assessing Image Quality Issues for Real-World Problems
by: Teppei Kurita
Image Quality
Assessment
Blind
ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks
by: Hirokatsu Kataoka
Learning to Segment the Tail
by: Hirokatsu Kataoka
Instance Segmentation
LVIS dataset
Few-show Learning
CoverNet: Multimodal Behavior Prediction Using Trajectory Sets
by: Hirokatsu Kataoka
Trajectory Prediction