CVPR 2019 Paper Summaries
Feature Denoising for Improving Adversarial Robustness
by: Hideki Tsunashima
Adversarial_Examples
defense
self_attention
Unsupervised Deep Tracking
by: Hirokatsu Kataoka
Unsupervised Tracking
Correlation Filter
Siamese Network
Practical Full Resolution Learned Lossless Image Compression
by: Takaya Yamazoe
lossless
compression
parallel
Toward Convolutional Blind Denoising of Real Photographs
by: Katsuya Shimabukuro
Image Denoising
Blind Denoising
Noise Level Estimation
AE2-Nets: Autoencoder in Autoencoder Networks
by: kodai nakashima
Leveraging Shape Completion for 3D Siamese Tracking
by: Hirokatsu Kataoka
3D Siamese Network
Point Cloud
Structured Knowledge Distillation for Semantic Segmentation
by: kodai nakashima
Tell Me Where I Am: Object-Level Scene Context Prediction
by: kodai nakashima
End-To-End Time-Lapse Video Synthesis From a Single Outdoor Image
by: Hirokatsu Kataoka
Time Lapse Video
Do Better ImageNet Models Transfer Better?
by: kodai nakashima
GIF2Video: Color Dequantization and Temporal Interpolation of GIF Images
by: Hirokatsu Kataoka
GIF to Video
ScratchDet: Training Single-Shot Object Detectors From Scratch
by: Munetaka Minoguchi
Object Detection
pretrain
Cascaded Partial Decoder for Fast and Accurate Salient Object Detection
by: Munetaka Minoguchi
Salient Object Detection
A Simple Pooling-Based Design for Real-Time Salient Object Detection
by: Munetaka Minoguchi
Salient Object Detection
Grounding Human-To-Vehicle Advice for Self-Driving Vehicles
by: Takaya Yamazoe
Self Driving
HAD
Honda
NLP
Hybrid Task Cascade for Instance Segmentation
by: neka-nat
Adapting Object Detectors via Selective Cross-Domain Alignment
by: Takanori Ebihara
object recognition
cross-domain
reinforcement learning
Linkage Based Face Clustering via Graph Convolution Network
by: Hiromasa Sakata
face classification
graph
Capture, Learning, and Synthesis of 3D Speaking Styles
by: Shintaro Yamamoto
Speech synthesis
3D character animation
A Parametric Top-View Representation of Complex Road Scenes
by: Takaya Yamazoe
Driving
Top View
parametric
Explicit Bias Discovery in Visual Question Answering Models
by: Tomoki Tanimura
VQA
statistical
correlation
rule
bias
discover
mining
Learning Actor Relation Graphs for Group Activity Recognition
by: Tsubura Kazuki
GCN
Activity Recognition
Light Field Messaging With Deep Photographic Steganography
by: Hirokatsu Kataoka
When Color Constancy Goes Wrong: Correcting Improperly White-Balanced Images
by: Hirokatsu Kataoka
Events-To-Video: Bringing Modern Computer Vision to Event Cameras
by: Munetaka Minoguchi
Event Camera
Understanding and Visualizing Deep Visual Saliency Models
by: Yamada Yoshihiro
Towards Natural and Accurate Future Motion Prediction of Humans and Animals
by: Shintaro Yamamoto
Motion prediction
Reflection Removal Using a Dual-Pixel Sensor
by: Hirokatsu Kataoka
Reflection Removal
Dual Pixel Sensor
Photography
Meta-SR: A Magnification-Arbitrary Network for Super-Resolution
by: Hirokatsu Kataoka
Super Resolution
Object-Driven Text-To-Image Synthesis via Adversarial Training
by: Mitani Tomohiro
GAN
COCO
text-to-image
Learning to Calibrate Straight Lines for Fisheye Image Rectification
by: Hirokatsu Kataoka
Distortion
Fisheye Camera
Line
Sea-Thru: A Method for Removing Water From Underwater Images
by: Katsuya Shimabukuro
Image Reconstruction
Removing Water
Deep Plug-And-Play Super-Resolution for Arbitrary Blur Kernels
by: Katsuya Shimabukuro
Image Restoration
Super Resolution
Revisiting Perspective Information for Efficient Crowd Counting
by: Shuhei M Yoshida
crowd counting
perspective map
Enhanced Pix2pix Dehazing Network
by: Anonymous
Towards Accurate One-Stage Object Detection With AP-Loss
by: Ryota Nishijima
detection
one-shot
loss
Listen to the Image
by: shirouhi satoshi
Sensory Substitution devices
Generative Adversarial Networks
Monocular Depth Estimation Using Relative Depth Maps
by: Tomoki Tanimura
Depth estimation
Relative depth
SparseFool: A Few Pixels Make a Big Difference
by: Yoshihiro Fukuhara
Adversarial Example
Adversarial Attack
Sparse Attack
Deep Spectral Clustering Using Dual Autoencoder Network
by: Munetaka Minoguchi
Clustering
Deep Clustering
Unsupervised
Compressing Convolutional Neural Networks via Factorized Convolutional Filters
by: Munetaka Minoguchi
Filter Pruning
Deep Asymmetric Metric Learning via Rich Relationship Mining
by: Keito Ishihara
metric learning
graph
Unsupervised Face Normalization With Extreme Pose and Expression in the Wild
by: Shintaro Yamamoto
GAN
face recognition
Feedback Network for Image Super-Resolution
by: Masaki Miyamoto
feedback block
Image Super Resolution
SR
image
Learning Not to Learn: Training Deep Neural Networks With Biased Data
by: Yoshihiro Fukuhara
Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation
by: Takanori Ebihara
weakly supervised learning
object recognition
object segmentation
DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition
by: Shunsuke NAKATSUKA
Video Action Recognition
Cross Domain
PA3D: Pose-Action 3D Machine for Video Recognition
by: Shunsuke NAKATSUKA
Conv3D
Video Action Recognition
Pose
MOTS: Multi-Object Tracking and Segmentation
by: Shunsuke NAKATSUKA
Multi-Object Tracking and Segmentation
Dataset
Semantic Component Decomposition for Face Attribute Manipulation
by: Shintaro Yamamoto
facial attribute
image editing
Efficient Video Classification Using Fewer Frames
by: Akihiro Yoshida
video classification
distillation
Adaptively Connected Neural Networks
by: Hiroaki Aizawa
Translate-to-Recognize Networks for RGB-D Scene Recognition
by: Tenga Wakamiya
RGB-D
Scene Recognition
SR-LSTM: State Refinement for LSTM Towards Pedestrian Trajectory Prediction
by: Mitani Tomohiro
Semi-Supervised Transfer Learning for Image Rain Removal
by: Masaki Miyamoto
Semi supervised
transfer
rain removal
SIRR
deep
R3 Adversarial Network for Cross Model Face Recognition
by: Shintaro Yamamoto
feature transformation
Co-Occurrent Features in Semantic Segmentation
by: ERLYN MANGUILIMOTAN
semantic segmentation
co-occurrent features
Video Generation From Single Semantic Label Map
by: asato matsumoto
image-to-video
Video Generation
Semantic
Spherical Fractal Convolutional Neural Networks for Point Cloud Recognition
by: Daisuke Makino
Recurrent Back-Projection Network for Video Super-Resolution
by: Masaki Miyamoto
SR
RBPN
RNN
Super-Resolution
video
VSR
Viewport Proposal CNN for 360° Video Quality Assessment
by: Shintaro Yamamoto
360° video
video quality assessment
P2SGrad: Refined Gradients for Optimizing Deep Face Models
by: Shintaro Yamamoto
face recognition
gradient
A-CNN: Annularly Convolutional Neural Networks on Point Clouds
by: Shuhei M Yoshida
point cloud
annular convolution
Video Action Transformer Network
by: Hirokatsu Kataoka
Action Localization
Action Recognition
Self-attention
Multi-Granularity Generator for Temporal Action Proposal
by: Masaki Taniguchi
Temporal Action Proposal
action recognition
Polarimetric Camera Calibration Using an LCD Monitor
by: asato matsumoto
Calibration
LCD Monitor
CRF
Learning to Cluster Faces on an Affinity Graph
by: Keita Yanome
Learning to Cluster Faces on an Affinity Graph
by: Anonymous
Spectral Metric for Dataset Complexity Assessment
by: Hideki Tsunashima
Dataset Complexity
c-measure
dataset reduction
DARNet: Deep Active Ray Network for Building Segmentation
by: Shuhei M Yoshida
segmentation
active ray
DARNet
Variational Convolutional Neural Network Pruning
by: hiroki iida
Single Image Reflection Removal Beyond Linearity
by: asato matsumoto
Reflection Removal
non-linearity
synthesize
Shape Unicode: A Unified Shape Representation
by: asato matsumoto
3D
Shape Representation
Voxel
Point Cloud
Multi View
Auto Encoder
Variational Information Distillation for Knowledge Transfer
by: Tomoki Tsujimura
transfer learning
distillation
Single Image Deraining: A Comprehensive Benchmark Analysis
by: asato matsumoto
De-Raining
Dataset
Detection
Car
Analysis
PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud
by: ERLYN MANGUILIMOTAN
3D object detection
Leveraging Crowdsourced GPS Data for Road Extraction From Aerial Imagery
by: Shuhei M Yoshida
GPS
aerial imagery
Residual Networks for Light Field Image Super-Resolution
by: Yoitsu Takahashi
Light Field Image
Super-Resolution
resLF
Towards Real Scene Super-Resolution With Raw Images
by: Kobayashi Koga
Deep Sketch-Shape Hashing With Segmented 3D Stochastic Viewing
by: ERLYN MANGUILIMOTAN
Sketch-based 3D shape retrieval
Progressive Image Deraining Networks: A Better and Simpler Baseline
by: Keito Ishihara
ResNet
RNN
rain
3D Hand Shape and Pose From Images in the Wild
by: Yoitsu Takahashi
3D Hand Pose Estimation
Single RGB image
E2E
ODE-Inspired Network Design for Single Image Super-Resolution
by: Kobayashi Koga
Query-Guided End-To-End Person Search
by: ERLYN MANGUILIMOTAN
person search
person re-id
person detection
SFNet: Learning Object-Aware Semantic Correspondence
by: Anonymous
Deep Metric Learning Beyond Binary Supervision
by: Anonymous
Zero-Shot Task Transfer
by: Anonymous
Libra R-CNN: Towards Balanced Learning for Object Detection
by: ERLYN MANGUILIMOTAN
object detection
training process
imbalance
Learning a Unified Classifier Incrementally via Rebalancing
by: ERLYN MANGUILIMOTAN
incremental learning
A Simple Baseline for Audio-Visual Scene-Aware Dialog
by: Katsuya Shimabukuro
Audio-Visual Scene Aware Dialog
Dialog
Multimodal
Feature Selective Anchor-Free Module for Single-Shot Object Detection
by: ERLYN MANGUILIMOTAN
single-shot object detection
Neural Scene Decomposition for Multi-Person Motion Capture
by: Takahiro Itazuri
Neural Scene Decomposition
Bottom-Up Object Detection by Grouping Extreme and Center Points
by: ERLYN MANGUILIMOTAN
object detection
bottom-up approach
bounding box
Direct Object Recognition Without Line-Of-Sight Using Optical Coherence
by: Tenga Wakamiya
coherent light
object recognition
Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples
by: ERLYN MANGUILIMOTAN
JPEG Compression
Convolutional Relational Machine for Group Activity Recognition
by: Shunsuke NAKATSUKA
Group Activity Recognition
Relation
PEPSI: Fast Image Inpainting With Parallel Decoding Network
by: Katsuya Shimabukuro
Inpainting
Coarse-to-Fine
Hybrid Scene Compression for Visual Localization
by: Shuhei M Yoshida
Dense Intrinsic Appearance Flow for Human Pose Transfer
by: Masaki Taniguchi
Human Pose Transfer
GAN
Towards Instance-Level Image-To-Image Translation
by: Masaki Taniguchi
Image-To-Image Translation
image domain translation
PoseFix: Model-Agnostic General Human Pose Refinement Network
by: Takahiro Itazuri
pose estimation
pose refinement
Fast and Robust Multi-Person 3D Pose Estimation From Multiple Views
by: Takahiro Itazuri
3D pose estimation
BASNet: Boundary-Aware Salient Object Detection
by: Anonymous
Fast, Diverse and Accurate Image Captioning Guided by Part-Of-Speech
by: Katsuya Shimabukuro
Image Captioning
POS-tag
Deep ChArUco: Dark ChArUco Marker Pose Estimation
by: Takehiko Ohkawa
ChArUco Detection
Pose Estimation
R2GAN: Cross-Modal Recipe Retrieval With Generative Adversarial Network
by: Yukitaka Tsuchiya
GAN
cross-modal
retrieval
Unequal-Training for Deep Face Recognition With Long-Tailed Noisy Data
by: Takahiro Itazuri
face recognition
Radial Distortion Triangulation
by: Tomoki Tanimura
radial distortion
triangulation
Gröbner basis
optimization
Recursive Visual Attention in Visual Dialog
by: Ryota Natsume
Reasoning Visual Dialogs With Structural and Partial Observations
by: Ryota Natsume
Adversarial Inference for Multi-Sentence Video Description
by: Ryota Natsume
Progressive Pose Attention Transfer for Person Image Generation
by: Kyota Masuyama
GAN
Pose-Transfer
Attention
WarpGAN: Automatic Caricature Generation
by: Katsuya Shimabukuro
GAN
Style Transfer
Caricature
Warping
Rob-GAN: Generator, Discriminator, and Adversarial Attacker
by: Yukitaka Tsuchiya
GAN
Adversarial training
Blind Visual Motif Removal From a Single Image
by: Katsuya Shimabukuro
Inpainting
Blind Inpainting
Motif Removal
Robustness of 3D Deep Learning in an Adversarial Setting
by: OKIMOTO Yusuke
Learning to Regress 3D Face Shape and Expression From an Image Without 3D Supervision
by: Takahiro Itazuri
3D Face Shape Estimation
RingNet
Latent Space Autoregression for Novelty Detection
by: Shunsuke NAKATSUKA
Anomaly Detection
Autoregression
Autoencoder
Visual Question Answering as Reading Comprehension
by: Katsuya Shimabukuro
Visual Question Answering
Machine Reading Comprehension
Balanced Self-Paced Learning for Generative Adversarial Clustering Network
by: Motokawa Tetsuya
GAN
ClusterGAN
STEP: Spatio-Temporal Progressive Learning for Video Action Detection
by: Shuhei M Yoshida
action localization