CVPR2019論文サマリ
CVPR2019論文サマリ
Feature Denoising for Improving Adversarial Robustness
by: Hideki Tsunashima
 
      Adversarial_Examples
    
      defense
    
      self_attention
    Unsupervised Deep Tracking
by: Hirokatsu Kataoka
 
      Unsupervised Tracking
    
      Correlation Filter
    
      SIamese Network
    Practical Full Resolution Learned Lossless Image Compression
by: Takaya Yamazoe
 
      lossless
    
      compression
    
      parallel
    Toward Convolutional Blind Denoising of Real Photographs
by: Katsuya Shimabukuro
 
      Image Denoising
    
      Blind Denoising
    
      Noise Level Estimation
    AE2-Nets: Autoencoder in Autoencoder Networks
by: kodai nakashima
 Leveraging Shape Completion for 3D Siamese Tracking
by: Hirokatsu Kataoka
 
      3D Siamese Network
    
      Point Cloud
    Structured Knowledge Distillation for Semantic Segmentation
by: kodai nakashima
 Tell Me Where I Am: Object-Level Scene Context Prediction
by: kodai nakashima
 End-To-End Time-Lapse Video Synthesis From a Single Outdoor Image
by: Hirokatsu Kataoka
 
      Time Lapse Video
    Do Better ImageNet Models Transfer Better?
by: kodai nakashima
 GIF2Video: Color Dequantization and Temporal Interpolation of GIF Images
by: Hirokatsu Kataoka
 
      GIF to Video
    ScratchDet: Training Single-Shot Object Detectors From Scratch
by: Munetaka Minoguchi
 
      Object Detection
    
      pretrain
    Cascaded Partial Decoder for Fast and Accurate Salient Object Detection
by: Munetaka Minoguchi
 
      Salient Object Detection
    A Simple Pooling-Based Design for Real-Time Salient Object Detection
by: Munetaka MInoguchi
 
      Salient Object Detection
    Grounding Human-To-Vehicle Advice for Self-Driving Vehicles
by: Takaya Yamazoe
 
      Self Driving
    
      HAD
    
      Honda
    
      NLP
    Hybrid Task Cascade for Instance Segmentation
by: neka-nat
 Adapting Object Detectors via Selective Cross-Domain Alignment
by: Takanori Ebihara
 
      物体認識、cross-domain、強化学習
    Linkage Based Face Clustering via Graph Convolution Network
by: Hiromasa Sakata
 
      faceclassification
    
      graph
    Capture, Learning, and Synthesis of 3D Speaking Styles
by: Shintaro Yamamoto
 
      Speech synthesis
    
      3D character animation
    A Parametric Top-View Representation of Complex Road Scenes
by: Takaya Yamazoe
 
      Driving
    
      Top View
    
      parametric
    Explicit Bias Discovery in Visual Question Answering Models
by: Tomoki Tanimura
 
      VQA
    
      statistical
    
      correlation
    
      rule
    
      bias
    
      discover
    
      mining
    Learning Actor Relation Graphs for Group Activity Recognition
by: Tsubura Kazuki
 
      GCN
    
      Activity Recognition
    Light Field Messaging With Deep Photographic Steganography
by: Hirokatsu Kataoka
 When Color Constancy Goes Wrong: Correcting Improperly White-Balanced Images
by: Hirokatsu Kataoka
 Events-To-Video: Bringing Modern Computer Vision to Event Cameras
by: Munetaka Minoguchi
 
      Event Camera
    Understanding and Visualizing Deep Visual Saliency Models
by: Yamada Yoshihiro
 Towards Natural and Accurate Future Motion Prediction of Humans and Animals
by: Shintaro Yamamoto
 
      Motion prediction
    Reflection Removal Using a Dual-Pixel Sensor
by: Hirokatsu Kataoka
 
      Reflection Removal
    
      Dual Pixel Sensor
    
      Photography
    Meta-SR: A Magnification-Arbitrary Network for Super-Resolution
by: Hirokatsu Kataoka
 
      Super Resolution
    Object-Driven Text-To-Image Synthesis via Adversarial Training
by: Mitani Tomohiro
 
      GAN
    
      COCO
    
      text-to-image
    Learning to Calibrate Straight Lines for Fisheye Image Rectification
by: Hirokatsu Kataoka
 
      Distortion
    
      Fisheye Camera
    
      Line
    Sea-Thru: A Method for Removing Water From Underwater Images
by: Katsuya Shimabukuro
 
      Image Reconstruction
    
      Removing Water
    Deep Plug-And-Play Super-Resolution for Arbitrary Blur Kernels
by: Katsuya Shimabukuro
 
      Image Restoration
    
      Super Resolution
    Revisiting Perspective Information for Efficient Crowd Counting
by: Shuhei M Yoshida
 
      crowd counting
    
      perspective map
    Enhanced Pix2pix Dehazing Network
by: Anonymous
 Towards Accurate One-Stage Object Detection With AP-Loss
by: Ryota Nishijima
 
      detection
    
      one-shot
    
      loss
    Listen to the Image
by: shirouhi satoshi
 
      Sensory Substitution devices
    
      Generation Adversarial Networks
    Monocular Depth Estimation Using Relative Depth Maps
by: Tomoki Tanimura
 
      Depth estimation
    
      Relative depth
    SparseFool: A Few Pixels Make a Big Difference
by: Yoshihiro Fukuhara
 
      Adversarial Example
    
      Adversarial Attack
    
      Sparse Attack
    Deep Spectral Clustering Using Dual Autoencoder Network
by: Munetaka Minoguchi
 
      Clustering
    
      Deep Clustering
    
      Unsupervised
    Compressing Convolutional Neural Networks via Factorized Convolutional Filters
by: Munetaka Minoguchi
 
      Filter Pruning
    Deep Asymmetric Metric Learning via Rich Relationship Mining
by: Keito Ishihara
 
      metric learning
    
      graph
    Unsupervised Face Normalization With Extreme Pose and Expression in the Wild
by: Shintaro Yamamoto
 
      GAN
    
      face recognition
    Feedback Network for Image Super-Resolution
by: Masaki Miyamoto
 
      feedback block 
    
      Image Super Resolution
    
      SR
    
      image
    Learning Not to Learn: Training Deep Neural Networks With Biased Data
by: Yoshihiro Fukuhara
 Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation
by: Takanori Ebihara
 
      弱教師あり学習
    
      物体認識
    
      物体セグメンテーション
    DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition
by: Shunsuke NAKATSUKA
 
      Video Action Recognition
    
      Cross Domain
    PA3D: Pose-Action 3D Machine for Video Recognition
by: Shunsuke NAKATSUKA
 
      Conv3D
    
      Video Action Recognition
    
      Pose
    MOTS: Multi-Object Tracking and Segmentation
by: Shunsuke NAKATSUKA
 
      Multi-Object Tracking and Segmentation
    
      Dataset
    Semantic Component Decomposition for Face Attribute Manipulation
by: Shintaro Yamamoto
 
      facial attribute
    
      image editing
    Efficient Video Classification Using Fewer Frames
by: Akihiro Yoshida
 
      video classification
    
      distillation
    Adaptively Connected Neural Networks
by: Hiroaki Aizawa
 Translate-to-Recognize Networks for RGB-D Scene Recognition
by: Tenga Wakamiya
 
      RGB-D
    
      Scene Recognition
    SR-LSTM: State Refinement for LSTM Towards Pedestrian Trajectory Prediction
by: Mitani Tomohiro
 Semi-Supervised Transfer Learning for Image Rain Removal
by: Masaki Miyamoto
 
      Semi supervised
    
      transfer
    
      rain removal
    
      SIRR
    
      deep
    R3 Adversarial Network for Cross Model Face Recognition
by: Shintaro Yamamoto
 
      feature transformation
    Co-Occurrent Features in Semantic Segmentation
by: ERLYN MANGUILIMOTAN
 
      semantic segmentation
    
      co-occurent features
    Video Generation From Single Semantic Label Map
by: asato matsumoto
 
      image-to-video
    
      Video Generation
    
      Semantic
    Spherical Fractal Convolutional Neural Networks for Point Cloud Recognition
by: Daisuke Makino
 Recurrent Back-Projection Network for Video Super-Resolution
by: Masaki Miyamoto
 
      SR
    
      RBPN
    
      RNN
    
      Super-Resolution
    
      video
    
      VSR
    Viewport Proposal CNN for 360deg Video Quality Assessment
by: Shintaro Yamamoto
 
      360° video
    
      video quality assessment
    P2SGrad: Refined Gradients for Optimizing Deep Face Models
by: Shintaro Yamamoto
 
      face recognition
    
      gradient
    A-CNN: Annularly Convolutional Neural Networks on Point Clouds
by: Shuhei M Yoshida
 
      point cloud
    
      annular convolution
    Video Action Transformer Network
by: Hirokatsu Kataoka
 
      Action Localization
    
      ActionRecognition
    
      Self-attention
    Multi-Granularity Generator for Temporal Action Proposal
by: Masaki Taniguchi
 
      Temporal Action Proposal
    
      行動認識
    Polarimetric Camera Calibration Using an LCD Monitor
by: asato matsumoto
 
      Calibration
    
      LCD Monitor
    
      CRF
    Learning to Cluster Faces on an Affinity Graph
by: Keita Yanome
 Learning to Cluster Faces on an Affinity Graph
by: Anonymous
 Spectral Metric for Dataset Complexity Assessment
by: Hideki Tsunashima
 
      Dataset Complexity
    
      c-measure
    
      dataset reduction
    DARNet: Deep Active Ray Network for Building Segmentation
by: Shuhei M Yoshida
 
      segmentation
    
      active ray
    
      DARNet
    Variational Convolutional Neural Network Pruning
by: hiroki iida
 Single Image Reflection Removal Beyond Linearity
by: asato matsumoto
 
      反射除去
    
      Reflection Removal
    
      non-linearity
    
      synthesize
    Shape Unicode: A Unified Shape Representation
by: asato matsumoto
 
      3D
    
      Shape Representaion
    
      Voxel
    
      Point Cloud
    
      Multi View
    
      Auto Encoder
    Variational Information Distillation for Knowledge Transfer
by: Tomoki Tsujimura
 
      transfer learning
    
      distillation
    Single Image Deraining: A Comprehensive Benchmark Analysis
by: asato matsumoto
 
      De-Raining
    
      Dataset
    
      Detection
    
      Car
    
      Analysis
    PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud
by: ERLYN MANGUILIMOTAN
 
      3D object detection
    Leveraging Crowdsourced GPS Data for Road Extraction From Aerial Imagery
by: Shuhei M Yoshida
 
      GPS
    
      航空写真
    Residual Networks for Light Field Image Super-Resolution
by: Yoitsu Takahashi
 
      Light Field Image
    
      Super-Resolution
    
      resLF
    Towards Real Scene Super-Resolution With Raw Images
by: Kobayashi Koga
 Deep Sketch-Shape Hashing With Segmented 3D Stochastic Viewing
by: ERLYN MANGUILIMOTAN
 
      Sketch-based 3D shape retrieval
    Progressive Image Deraining Networks: A Better and Simpler Baseline
by: Keito Ishihara
 
      ResNet
    
      RNN
    
      rain
    3D Hand Shape and Pose From Images in the Wild
by: Yoitsu Takahashi
 
      3D Hand Pose Estimation
    
      Single RGB image
    
      E2E
    ODE-Inspired Network Design for Single Image Super-Resolution
by: Kobayashi Koga
 Query-Guided End-To-End Person Search
by: ERLYN MANGUILIMOTAN
 
      person search
    
      person re-id
    
      person detection
    SFNet: Learning Object-Aware Semantic Correspondence
by: Anonymous
 Deep Metric Learning Beyond Binary Supervision
by: Anonymous
 Zero-Shot Task Transfer
by: Anonymous
 Libra R-CNN: Towards Balanced Learning for Object Detection
by: ERLYN MANGUILIMOTAN
 
      object detection
    
      training process
    
      imbalance
    Learning a Unified Classifier Incrementally via Rebalancing
by: ERLYN MANGUILIMOTAN
 
      incremental learning
    A Simple Baseline for Audio-Visual Scene-Aware Dialog
by: Katsuya Shimabukuro
 
      Audio-Visual Scene Aware Dialog
    
      Dialog
    
      Multimodal
    Feature Selective Anchor-Free Module for Single-Shot Object Detection
by: ERLYN MANGUILIMOTAN
 
      single-shot object detection
    Neural Scene Decomposition for Multi-Person Motion Capture
by: Takahiro Itazuri
 
      Neural Scene Decomposition
    Bottom-Up Object Detection by Grouping Extreme and Center Points
by: ERLYN MANGUILIMOTAN
 
      object detection
    
      bottom-up approach
    
      bound box
    Direct Object Recognition Without Line-Of-Sight Using Optical Coherence
by: Tenga Wakamiya
 
      コヒーレント光
    
      物体認識
    Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples
by: ERLYN MANGUILIMOTAN
 
      JPEG Compression
    Convolutional Relational Machine for Group Activity Recognition
by: Shunsuke NAKATSUKA
 
      Group Activity Recognition
    
      Relation
    PEPSI : Fast Image Inpainting With Parallel Decoding Network
by: Katsuya Shimabukuro
 
      Inpainting
    
      Coarse-to-Fine
    Hybrid Scene Compression for Visual Localization
by: Shuhei M Yoshida
 Dense Intrinsic Appearance Flow for Human Pose Transfer
by: Masaki Taniguchi
 
      Human Pose Transfer
    
      GAN
    Towards Instance-Level Image-To-Image Translation
by: Masaki Taniguchi
 
      Image-To-Image Translation
    
      画像ドメイン変換
    PoseFix: Model-Agnostic General Human Pose Refinement Network
by: Takahiro Itazuri
 
      pose estimation
    
      pose refinement
    Fast and Robust Multi-Person 3D Pose Estimation From Multiple Views
by: Takahiro Itazuri
 
      3D pose estimation
    BASNet: Boundary-Aware Salient Object Detection
by: Anonymous
 Fast, Diverse and Accurate Image Captioning Guided by Part-Of-Speech
by: Katsuya Shimabukuro
 
      Image Captioning
    
      POS-tag
    Deep ChArUco: Dark ChArUco Marker Pose Estimation
by: Takehiko Ohkawa
 
      ChArUco Detection
    
      Pose Estimation
    R2GAN: Cross-Modal Recipe Retrieval With Generative Adversarial Network
by: Yukitaka Tsuchiya
 
      GAN
    
      cross-modal
    
      retrieval
    Unequal-Training for Deep Face Recognition With Long-Tailed Noisy Data
by: Takahiro Itazuri
 
      face recognition
    Radial Distortion Triangulation
by: Tomoki Tanimura
 
      radial distortion
    
      triangulation
    
      grobner basis
    
      optimization
    Recursive Visual Attention in Visual Dialog
by: Ryota Natsume
 Reasoning Visual Dialogs With Structural and Partial Observations
by: Ryota Natsume
 Adversarial Inference for Multi-Sentence Video Description
by: Ryota Natsume
 Progressive Pose Attention Transfer for Person Image Generation
by: Kyota Masuyama
 
      GAN
    
      Pose-Transfer
    
      Attention
    WarpGAN: Automatic Caricature Generation
by: Katsuya Shimabukuro
 
      GAN
    
      Style Transfer
    
      Caricature
    
      Warping
    Rob-GAN: Generator, Discriminator, and Adversarial Attacker
by: Yukitaka Tsuchiya
 
      GAN
    
      Adversarial training
    Blind Visual Motif Removal From a Single Image
by: Katsuya Shimabukuro
 
      Inpainting
    
      Blind Inpainting
    
      Motif Removal
    Robustness of 3D Deep Learning in an Adversarial Setting
by: OKIMOTO Yusuke
 PoseFix: Model-Agnostic General Human Pose Refinement Network
by: Takahiro Itazuri
 
      3D Face Shape Estimation
    
      RingNet
    Latent Space Autoregression for Novelty Detection
by: Shunsuke NAKATSUKA
 
      Anomaly Detection
    
      Autoregression
    
      Autoencoder
    Visual Question Answering as Reading Comprehension
by: Katsuya Shimabukuro
 
      Visual Question Answering
    
      Machine Reading Comprehension
    Balanced Self-Paced Learning for Generative Adversarial Clustering Network
by: Motokawa Tetsuya
 
      GAN
    
      ClusterGAN
    Balanced Self-Paced Learning for Generative Adversarial Clustering Network
by: Motokawa Tetsuya
 
      GAN
    
      ClusterGAN
    STEP: Spatio-Temporal Progressive Learning for Video Action Detection
by: Shuhei M Yoshida
 
      action localization