cvpaper.challenge
CVPR 2021 Paper Summaries
tag: vision-and-language
VinVL: Revisiting Visual Representations in Vision-Language Models
by: Shintaro Yamamoto
Object detection
Vision and language
What if We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels
by: So Uchida
Dataset
Self supervised learning
Vision and language
Text Recognition
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
by: So Uchida
Attention
Multi modal
Recognition
Vision and language
Text Recognition
Dictionary-Guided Scene Text Recognition
by: So Uchida
Dataset
Multi modal
Recognition
Vision and language
Text Recognition
Learning Better Visual Dialog Agents With Pretrained Visual-Linguistic Representation
by: Seitaro Shinagawa
Vision and language