スケジュール
メンバー
リソース
論文サマリ
ACL 2019
EMNLP 2019
ACL 2020
ACL 2021
cvpaper.challenge
論文サマリ
emnlp2019
tag: vision-and-language
«
‹
1
›
»
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
by: Yuta Nakamura
vision-and-language
BERT
transformer
VQA
GQA
BUTD
Incorporating Visual Semantics into Sentence Representations within a Grounded Space
by: Shintaro Yamamoto
Language Grounding
Visual Semantics
Vision and Language
«
‹
1
›
»