Corpus ID: 199453025

ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks

@inproceedings{Lu2019ViLBERTPT,
  title={ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks},
  author={Jiasen Lu and Dhruv Batra and Devi Parikh and Stefan Lee},
  booktitle={NeurIPS},
  year={2019}
}
We present ViLBERT (short for Vision-and-Language BERT), a model for learning task-agnostic joint representations of image content and natural language. [...] Key Method We pretrain our model through two proxy tasks on the large, automatically collected Conceptual Captions dataset and then transfer it to multiple established vision-and-language tasks -- visual question answering, visual commonsense reasoning, referring expressions, and caption-based image retrieval -- by making only minor additions to the base…Expand
Unifying Vision-and-Language Tasks via Text Generation
Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention
12-in-1: Multi-Task Vision and Language Representation Learning
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
SemVLP: Vision-Language Pre-training by Aligning Semantics at Multiple Levels
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training
VinVL: Making Visual Representations Matter in Vision-Language Models
ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 54 REFERENCES
VisualBERT: A Simple and Performant Baseline for Vision and Language
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
Unified Vision-Language Pre-Training for Image Captioning and VQA
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training
Modulating early visual processing by language
MAttNet: Modular Attention Network for Referring Expression Comprehension
VideoBERT: A Joint Model for Video and Language Representation Learning
Visual Dialog
...
1
2
3
4
5
...