Commit b4d93f2

[docs] Add UNITER model to website
Add citation and training instructions under projects/uniter. ghstack-source-id: 96dd13e Pull Request resolved: #1144
1 parent 69d0a2a commit b4d93f2

File tree

3 files changed (+40 −1 lines)


website/docs/notes/model_zoo.md

Lines changed: 1 addition & 0 deletions
```diff
@@ -21,6 +21,7 @@ Here is the list of models currently implemented in MMF:
 | Movie MCAN | movie_mcan | vqa2 | [paper](https://arxiv.org/abs/2004.11883) |
 | Pythia | pythia | textvqa, vizwiz, vqa2, visual_genome | [paper](https://arxiv.org/abs/1904.08920) |
 | Unimodal | unimodal | hateful_memes | |
+| UNITER | uniter | vqa2, masked_coco | [paper](https://arxiv.org/abs/1909.11740) |
 | VilBERT | vilbert | hateful_memes, coco, conceptual_captions, vqa2, mmimdb, nlvr2, visual_entailment, vizwiz, vqa2 | [paper](https://arxiv.org/abs/1908.02265) |
 | Visual BERT | visual_bert | gqa, hateful_memes, localized_narratives, coco, conceptual_captions, sbu, vqa2, mmimdb, nlvr2, visual_entailment, vizwiz | [paper](https://arxiv.org/abs/1908.03557) |
```

website/docs/projects/uniter.md

Lines changed: 38 additions & 0 deletions
```diff
@@ -0,0 +1,38 @@
+---
+id: uniter
+sidebar_label: UNITER
+title: "UNITER: UNiversal Image-TExt Representation Learning"
+---
+
+This repository contains the code for a PyTorch implementation of the UNITER model, released originally in this [repo](https://github.com/ChenRocks/UNITER). Please cite the following paper if you are using the UNITER model from MMF:
+
+* Chen, Y.-C., Li, L., Yu, L., Kholy, A. E., Ahmed, F., Gan, Z., Cheng, Y., and Liu, J. *UNITER: Universal Image-Text Representation Learning.* In European Conference on Computer Vision, 2020. ([arXiv](https://arxiv.org/pdf/1909.11740))
+
+```
+@inproceedings{chen2020uniter,
+  title={Uniter: Universal image-text representation learning},
+  author={Chen, Yen-Chun and Li, Linjie and Yu, Licheng and Kholy, Ahmed El and Ahmed, Faisal and Gan, Zhe and Cheng, Yu and Liu, Jingjing},
+  booktitle={ECCV},
+  year={2020}
+}
+```
+
+## Installation
+
+Follow the installation instructions in the [documentation](https://mmf.readthedocs.io/en/latest/notes/installation.html).
+
+## Training
+
+To train a fresh UNITER model on the VQA2.0 dataset, run the following command:
+
+```
+mmf_run config=projects/uniter/configs/vqa2/defaults.yaml run_type=train_val dataset=vqa2 model=uniter
+```
+
+To pretrain UNITER on the masked COCO dataset, run the following command:
+
+```
+mmf_run config=projects/uniter/configs/masked_coco/defaults.yaml run_type=train_val dataset=masked_coco model=uniter
+```
+
+Based on the config used and the `do_pretraining` option defined in the config, the model can use the pretraining recipe described in the UNITER paper, or be finetuned on downstream tasks.
```
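For illustration, the pretraining switch could live in the model section of the config along these lines. This is a hypothetical sketch: only `do_pretraining` is named in this commit, and the surrounding `model_config.uniter` nesting is assumed, not copied from the actual project configs.

```yaml
# Hypothetical sketch -- key names other than `do_pretraining` are assumptions.
model_config:
  uniter:
    # true: use the masked pretraining recipe from the UNITER paper
    # false: finetune on the downstream task (e.g. VQA2.0)
    do_pretraining: true
```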

website/sidebars.js

Lines changed: 1 addition & 1 deletion
```diff
@@ -45,6 +45,6 @@ module.exports = {
       'challenges/textvqa_challenge',
       'challenges/vqa_challenge',
     ],
-    Projects: ['projects/butd', 'projects/m4c', 'projects/m4c_captioner', 'projects/movie_mcan', 'projects/unit'],
+    Projects: ['projects/butd', 'projects/m4c', 'projects/m4c_captioner', 'projects/movie_mcan', 'projects/unit', 'projects/uniter'],
   },
 };
```
