This repository was archived by the owner on Sep 10, 2025. It is now read-only.
Experimental machine translation example #864

Open

akurniawan wants to merge 35 commits into `pytorch:main` from `akurniawan:translation-example`
Changes from 31 commits

Commits (35, all by akurniawan):
- `3ab5c1e` Merge pull request #1 from pytorch/master
- `db1557f` Merge branch 'master' of https://github.com/pytorch/text
- `d2bac2b` Merge branch 'master' of https://github.com/pytorch/text
- `0a39944` Merge branch 'master' of https://github.com/pytorch/text
- `9259228` Merge branch 'master' of https://github.com/pytorch/text
- `4791ccf` first commit for machine translation example
- `9dbb558` adding word version for target
- `8ba7975` add word vocab to the training dataset
- `780d837` wrapping up training and evaluation code
- `6af29b1` add README
- `55523e2` add bleu score
- `24b4534` add device to inputs
- `d35b12f` run full training
- `5fa01e5` add tqdm for training and evaluation bar visualization
- `90e3200` add seed to ensure reproducibility
- `1d5a0ce` add remove extra whitespace preprocessing
- `1e6cb1f` add param on testing data
- `ea94065` fix printing format in test
- `e2a2386` add result for the machine translation example
- `7e89051` add argparse and move collate fn
- `17da84b` rename train.py to train_char to differentiate between character leve…
- `fd81ecb` add train_word for word level training in machine translation
- `78ef553` add more complete todo message
- `66428da` add case to handle whitespaces
- `88d6332` fix wrong calculation by removing first index
- `1e14007` fix wrong learning rate
- `b72f530` add saving functionality
- `b4f9851` Merge branch 'master' of https://github.com/pytorch/text into transla…
- `55eec60` change wrong index in testing data
- `27dbf43` add complete experiments output for both char and word version
- `2724231` Merge branch 'master' of https://github.com/pytorch/text into transla…
- `0aa87f4` add more explanations on char vs word example
- `ae52c0d` Merge branch 'master' of https://github.com/pytorch/text into transla…
- `5eba34e` char_transform with partial and map
- `0886a68` remove unused imports
# Machine translation example

This example creates a machine translation dataset and trains a translation model. It uses the raw training files from the Multi30k dataset to train the model with the character composition method.

To try the example, run:

```bash
python train_char.py
```

for character-level training, and
> **Contributor:** What's the difference between "character" level vs "word" level training? Better to be more clear with more doc here.
>
> **Author:** added more explanation
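To make the distinction the reviewer asks about concrete, here is a minimal, hypothetical illustration (the sentence and variable names are mine, not taken from the PR): word-level training feeds the model one token per word, while character-level training feeds one token per character, trading a much smaller vocabulary for much longer sequences.

```python
# Hypothetical example sentence; not taken from the Multi30k data.
sentence = "two young people"

word_tokens = sentence.split()  # word level: one token per word
char_tokens = list(sentence)    # character level: one token per character

print(word_tokens)       # ['two', 'young', 'people']
print(len(char_tokens))  # 16 tokens for the same sentence
```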
```bash
python train_word.py
```

for word-level training.
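The commit history mentions moving a collate function; any such function for this task has to pad variable-length sentences so they can be batched together. A minimal sketch in plain Python, assuming a pad index of 0 (the actual pad index used by the example's code is not shown here):

```python
PAD_ID = 0  # assumed padding index; the real value would come from the vocab

def pad_batch(seqs, pad_id=PAD_ID):
    """Pad a batch of token-id lists to the length of the longest sequence."""
    max_len = max(len(s) for s in seqs)
    return [s + [pad_id] * (max_len - len(s)) for s in seqs]

print(pad_batch([[5, 6], [7, 8, 9]]))  # [[5, 6, 0], [7, 8, 9]]
```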
## Experiment Result

The following is the output example for running `train_char.py`:
```
The model has 5,617,503 trainable parameters
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [01:54<00:00, 1.98it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:03<00:00, 2.37it/s]
Epoch: 01 | Time: 1m 57s
Train Loss: 5.277 | Train PPL: 195.798 | Train BLEU: 0.001
Val. Loss: 4.088 | Val. PPL: 59.598 | Val. BLEU: 0.006
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:12<00:00, 1.72it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:04<00:00, 1.87it/s]
Epoch: 02 | Time: 2m 16s
Train Loss: 3.711 | Train PPL: 40.877 | Train BLEU: 0.022
Val. Loss: 2.964 | Val. PPL: 19.369 | Val. BLEU: 0.048
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:14<00:00, 1.69it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:04<00:00, 1.89it/s]
Epoch: 03 | Time: 2m 18s
Train Loss: 2.901 | Train PPL: 18.189 | Train BLEU: 0.055
Val. Loss: 2.172 | Val. PPL: 8.774 | Val. BLEU: 0.111
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:18<00:00, 1.64it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:03<00:00, 2.00it/s]
Epoch: 04 | Time: 2m 21s
Train Loss: 2.391 | Train PPL: 10.927 | Train BLEU: 0.092
Val. Loss: 1.766 | Val. PPL: 5.849 | Val. BLEU: 0.164
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:19<00:00, 1.63it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:04<00:00, 1.98it/s]
Epoch: 05 | Time: 2m 23s
Train Loss: 2.085 | Train PPL: 8.042 | Train BLEU: 0.118
Val. Loss: 1.503 | Val. PPL: 4.494 | Val. BLEU: 0.196
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:20<00:00, 1.61it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:04<00:00, 1.99it/s]
Epoch: 06 | Time: 2m 24s
Train Loss: 1.856 | Train PPL: 6.398 | Train BLEU: 0.140
Val. Loss: 1.302 | Val. PPL: 3.678 | Val. BLEU: 0.229
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:21<00:00, 1.60it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:03<00:00, 2.02it/s]
Epoch: 07 | Time: 2m 25s
Train Loss: 1.683 | Train PPL: 5.383 | Train BLEU: 0.157
Val. Loss: 1.164 | Val. PPL: 3.202 | Val. BLEU: 0.250
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:23<00:00, 1.59it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:03<00:00, 2.03it/s]
Epoch: 08 | Time: 2m 26s
Train Loss: 1.554 | Train PPL: 4.730 | Train BLEU: 0.168
Val. Loss: 1.075 | Val. PPL: 2.930 | Val. BLEU: 0.263
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:24<00:00, 1.57it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:03<00:00, 2.02it/s]
Epoch: 09 | Time: 2m 28s
Train Loss: 1.455 | Train PPL: 4.283 | Train BLEU: 0.178
Val. Loss: 1.016 | Val. PPL: 2.763 | Val. BLEU: 0.271
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:25<00:00, 1.56it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:03<00:00, 2.03it/s]
Epoch: 10 | Time: 2m 29s
Train Loss: 1.373 | Train PPL: 3.948 | Train BLEU: 0.187
Val. Loss: 0.972 | Val. PPL: 2.644 | Val. BLEU: 0.280
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:04<00:00, 1.95it/s]
| Test Loss: 1.011 | Test PPL: 2.748 | Test BLEU: 0.273
Saving model to char_mt_seq2seq.pt
Save vocab to torchtext_char_mt_vocab.pt
```
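The PPL columns above are simply the exponential of the corresponding cross-entropy loss, which gives a quick sanity check on the log:

```python
import math

def perplexity(cross_entropy_loss):
    # Perplexity is exp(loss) when loss is the mean per-token cross-entropy.
    return math.exp(cross_entropy_loss)

print(round(perplexity(5.277), 1))  # ~195.8, the epoch-1 Train PPL above
```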
And the following is the output of `train_word.py`:
```
The model has 14,601,140 trainable parameters
0%| | 0/227 [00:00<?, ?it/s]/home/akurniawan/text/examples/machine_translation/utils.py:38: UserWarning: To copy construct from a tensor, it is recommended to use sourceTensor.clone().detach() or sourceTensor.clone().detach().requires_grad_(True), rather than torch.tensor(sourceTensor).
txt = list(map(torch.tensor, input))
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:14<00:00, 1.69it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:04<00:00, 1.61it/s]
Epoch: 01 | Time: 2m 19s
Train Loss: 3.796 | Train PPL: 44.519 | Train BLEU: 0.139
Val. Loss: 1.480 | Val. PPL: 4.391 | Val. BLEU: 0.315
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:34<00:00, 1.46it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:05<00:00, 1.56it/s]
Epoch: 02 | Time: 2m 40s
Train Loss: 1.068 | Train PPL: 2.909 | Train BLEU: 0.346
Val. Loss: 0.748 | Val. PPL: 2.113 | Val. BLEU: 0.395
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:36<00:00, 1.45it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:05<00:00, 1.60it/s]
Epoch: 03 | Time: 2m 41s
Train Loss: 0.604 | Train PPL: 1.830 | Train BLEU: 0.398
Val. Loss: 0.476 | Val. PPL: 1.610 | Val. BLEU: 0.415
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:45<00:00, 1.37it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:04<00:00, 1.62it/s]
Epoch: 04 | Time: 2m 50s
Train Loss: 0.390 | Train PPL: 1.477 | Train BLEU: 0.413
Val. Loss: 0.348 | Val. PPL: 1.416 | Val. BLEU: 0.423
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:57<00:00, 1.28it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:05<00:00, 1.53it/s]
Epoch: 05 | Time: 3m 2s
Train Loss: 0.275 | Train PPL: 1.316 | Train BLEU: 0.422
Val. Loss: 0.278 | Val. PPL: 1.321 | Val. BLEU: 0.430
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:48<00:00, 1.35it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:05<00:00, 1.44it/s]
Epoch: 06 | Time: 2m 53s
Train Loss: 0.203 | Train PPL: 1.225 | Train BLEU: 0.429
Val. Loss: 0.237 | Val. PPL: 1.267 | Val. BLEU: 0.433
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:58<00:00, 1.27it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:05<00:00, 1.46it/s]
Epoch: 07 | Time: 3m 4s
Train Loss: 0.151 | Train PPL: 1.164 | Train BLEU: 0.434
Val. Loss: 0.213 | Val. PPL: 1.238 | Val. BLEU: 0.434
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:49<00:00, 1.34it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:05<00:00, 1.52it/s]
Epoch: 08 | Time: 2m 54s
Train Loss: 0.114 | Train PPL: 1.120 | Train BLEU: 0.437
Val. Loss: 0.198 | Val. PPL: 1.218 | Val. BLEU: 0.434
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:53<00:00, 1.31it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:05<00:00, 1.52it/s]
Epoch: 09 | Time: 2m 58s
Train Loss: 0.085 | Train PPL: 1.088 | Train BLEU: 0.441
Val. Loss: 0.187 | Val. PPL: 1.205 | Val. BLEU: 0.435
100%|█████████████████████████████████████████████████████████████████████████████████| 227/227 [02:52<00:00, 1.31it/s]
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:05<00:00, 1.49it/s]
Epoch: 10 | Time: 2m 58s
Train Loss: 0.062 | Train PPL: 1.064 | Train BLEU: 0.442
Val. Loss: 0.182 | Val. PPL: 1.200 | Val. BLEU: 0.435
100%|█████████████████████████████████████████████████████████████████████████████████████| 8/8 [00:05<00:00, 1.44it/s]
| Test Loss: 0.198 | Test PPL: 1.219 | Test BLEU: 0.420
Saving model to mt_seq2seq.pt
Save vocab to torchtext_mt_vocab.pt
```
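The UserWarning at the top of this log comes from calling `torch.tensor` on elements that may already be tensors. A hedged sketch of the fix the warning itself suggests (the function name is mine, not from `utils.py`):

```python
import torch

def to_tensor_list(batch):
    # torch.as_tensor converts plain lists of token ids without a copy
    # warning, and clone().detach() is the recommended way to copy an
    # existing tensor, per the UserWarning message in the log above.
    return [x.clone().detach() if torch.is_tensor(x) else torch.as_tensor(x)
            for x in batch]

out = to_tensor_list([[1, 2, 3], torch.tensor([4, 5])])
print([t.tolist() for t in out])  # [[1, 2, 3], [4, 5]]
```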
Review discussion:

> **Reviewer:** Should we include a metric for the test/valid datasets with the trained model? See my comments about `bleu_score` below.
>
> **Author:** That would be better. However, I may not be able to run a full-blown training as my resources are quite limited. Do you have any suggestions?
>
> **Reviewer:** Never mind. I will find time and work on it this half. Then I can update this. Just to make sure that you set up the model/training correctly by checking the learning curve.
>
> **Author:** Got it. Sorry for the trouble 🙏
>
> **Author:** Borrowed a resource to run 10 epochs; the results are already in the README. wdyt?
>
> **Reviewer:** It might be a little bit too long. Should we just include the final test result?
>
> **Author:** Instead of removing the whole training metrics from the docs, I trimmed them so that only the first and the last training outputs are included, to give users some idea of the loss values while running the example.
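The `bleu_score` the reviewer mentions refers to torchtext's corpus BLEU metric. For readers unfamiliar with the metric, here is a self-contained, simplified sketch of what corpus BLEU computes (uniform n-gram weights, a single reference per sentence; this is an illustration, not the torchtext implementation):

```python
import math
from collections import Counter

def ngrams(tokens, n):
    """All contiguous n-grams of a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def bleu(candidates, references, max_n=4):
    """Corpus-level BLEU with uniform weights and a brevity penalty.

    candidates/references: parallel lists of token lists (one reference each).
    """
    p_nums = [0] * max_n   # clipped n-gram matches, per order
    p_dens = [0] * max_n   # candidate n-gram counts, per order
    cand_len = ref_len = 0
    for cand, ref in zip(candidates, references):
        cand_len += len(cand)
        ref_len += len(ref)
        for n in range(1, max_n + 1):
            c_counts = Counter(ngrams(cand, n))
            r_counts = Counter(ngrams(ref, n))
            p_nums[n - 1] += sum(min(c, r_counts[g]) for g, c in c_counts.items())
            p_dens[n - 1] += max(len(cand) - n + 1, 0)
    if min(p_nums) == 0:   # any order with zero matches drives BLEU to 0
        return 0.0
    log_p = sum(math.log(num / den) for num, den in zip(p_nums, p_dens)) / max_n
    bp = 1.0 if cand_len > ref_len else math.exp(1 - ref_len / cand_len)
    return bp * math.exp(log_p)
```

A perfect match scores 1.0; a candidate with no 4-gram (or higher-order) overlap scores 0.0 under this simplified formulation.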