pretrained model for crosslingual

First, thank you for a great project with data in multiple languages for persona chat.

The reference link to the XNLG mentioned is well explained, but I will write it down for those who have difficulty training.


I guess that the cross-lingual model's link same as the multi-lingual model below is somewhat confusing.
https://github.com/CZWin32768/XNLG/pull/11 

> We provided the [Pre-trained XNLG models](https://drive.google.com/open?id=1kBRBdpf8nbfADYD0QU57zkEj2ss-WY_C)  for you to skip the XNLG pre-training process.



I wanted to build an en-ko model and skip pretrain steps.
After some trials, I was able to run fine-tune script (run.sh).


```
fine-tune Xpersona on English and test on Korean (using XNLG based on XLM-R)
python xnlg-ft.py --exp_name xpersona --exp_id ftOnKo --dump_path ./dump --model_path /home/zihan/XNLG/xnlg/dump/stage2_en-ko/debug2/best-valid_en-ko_mt_bleu.pth --data_path ./data/processed/XNLG --optimizer adam,lr=0.00001 --batch_size 1 --n_epochs 4 --epoch_size 3000 --max_len 120 --max_vocab 200000 --train_layers 1,5 --decode_with_vocab False --n_enc_layers 10 --n_dec_layers 6 --ds_name xpersona --train_directions en-en --eval_directions ko-ko 
```

To this, I had to get xlm 17 or 100 language model here you [linked](https://github.com/facebookresearch/XLM#pretrained-cross-lingual-language-models) and get bpe,vocab (*_xnli_100) in data folder
and run get-data-xpersona.sh 

my `crosslingual/data` folder looks like this , and finally perfectly fits for the training script. 
<img width="312" alt="스크린샷 2021-01-25 오후 6 21 59" src="https://user-images.githubusercontent.com/42016485/105686254-60b2bb00-5f3a-11eb-960c-94230f490f24.png">
<img width="232" alt="스크린샷 2021-01-25 오후 6 22 04" src="https://user-images.githubusercontent.com/42016485/105686244-5d1f3400-5f3a-11eb-8433-e2f6f9144779.png">






Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

pretrained model for crosslingual #6

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

pretrained model for crosslingual #6

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions