Source Code

#3
by asgharmustafa92 - opened

Hi, is the source code used to pre-train and fine-tune this model from the base BERT model available for use?
Thanks.

rttl labs org

Hi, for task-adaptive pre-training (adapting the model to Yelp reviews), you can use:
https://github.com/huggingface/transformers/blob/main/examples/pytorch/language-modeling/run_mlm.py
The mlm_probability value sets the fraction of tokens that get masked, so raising it makes the prediction task harder.
To switch to whole-word masking (WWM), you just need to use DataCollatorForWholeWordMask or switch to this script:
https://github.com/huggingface/transformers/blob/main/examples/research_projects/mlm_wwm/run_mlm_wwm.py
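
To make the difference between the two masking modes concrete, here is a minimal pure-Python sketch of what whole-word masking does with WordPiece tokens. It is only an illustration, not the code from either script: the helper names `group_whole_words` and `whole_word_mask` are made up for this example, special tokens such as [CLS] and [SEP] are ignored, and every selected token is replaced with [MASK] (the library collators also sometimes keep the token or swap in a random one).

```python
import random

MASK = "[MASK]"


def group_whole_words(tokens):
    """Group token indices into words; a '##' prefix marks a continuation piece."""
    groups = []
    for i, tok in enumerate(tokens):
        if tok.startswith("##") and groups:
            groups[-1].append(i)  # attach the piece to the word it continues
        else:
            groups.append([i])  # start a new word
    return groups


def whole_word_mask(tokens, mlm_probability=0.15, seed=None):
    """Mask complete words until roughly mlm_probability of the tokens are masked."""
    rng = random.Random(seed)
    groups = group_whole_words(tokens)
    rng.shuffle(groups)

    # Number of tokens we are allowed to mask (at least one).
    budget = max(1, round(len(tokens) * mlm_probability))

    masked = list(tokens)
    n_masked = 0
    for group in groups:
        if n_masked >= budget:
            break
        if n_masked + len(group) > budget:
            continue  # this word would exceed the budget, try a shorter one
        for i in group:
            masked[i] = MASK
        n_masked += len(group)
    return masked


if __name__ == "__main__":
    tokens = ["the", "pizza", "was", "un", "##believ", "##ably", "good"]
    print(whole_word_mask(tokens, mlm_probability=0.5, seed=0))
```

With plain token-level masking, "un", "##believ" and "##ably" could be masked independently; here they are always masked or left alone together. A higher mlm_probability simply raises the masking budget.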

Thanks for sharing. I'll take a look :)

asgharmustafa92 changed discussion status to closed
