Update README.md
README.md
More information needed

## Training procedure

The [`run_clm.py` script](https://github.com/huggingface/transformers/blob/main/examples/pytorch/language-modeling/run_clm.py) from the transformers library was used. Training was distributed across two NVIDIA Quadro RTX 6000 GPUs:

```bash
TORCH_CPP_LOG_LEVEL=INFO NCCL_DEBUG=INFO CUDA_VISIBLE_DEVICES=0,1 nohup python -m torch.distributed.launch \
  --nproc_per_node=2 run_clm.py --output_dir="./training_nen" \
  --model_type="gpt2" \
  --config_name="./training" \
  --tokenizer_name="./training" \
  --dataset_name="RaiBP/openwebtext2-first-30-chunks-english-only-examples" \
  --do_train \
  --per_device_train_batch_size 8 \
  --block_size="1024" \
  --learning_rate="5e-3" --warmup_steps="1000" \
  --adam_beta1="0.9" --adam_beta2="0.98" --weight_decay="0.01" \
  --overwrite_output_dir \
  --num_train_epochs="1" \
  --logging_steps="500" \
  --save_steps="5000" --preprocessing_num_workers="16" \
  --gradient_accumulation_steps="4" --report_to="tensorboard" \
  --logging_dir="./log_nen" > command_nen_log.log 2>&1 &
```
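
With a per-device batch size of 8, gradient accumulation of 4, and 2 GPUs, this corresponds to an effective global batch size of 8 × 2 × 4 = 64 sequences of 1024 tokens per optimizer step.

Once training finishes, the resulting checkpoint can be loaded with the standard `transformers` auto classes. The snippet below is a minimal sketch, assuming the final checkpoint sits in the `--output_dir` from the command above (`./training_nen`); substitute the Hub repository ID if the model has been pushed to the Hub.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed path: the --output_dir used in the training command above.
# Replace with the Hub repo ID if the checkpoint was uploaded.
checkpoint = "./training_nen"

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

# Generate a short continuation from a prompt.
inputs = tokenizer("The quick brown fox", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50, do_sample=True, top_k=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```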

### Training hyperparameters

The following hyperparameters were used during training: