Update README.md
README.md CHANGED
@@ -1,3 +1,10 @@
+---
+license: mit
+datasets:
+- yahma/alpaca-cleaned
+language:
+- en
+---
 This repo contains a low-rank adapter for LLaMA-13b fit on the Stanford Alpaca dataset.
 
 This version of the weights was trained on dual RTX3090 with the following hyperparameters:
@@ -21,4 +28,8 @@ OMP_NUM_THREADS=4 WORLD_SIZE=2 CUDA_VISIBLE_DEVICES=0,1 torchrun --nproc_per_nod
 --val_set_size=0 \
 --micro_batch_size=32
 
-
+LR warmup was tuned to fit the first epoch.
+
+Instructions for running it can be found at https://github.com/tloen/alpaca-lora.
+
+![10 epochs](alpaca13b.png)
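The note added at new line 31 ("LR warmup was tuned to fit the first epoch") implies the warmup step count was sized to roughly one epoch's worth of optimizer steps. Below is a minimal sketch of that sizing, assuming a linear schedule via transformers' get_linear_schedule_with_warmup; the dataset size, effective batch size, and epoch count are illustrative assumptions, not values taken from this commit, and the actual finetuning script may use a different scheduler.

```python
# Sketch: sizing LR warmup so it spans the first epoch.
# All concrete numbers are illustrative assumptions, not values from this repo.
import torch
from transformers import get_linear_schedule_with_warmup

dataset_size = 51_760    # approx. size of yahma/alpaca-cleaned (assumption)
effective_batch = 128    # micro_batch_size * grad_accum * num_gpus (assumption)
num_epochs = 10          # assumption, echoing the "10 epochs" plot above
steps_per_epoch = dataset_size // effective_batch

model = torch.nn.Linear(8, 8)  # stand-in for the LoRA-wrapped model
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)
scheduler = get_linear_schedule_with_warmup(
    optimizer,
    num_warmup_steps=steps_per_epoch,                # warmup ends as epoch 1 ends
    num_training_steps=steps_per_epoch * num_epochs,
)
```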
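For actually running the weights, the README defers to https://github.com/tloen/alpaca-lora. As a rough sketch of how a low-rank adapter like this is typically loaded for inference with the peft library: the hub ids and the prompt format below are placeholders and assumptions, not identifiers taken from this repo.

```python
# Sketch: applying a LoRA adapter to a frozen LLaMA-13b base for inference.
# "your-org/llama-13b-hf" and "this-adapter-repo" are hypothetical placeholder ids.
import torch
from peft import PeftModel
from transformers import LlamaForCausalLM, LlamaTokenizer

base = LlamaForCausalLM.from_pretrained(
    "your-org/llama-13b-hf",    # placeholder base-model id
    torch_dtype=torch.float16,
    device_map="auto",          # requires the accelerate package
)
tokenizer = LlamaTokenizer.from_pretrained("your-org/llama-13b-hf")

# Load the low-rank adapter weights on top of the frozen base model.
model = PeftModel.from_pretrained(base, "this-adapter-repo")  # placeholder id

# Simplified Alpaca-style instruction prompt (format is an assumption).
prompt = "### Instruction:\nExplain LoRA in one sentence.\n\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```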