aino813/llm-jp-3-13b-241213-SFT-dolly-oasst-ichikara-final_lora


aino813 committed on Dec 15, 2024

Commit 0f20590 · verified · 1 Parent(s): ed61ac6

Update README.md

Files changed (1)

README.md +16 -1


@@ -62,7 +62,22 @@ This llama model was trained 2x faster with [Unsloth](https://github.com/unsloth
 5. **ichikara-instruction-003-001**
    - A Japanese single-turn instruction-response dataset for LLMs.
-   - Contains approximately 2,903 high-quality Japanese examples.
+   - Contains approximately 2,903 high-quality Japanese examples.
+# Training parameters
+| Parameter                   | Value   |
+|-----------------------------|---------|
+| LoRA_r                      | 128     |
+| LoRA_alpha                  | 256     |
+| LoRA_dropout                | 0.05    |
+| per_device_train_batch_size | 1       |
+| gradient_accumulation_steps | 16      |
+| max_steps                   | 150     |
+| warmup_ratio                | 0.1     |
+| num_train_epochs            | 2       |
+| learning_rate               | 0.0001  |
+| embedding_learning_rate     | 0.00001 |
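
Several of the added parameters interact, so a minimal Python sketch of the arithmetic they imply may help when reviewing this change. The `TrainHyperParams` class and its method names are hypothetical, the single-device default is an assumption, and the `alpha / r` scaling is the standard LoRA convention rather than something the README states. If these values are passed to Hugging Face `TrainingArguments`, a positive `max_steps` overrides `num_train_epochs`, so the run would stop after 150 optimizer steps regardless of the epoch setting.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainHyperParams:
    """Values copied from the README table; the class itself is hypothetical."""

    lora_r: int = 128
    lora_alpha: int = 256
    lora_dropout: float = 0.05
    per_device_train_batch_size: int = 1
    gradient_accumulation_steps: int = 16
    max_steps: int = 150
    warmup_ratio: float = 0.1
    num_train_epochs: int = 2
    learning_rate: float = 1e-4
    embedding_learning_rate: float = 1e-5

    def effective_batch_size(self, num_devices: int = 1) -> int:
        # Examples contributing to one optimizer step.
        # num_devices=1 is an assumption; the README does not state the hardware.
        return (
            self.per_device_train_batch_size
            * self.gradient_accumulation_steps
            * num_devices
        )

    def examples_seen(self, num_devices: int = 1) -> int:
        # Total examples consumed if training stops at max_steps.
        return self.max_steps * self.effective_batch_size(num_devices)

    def warmup_steps(self) -> int:
        # Approximate warmup length; a trainer's exact rounding may differ.
        return round(self.max_steps * self.warmup_ratio)

    def lora_scale(self) -> float:
        # Standard LoRA scales the low-rank update by alpha / r.
        return self.lora_alpha / self.lora_r


hp = TrainHyperParams()
print("effective batch size:", hp.effective_batch_size())  # 16
print("examples seen:", hp.examples_seen())                # 2400
print("warmup steps:", hp.warmup_steps())                  # 15
print("LoRA scale:", hp.lora_scale())                      # 2.0
```

Under these assumptions the run consumes about 2,400 examples in total, which is a quick sanity check against the sizes of the datasets listed in the README.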