moreh
/

MoMo-72B-lora-1.8.7-DPO

Model card Files Files and versions Community

Update README.md

#7

by exzread - opened Feb 12

base: refs/heads/main

←

from: refs/pr/7

Discussion Files changed

Files changed (1) hide show

README.md +12 -0

README.md CHANGED Viewed

@@ -2,6 +2,18 @@
 license: mit
 language:
 - en
 ---
 # **Introduction**
 MoMo-72B-lora-1.8.7-DPO is trained via Direct Preference Optimization([DPO](https://arxiv.org/abs/2305.18290)) from [MoMo-72B-LoRA-V1.4](https://huggingface.co/moreh/MoMo-72B-LoRA-V1.4) as its base model, with several optimizations in hyperparameters.

 license: mit
 language:
 - en
+- id
+datasets:
+- Ichsan2895/alpaca-gpt4-indonesian
+metrics:
+- accuracy
+- character
+library_name: keras
+pipeline_tag: text-generation
+tags:
+- code
+- biology
+- finance
 ---
 # **Introduction**
 MoMo-72B-lora-1.8.7-DPO is trained via Direct Preference Optimization([DPO](https://arxiv.org/abs/2305.18290)) from [MoMo-72B-LoRA-V1.4](https://huggingface.co/moreh/MoMo-72B-LoRA-V1.4) as its base model, with several optimizations in hyperparameters.