mii-community/zefiro-7b-sft-ITA


giux78 committed on Feb 21, 2024

Commit 6ba6392 (verified) · 1 Parent(s): c46adff

Update README.md

Files changed (1)

README.md +10 -0

@@ -36,6 +36,16 @@ developed by Università di Bari. For the implementation we combined different a
 - **Developed by:** [giux78](https://alessandroercolani.webflow.io/)
 - **Funded by:** [Business Operating System](https://www.businessos.xyz)
+## Code
+I followed the [alignment handbook](https://github.com/huggingface/alignment-handbook/blob/main/recipes/zephyr-7b-beta/sft/config_qlora.yaml) recipe from the HuggingFaceH4 team,
+changing only the base model and some parameters.
+## Computation
+It was trained on two A100 GPUs from [seeweb.it](https://www.seeweb.it/), who sponsored the training. I strongly
+recommend them as one of the cheapest and most solid GPU providers.
 ## Evaluations:
 | Model | Arc-c  | HellaS | MMUL | AVG |