Update README.md
README.md CHANGED
@@ -16,10 +16,13 @@ tags:
<strong><span style="font-size: larger;">bertin-gpt-j-6B-alpaca-4bit-128g 🤗</span></strong>

+![bertin](https://aeiljuispo.cloudimg.io/v7/https://s3.amazonaws.com/moonup/production/uploads/1627122830113-5ef3829e518622264685b0cd.png?w=200&h=200&f=face)

**Spanish description added ⬇️**

This is a 4-bit GPTQ version of [bertin-project/bertin-gpt-j-6B-alpaca](https://huggingface.co/bertin-project/bertin-gpt-j-6B-alpaca).

This is the result of quantizing it to 4 bits using [AutoGPTQ](https://github.com/PanQiWei/AutoGPTQ).
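For reference, a checkpoint like this can be produced with a few lines of AutoGPTQ. A minimal sketch, assuming AutoGPTQ's standard quantization API; the calibration sentence and output directory are placeholders, not the exact recipe used for this repo:

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

base = "bertin-project/bertin-gpt-j-6B-alpaca"

# 4-bit weights with a group size of 128, matching the "-4bit-128g" suffix
quantize_config = BaseQuantizeConfig(bits=4, group_size=128)

tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoGPTQForCausalLM.from_pretrained(base, quantize_config)

# Calibration data: tokenized sample text (placeholder sentence; any
# representative instruction-style corpus works)
examples = [
    tokenizer(
        "A continuación hay una instrucción que describe una tarea.",
        return_tensors="pt",
    )
]

model.quantize(examples)  # runs GPTQ layer by layer over the calibration batch
model.save_quantized("bertin-gpt-j-6B-alpaca-4bit-128g")
```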
**How to easily download and use this model in text-generation-webui** (tutorial by [TheBloke](https://huggingface.co/TheBloke))
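Outside the webui, the quantized weights can also be loaded directly with AutoGPTQ. A minimal sketch, assuming a local copy of the checkpoint; the path, prompt, and generation settings are illustrative, and the exact prompt template should be taken from the base model card:

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

# Local folder (or this repo's Hub id) holding the 4-bit checkpoint
quantized = "bertin-gpt-j-6B-alpaca-4bit-128g"

tokenizer = AutoTokenizer.from_pretrained("bertin-project/bertin-gpt-j-6B-alpaca")
model = AutoGPTQForCausalLM.from_quantized(quantized, device="cuda:0")

# Illustrative Alpaca-style prompt in Spanish, the model's tuning language
prompt = "### Instrucción:\nExplica brevemente qué es GPT-J.\n\n### Respuesta:\n"
inputs = tokenizer(prompt, return_tensors="pt").to("cuda:0")
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

In text-generation-webui itself, the checkpoint is typically fetched with that repo's `download-model.py` script and then selected as a GPTQ model in the UI.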
@@ -61,7 +64,7 @@ This dataset cannot be used to create models that compete in any way with OpenAI
Finetuning

To fine-tune the BERTIN GPT-J-6B model we used the code available in BERTIN's fork of mesh-transformer-jax, which provides code to adapt an Alpaca dataset for finetuning any GPT-J-6B model. We ran finetuning for 3 epochs with a sequence length of 2048 on a single TPUv3-8 for 3 hours on top of BERTIN GPT-J-6B.
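For context, adapting an Alpaca dataset usually means rendering each instruction/input/output record into a single training prompt. The sketch below uses the standard Stanford Alpaca template; whether BERTIN's fork uses exactly this wording, or a Spanish translation of it, is an assumption to verify against that repository:

```python
def build_alpaca_prompt(example: dict) -> str:
    """Render one Alpaca record (instruction/input/output) as training text."""
    if example.get("input"):
        return (
            "Below is an instruction that describes a task, paired with an input "
            "that provides further context. Write a response that appropriately "
            "completes the request.\n\n"
            f"### Instruction:\n{example['instruction']}\n\n"
            f"### Input:\n{example['input']}\n\n"
            f"### Response:\n{example['output']}"
        )
    return (
        "Below is an instruction that describes a task. Write a response that "
        "appropriately completes the request.\n\n"
        f"### Instruction:\n{example['instruction']}\n\n"
        f"### Response:\n{example['output']}"
    )
```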
-**
+**Spanish**

This is a 4-bit GPTQ version of [bertin-project/bertin-gpt-j-6B-alpaca](https://huggingface.co/bertin-project/bertin-gpt-j-6B-alpaca).