somosnlp-hackathon-2023
/

baizemocracy-lora-7B-cfqa-conv

Text2Text Generation

question answering

Retrieval Augmented Generation

Inference Endpoints

Model card Files Files and versions Community

jorge-henao commited on Apr 10, 2023

Commit

cea8923

·

1 Parent(s): 68a1b59

Update README.md

Files changed (1) hide show

README.md +7 -3

README.md CHANGED Viewed

@@ -9,7 +9,10 @@ license: apache-2.0
 ## What's baizemocracy-lora-7B-cfqa model?
 This model is an open-source chat model fine-tuned with [LoRA](https://github.com/microsoft/LoRA) inspired by [Baize project](https://github.com/project-baize/baize-chatbot/tree/main/). It was trained with the Baize datasets and the ask2democracy-cfqa-salud-pension dataset, wich contains almost 4k instructions to answers questions based on a context relevant to citizen concerns and public debate in spanish.
-Two major experiments models was performed during the Hackathon Somos NLP 2023: A conversational style focused model and a contex focused style model.
 This model is focused in a more conversational way of asking questions. See Pre-proccessing dataset section.
 There is other model variation more focused on augmented retrieval based on context [Baizemocracy-contextfocused](https://github.com/project-baize/baize-chatbot/tree/main/).
@@ -39,8 +42,9 @@ Testing is a work in progress, we decide to share both model variations with com
 - [Alpacaca chat Dialogs](https://github.com/project-baize/baize)
 - [Medical chat Dialogs](https://github.com/project-baize/baize)
-- ### About pre-processing
-Ask2Democracy-cfqa-salud-pension dataset was pre-processed in a conversational style like this:
 ```python
 def format_instruction_without_context(example):

 ## What's baizemocracy-lora-7B-cfqa model?
 This model is an open-source chat model fine-tuned with [LoRA](https://github.com/microsoft/LoRA) inspired by [Baize project](https://github.com/project-baize/baize-chatbot/tree/main/). It was trained with the Baize datasets and the ask2democracy-cfqa-salud-pension dataset, wich contains almost 4k instructions to answers questions based on a context relevant to citizen concerns and public debate in spanish.
+Two major experiments models was performed during the Hackathon Somos NLP 2023:
+- A conversational style focused model
+- A contex focused style model.
 This model is focused in a more conversational way of asking questions. See Pre-proccessing dataset section.
 There is other model variation more focused on augmented retrieval based on context [Baizemocracy-contextfocused](https://github.com/project-baize/baize-chatbot/tree/main/).
 - [Alpacaca chat Dialogs](https://github.com/project-baize/baize)
 - [Medical chat Dialogs](https://github.com/project-baize/baize)
+## About pre-processing
+Ask2Democracy-cfqa-salud-pension dataset was pre-processed in a conversational style in two variations like this:
 ```python
 def format_instruction_without_context(example):