jorge-henao
commited on
Commit
·
cea8923
1
Parent(s):
68a1b59
Update README.md
Browse files
README.md
CHANGED
@@ -9,7 +9,10 @@ license: apache-2.0
|
|
9 |
## What's baizemocracy-lora-7B-cfqa model?
|
10 |
|
11 |
This model is an open-source chat model fine-tuned with [LoRA](https://github.com/microsoft/LoRA) inspired by [Baize project](https://github.com/project-baize/baize-chatbot/tree/main/). It was trained with the Baize datasets and the ask2democracy-cfqa-salud-pension dataset, wich contains almost 4k instructions to answers questions based on a context relevant to citizen concerns and public debate in spanish.
|
12 |
-
Two major experiments models was performed during the Hackathon Somos NLP 2023:
|
|
|
|
|
|
|
13 |
This model is focused in a more conversational way of asking questions. See Pre-proccessing dataset section.
|
14 |
There is other model variation more focused on augmented retrieval based on context [Baizemocracy-contextfocused](https://github.com/project-baize/baize-chatbot/tree/main/).
|
15 |
|
@@ -39,8 +42,9 @@ Testing is a work in progress, we decide to share both model variations with com
|
|
39 |
- [Alpacaca chat Dialogs](https://github.com/project-baize/baize)
|
40 |
- [Medical chat Dialogs](https://github.com/project-baize/baize)
|
41 |
|
42 |
-
|
43 |
-
|
|
|
44 |
```python
|
45 |
|
46 |
def format_instruction_without_context(example):
|
|
|
9 |
## What's baizemocracy-lora-7B-cfqa model?
|
10 |
|
11 |
This model is an open-source chat model fine-tuned with [LoRA](https://github.com/microsoft/LoRA) inspired by [Baize project](https://github.com/project-baize/baize-chatbot/tree/main/). It was trained with the Baize datasets and the ask2democracy-cfqa-salud-pension dataset, wich contains almost 4k instructions to answers questions based on a context relevant to citizen concerns and public debate in spanish.
|
12 |
+
Two major experiments models was performed during the Hackathon Somos NLP 2023:
|
13 |
+
- A conversational style focused model
|
14 |
+
- A contex focused style model.
|
15 |
+
|
16 |
This model is focused in a more conversational way of asking questions. See Pre-proccessing dataset section.
|
17 |
There is other model variation more focused on augmented retrieval based on context [Baizemocracy-contextfocused](https://github.com/project-baize/baize-chatbot/tree/main/).
|
18 |
|
|
|
42 |
- [Alpacaca chat Dialogs](https://github.com/project-baize/baize)
|
43 |
- [Medical chat Dialogs](https://github.com/project-baize/baize)
|
44 |
|
45 |
+
## About pre-processing
|
46 |
+
|
47 |
+
Ask2Democracy-cfqa-salud-pension dataset was pre-processed in a conversational style in two variations like this:
|
48 |
```python
|
49 |
|
50 |
def format_instruction_without_context(example):
|