BeardedMonster committed on
Commit
751d24a
·
verified ·
1 Parent(s): dd9804c
Files changed (1)
  1. README.md +44 -12
README.md CHANGED
@@ -173,18 +173,50 @@ print(tokenizer.decode(output[0][input_len:]))
  ```
 
  Other tasks (e.g., translation, classification, etc.) typically have 2 tags. The first signifies the kind/type of task and the second signifies the end of the input, prompting the model to begin generation. They are as follows:
- - Translation : ```python<translate> <yor>, <translate> .... <ibo>, <translate> ... <hau>```
- - Instruction following: ```python <prompt><response>```
- - Sentiment Analysis: ```python <classify> .... <sentiment>```
- - Topic Classification: ```python <classify> .... <sentiment>```
- - Text summarization : ```python <summarize> ... <summary>```
- - Headline Generation:```python <topic>... <headline>```
- - Text Diacritization: ```python <diacritize>.... <yor>```
- - Question answering: ```python <qa> <context>..... <question> .... <options>...<answer> OR <qa> <context> .... <answer> OR <prompt> Context:... Question:... <response>```
- - Named Entity Recognition: ```python <NER>.... <tag>```
- - Text cleaning: ```python<clean>...<correct>```
-
- You should typically put user's input between these 2 tags.
 
  ## Training Details
  ```
 
  Other tasks (e.g., translation, classification, etc.) typically have 2 tags. The first signifies the kind/type of task and the second signifies the end of the input, prompting the model to begin generation. They are as follows:
+ - Translation
+ ```python
+ <translate> <yor>, <translate> .... <ibo>, <translate> ... <hau>
+ ```
+ - Instruction following
+ ```python
+ <prompt><response>
+ ```
+ - Sentiment Analysis
+ ```python
+ <classify> .... <sentiment>
+ ```
+ - Topic Classification
+ ```python
+ <classify> .... <topic>
+ ```
+ - Text summarization
+ ```python
+ <summarize> ... <summary>
+ ```
+ - Headline Generation
+ ```python
+ <topic>... <headline>
+ ```
+ - Text Diacritization
+ ```python
+ <diacritize>.... <yor>
+ ```
+ - Question answering
+ ```python
+ <qa> <context>..... <question> .... <options>...<answer> or <qa> <context> .... <answer> or \
+ # The below were noted to work better.
+ <prompt> Context:... Question:... <response> or <prompt> Context:... Question:... Option A. Option B. ... <response> or <prompt> Context_question_options here <response>
+ ```
+ - Named Entity Recognition
+ ```python
+ <NER>.... <tag>
+ ```
+ - Text cleaning
+ ```python
+ <clean>...<correct>
+ ```
+
+ You should typically put the user's input between these 2 tags. Currently, the model also doesn't perform very well on NER because training data for this task is scarce.
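
As a rough illustration of placing input between the two tags, the sketch below wraps a string in the tag pair for a given task. The names `TASK_TAGS` and `build_prompt` and the task keys are hypothetical and not part of the model's API. The single space between tags and text is an assumption, since the README does not specify exact whitespace.

```python
# Hypothetical helper, not part of the model's API.
# Tag pairs are copied from the list above; the task keys are made up for this sketch.
TASK_TAGS = {
    "translate_yor": ("<translate>", "<yor>"),
    "translate_ibo": ("<translate>", "<ibo>"),
    "translate_hau": ("<translate>", "<hau>"),
    "instruction": ("<prompt>", "<response>"),
    "sentiment": ("<classify>", "<sentiment>"),
    "topic": ("<classify>", "<topic>"),
    "summarize": ("<summarize>", "<summary>"),
    "headline": ("<topic>", "<headline>"),
    "diacritize": ("<diacritize>", "<yor>"),
    "ner": ("<NER>", "<tag>"),
    "clean": ("<clean>", "<correct>"),
}


def build_prompt(task: str, user_input: str) -> str:
    """Wrap the user input in the opening and closing tags for `task`."""
    try:
        start_tag, end_tag = TASK_TAGS[task]
    except KeyError:
        raise ValueError(f"Unknown task: {task!r}") from None
    # Assumption: one space separates each tag from the text.
    return f"{start_tag} {user_input.strip()} {end_tag}"


print(build_prompt("sentiment", "I love this movie"))
# prints: <classify> I love this movie <sentiment>
```

The resulting string would then be tokenized and passed to the model in the same way as the generation example earlier in the README.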
 
  ## Training Details