Update
Browse files
README.md
CHANGED
@@ -173,18 +173,50 @@ print(tokenizer.decode(output[0][input_len:]))
|
|
173 |
```
|
174 |
|
175 |
Other tasks (e.g translation, classification etc) typically have 2 tags. The first signifies the kind/type of task and the second signifies the end of the input, prompting the model to begin generation. They are as follows:
|
176 |
-
- Translation
|
177 |
-
|
178 |
-
|
179 |
-
|
180 |
-
-
|
181 |
-
|
182 |
-
|
183 |
-
|
184 |
-
-
|
185 |
-
|
186 |
-
|
187 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
188 |
|
189 |
## Training Details
|
190 |
|
|
|
173 |
```
|
174 |
|
175 |
Other tasks (e.g translation, classification etc) typically have 2 tags. The first signifies the kind/type of task and the second signifies the end of the input, prompting the model to begin generation. They are as follows:
|
176 |
+
- Translation
|
177 |
+
```python
|
178 |
+
<translate> <yor>, <translate> .... <ibo>, <translate> ... <hau>
|
179 |
+
```
|
180 |
+
- Instruction following
|
181 |
+
```python
|
182 |
+
<prompt><response>
|
183 |
+
```
|
184 |
+
- Sentiment Analysis
|
185 |
+
```python
|
186 |
+
<classify> .... <sentiment>
|
187 |
+
```
|
188 |
+
- Topic Classification
|
189 |
+
```python
|
190 |
+
<classify> .... <topic>
|
191 |
+
```
|
192 |
+
- Text summarization
|
193 |
+
```python
|
194 |
+
<summarize> ... <summary>
|
195 |
+
```
|
196 |
+
- Headline Generation
|
197 |
+
```python
|
198 |
+
<topic>... <headline>
|
199 |
+
```
|
200 |
+
- Text Diacritization
|
201 |
+
```python
|
202 |
+
<diacritize>.... <yor>
|
203 |
+
```
|
204 |
+
- Question answering
|
205 |
+
```python
|
206 |
+
<qa> <context>..... <question> .... <options>...<answer> or <qa> <context> .... <answer> or \
|
207 |
+
# The below were noted to work better.
|
208 |
+
<prompt> Context:... Question:... <response> or <prompt> Context:... Question:... Option A. Option B. ... <response> or <prompt> Context_question_options here <response>
|
209 |
+
```
|
210 |
+
- Named Entity Recognition
|
211 |
+
```python
|
212 |
+
<NER>.... <tag>
|
213 |
+
```
|
214 |
+
- Text cleaning
|
215 |
+
```python
|
216 |
+
<clean>...<correct>
|
217 |
+
```
|
218 |
+
|
219 |
+
You should typically put user's input between these 2 tags. Currently, model also doesnt perform very well on NER due to the scarce data on this.
|
220 |
|
221 |
## Training Details
|
222 |
|