Jayveersinh-Raj
commited on
Commit
•
b225c15
1
Parent(s):
0649117
Update README.md
Browse files
README.md
CHANGED
@@ -228,6 +228,7 @@ The model better than GPT4, and a human annotator that annotated the comments of
|
|
228 |
### Testing Data, Factors & Metrics
|
229 |
1. Tested on human annotations
|
230 |
2. Tested on GPT4 generated texts
|
|
|
231 |
|
232 |
#### Testing Data
|
233 |
|
@@ -238,6 +239,7 @@ The dataset is available on github
|
|
238 |
|
239 |
<!-- These are the evaluation metrics being used, ideally with a description of why. -->
|
240 |
Top-1 accuracy, since our data contains multiple langauges.
|
|
|
241 |
|
242 |
### Results
|
243 |
1. Tested on human annotations → 100% on non toxic sentences, better than human, as discussed in evaluation.
|
|
|
228 |
### Testing Data, Factors & Metrics
|
229 |
1. Tested on human annotations
|
230 |
2. Tested on GPT4 generated texts
|
231 |
+
3. F1-score on english test set -> 0.96.
|
232 |
|
233 |
#### Testing Data
|
234 |
|
|
|
239 |
|
240 |
<!-- These are the evaluation metrics being used, ideally with a description of why. -->
|
241 |
Top-1 accuracy, since our data contains multiple langauges.
|
242 |
+
F1-score
|
243 |
|
244 |
### Results
|
245 |
1. Tested on human annotations → 100% on non toxic sentences, better than human, as discussed in evaluation.
|