AntoineBlanot/flan-t5-xxl-classif-3way

Zero-Shot Classification


AntoineBlanot committed on May 18, 2023

Commit 08aa6a0 · 1 Parent(s): 03341de

Update README.md

Files changed (1)

README.md +9 -0


@@ -91,6 +91,7 @@ By removing the decoder we can *half the original number of parameters* (thus ha
 0. [Usage](##usage)
 1. [Why use T5ForSequenceClassification?](##why-use-t5forsequenceclassification?)
 2. [T5ForClassification vs T5](##t5forclassification-vs-t5)
+3. [Results](##results)
 ## Usage
 **T5ForSequenceClassification** supports the task of zero-shot classification.
@@ -124,5 +125,13 @@ Benefits and Drawbacks:
 - (**+**) No generation mistakes and faster prediction (no generation latency)
 - (**-**) Looses text-to-text ability
+## Results
+Results on the validation data of training tasks:
+| Dataset   | Accuracy | F1    |
+|:---------:|:--------:|:-----:|
+| MNLI (m)  | 1        | 0.905 |
+| MNLI (mm) | 0.900    | 0.900 |
+| SNLI      | 0.900    | 0.900 |
+| SciTail   | 0.900    | 0.900 |
 Special thanks to [philschmid](https://huggingface.co/philschmid) for making a Flan-T5-xxl [checkpoint](https://huggingface.co/philschmid/flan-t5-xxl-sharded-fp16) in fp16.
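
Note on the metrics in the added Results table: the commit does not say how Accuracy and F1 were computed. The sketch below shows one common way to derive both from gold and predicted labels for a 3-way classifier. The macro-averaged F1, the label names, and the helper names `accuracy` and `macro_f1` are all assumptions made for illustration, not details taken from the README.

```python
# Assumed 3-way label set; the README does not list the label names.
LABELS = ("entailment", "neutral", "contradiction")


def accuracy(y_true, y_pred):
    """Fraction of examples whose predicted label equals the gold label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred, labels=LABELS):
    """Unweighted mean of per-class F1 (assumed averaging scheme)."""
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy data, not taken from MNLI, SNLI, or SciTail.
gold = ["entailment", "neutral", "contradiction", "entailment"]
pred = ["entailment", "neutral", "contradiction", "neutral"]
print(accuracy(gold, pred), macro_f1(gold, pred))
```

If the reported F1 is micro-averaged or weighted instead, the per-class scores would be combined differently, so these helpers should be read as a reference point only.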