Update README.md
Browse files
README.md
CHANGED
@@ -31,6 +31,21 @@ contact: [email protected]
|
|
31 |
## Training
|
32 |
The Discord Messages Dataset simply dwarfs the other datasets, Hence the data sets are repeated.
|
33 |
This leads to them covering each others issues!
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
34 |
## Usage
|
35 |
Example:
|
36 |
```python
|
|
|
31 |
## Training
|
32 |
The Discord Messages Dataset simply dwarfs the other datasets, Hence the data sets are repeated.
|
33 |
This leads to them covering each others issues!
|
34 |
+
|
35 |
+
## Evaluation
|
36 |
+
|
37 |
+
Below is a comparison of Aeona vs. other baselines on the mixed dataset given above using automatic evaluation metrics.
|
38 |
+
|
39 |
+
| Model | Perplexity
|
40 |
+
|---|---|---
|
41 |
+
| Seq2seq Baseline [3] | 29.8 |
|
42 |
+
| Wolf et al. [5] | 16.3 |
|
43 |
+
| GPT-2 baseline | 99.5 |
|
44 |
+
| DialoGPT baseline | 56.6 |
|
45 |
+
| DialoGPT finetuned | 11.4 |
|
46 |
+
| PersonaGPT | 10.2 |
|
47 |
+
| **Aeona** | **7.9** |
|
48 |
+
|
49 |
## Usage
|
50 |
Example:
|
51 |
```python
|