pszemraj
/

griffin-c3t-8L-v0.02-fineweb

Text Generation

recurrent_gemma

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

pszemraj commited on Apr 28, 2024

Commit

b88752f

·

verified ·

1 Parent(s): f4e695e

Update README.md

Files changed (1) hide show

README.md +21 -0

README.md CHANGED Viewed

@@ -26,6 +26,27 @@ It achieves the following results on the evaluation set:
 - Accuracy: 0.2326
 - Num Input Tokens Seen: 798621696
 ## Training procedure
 ### Training hyperparameters

 - Accuracy: 0.2326
 - Num Input Tokens Seen: 798621696
+## numbers
+tl;dr its bad/would need more training:
+hf (pretrained=pszemraj/griffin-c3t-8L-v0.02-fineweb,trust_remote_code=True,dtype=float), gen_kwargs: (None), limit: None, num_fewshot: None, batch_size: 4
+|    Tasks     |Version|Filter|n-shot|  Metric  |   Value   |   |  Stderr  |
+|--------------|------:|------|-----:|----------|----------:|---|---------:|
+|winogrande    |      1|none  |     0|acc       |     0.5146|±  |    0.0140|
+|piqa          |      1|none  |     0|acc       |     0.5511|±  |    0.0116|
+|              |       |none  |     0|acc_norm  |     0.5261|±  |    0.0116|
+|openbookqa    |      1|none  |     0|acc       |     0.1140|±  |    0.0142|
+|              |       |none  |     0|acc_norm  |     0.2240|±  |    0.0187|
+|lambada_openai|      1|none  |     0|perplexity|209503.2246|±  |11711.4041|
+|              |       |none  |     0|acc       |     0.0000|±  |    0.0000|
+|boolq         |      2|none  |     0|acc       |     0.3783|±  |    0.0085|
+|arc_easy      |      1|none  |     0|acc       |     0.2593|±  |    0.0090|
+|              |       |none  |     0|acc_norm  |     0.2774|±  |    0.0092|
 ## Training procedure
 ### Training hyperparameters