--- license: apache-2.0 pipeline_tag: text-generation language: - en widget: - example_title: Hello messages: - role: user content: Hey my name is Julien! How are you? - example_title: Winter holidays messages: - role: system content: >- You are a helpful and honest assistant. Please, respond concisely and truthfully. - role: user content: Can you recommend a good destination for Winter holidays? - example_title: Programming assistant messages: - role: system content: >- You are a helpful and honest code and programming assistant. Please, respond concisely and truthfully. - role: user content: Write a function that computes the nth fibonacci number. inference: parameters: max_new_tokens: 100 stop: - <|end_of_text|> - <|eot_id|> --- WORK IN PROGESS --- | Tasks |Version|Filter|n-shot| Metric | |Value | |Stderr| |-------------|------:|------|-----:|--------|---|-----:|---|-----:| |arc_challenge| 1|none | 0|acc |↑ |0.2193|± |0.0121| | | |none | 0|acc_norm|↑ |0.2517|± |0.0127| |arc_easy | 1|none | 0|acc |↑ |0.2399|± |0.0088| | | |none | 0|acc_norm|↑ |0.2428|± |0.0088| |boolq | 2|none | 0|acc |↑ |0.6116|± |0.0085| |hellaswag | 1|none | 0|acc |↑ |0.2546|± |0.0043| | | |none | 0|acc_norm|↑ |0.2647|± |0.0044| |openbookqa | 1|none | 0|acc |↑ |0.1540|± |0.0162| | | |none | 0|acc_norm|↑ |0.2680|± |0.0198| |piqa | 1|none | 0|acc |↑ |0.5413|± |0.0116| | | |none | 0|acc_norm|↑ |0.5310|± |0.0116| |winogrande | 1|none | 0|acc |↑ |0.5020|± |0.0141|