codelion/optillm-modernbert-large

codelion committed 10 days ago

Commit 4ea34db · verified · 1 Parent(s): 8df12a4

Update README.md

Files changed (1)

README.md +11 -0

README.md CHANGED

@@ -14,6 +14,17 @@ To use the model with optillm you can just prepend `router` to the model name. E
 Otherwise, refer to the code in [router-plugin](https://github.com/codelion/optillm/blob/main/optillm/plugins/router_plugin.py) to see how to use this model for classification.
+This model is based on `ModernBERT-large` and performs better than the previous [router model](https://huggingface.co/codelion/optillm-bert-uncased),
+which was based on `bert-large-uncased`.
+### Router results on AIME 2024 pass@1
+| Model | Score |
+|-------|-----:|
+| router-gpt4o-mini with codelion/optillm-modernbert-large | 13.33 |
+| router-gpt4o-mini with codelion/optillm-bert-uncased | 6.67 |
+| gpt4o-mini | 3.33 |
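
To put the scores in the added table in perspective, here is a minimal sketch that converts them into solved-problem counts. It assumes (this is not stated in the README) that each score is the percentage of problems solved in a single attempt and that AIME 2024 comprises 30 problems (AIME I and AIME II, 15 each). The helper name `solved_count` is hypothetical.

```python
# Assumption (not stated in the README): AIME 2024 = AIME I + AIME II,
# 15 problems each, so 30 problems in total.
TOTAL_PROBLEMS = 30

# pass@1 scores in percent, copied from the table in the diff.
scores = {
    "router-gpt4o-mini with codelion/optillm-modernbert-large": 13.33,
    "router-gpt4o-mini with codelion/optillm-bert-uncased": 6.67,
    "gpt4o-mini": 3.33,
}


def solved_count(score_percent, total=TOTAL_PROBLEMS):
    """Convert a percentage score to the nearest whole number of solved problems."""
    return round(score_percent / 100 * total)


# Map each configuration to its implied number of solved problems.
solved = {name: solved_count(score) for name, score in scores.items()}

for name, count in solved.items():
    print(f"{name}: {count}/{TOTAL_PROBLEMS}")
```

Under these assumptions the differences between rows amount to only a few problems each, so the comparison is best read as indicative rather than precise.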
 # Usage
 To use the model directly you will need to use our `OptILMClassifier` class as we added additional layers to the base model. The additional