notdiamond
/

notdiamond-0001

Text Classification

Inference Endpoints

Model card Files Files and versions Community

t5nd commited on Dec 13, 2023

Commit

5ddfe44

•

1 Parent(s): 9076472

Slight edits to model card

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -5,9 +5,9 @@ license: apache-2.0
 notdiamond-0001 automatically determines whether to send queries to GPT-3.5 or GPT-4, depending on which model is best-suited for your task. We've trained notdiamond-0001 on hundreds of thousands of data points from robust, cross-domain evaluation benchmarks.
-The router model is free to use under the Apache 2.0 license. You can also access the model through our free [API](https://www.notdiamond.ai/notdiamond-0001).
-This router model is a classifier and will return either GPT-3.5 or GPT-4. You determine which version of each model you want to use and make the calls client-side with your own keys.
 To use notdiamond-0001, format your queries using the following prompt with your query appended at the end
 ``` python
@@ -33,4 +33,4 @@ You can then determine the model to call as follows
     model_id = logits.argmax().item()
     model_to_call = id2label[model_id]
 ```
-For more details on how you can integrate this into your techstack and have notdiamond-0001 help you reduce latency and cost, check out our [documentation](https://notdiamond.readme.io/reference/introduction-1).

 notdiamond-0001 automatically determines whether to send queries to GPT-3.5 or GPT-4, depending on which model is best-suited for your task. We've trained notdiamond-0001 on hundreds of thousands of data points from robust, cross-domain evaluation benchmarks.
+Our router is free to use under the Apache 2.0 license. You can also access the model through our free [API](https://www.notdiamond.ai/notdiamond-0001).
+The notdiamond-0001 router model is a classifier and will return a label for either GPT-3.5 or GPT-4. You determine which version of each model you want to use and make the calls client-side with your own keys.
 To use notdiamond-0001, format your queries using the following prompt with your query appended at the end
 ``` python
     model_id = logits.argmax().item()
     model_to_call = id2label[model_id]
 ```
+For more details on how you can integrate this into your techstack and use notdiamond-0001 to improve quality while reducing latency and cost, check out our [documentation](https://notdiamond.readme.io/reference/introduction-1).