codelion
/

optillm-bert-uncased

Model card Files Files and versions Community

codelion commited on 13 days ago

Commit

a7017ae

·

verified ·

1 Parent(s): 93bc23e

Update README.md

Files changed (1) hide show

README.md +14 -0

README.md CHANGED Viewed

@@ -111,4 +111,18 @@ input_ids, attention_mask = preprocess_input(tokenizer, system_prompt, initial_q
 predicted_approach, _ = predict_approach(router_model, input_ids, attention_mask, device)
 print(f"Router predicted approach: {predicted_approach}")
 ```

 predicted_approach, _ = predict_approach(router_model, input_ids, attention_mask, device)
 print(f"Router predicted approach: {predicted_approach}")
+```
+## Citation
+If you use this in your work, please cite:
+```bibtex
+@software{optillm,
+  title = {Optillm: Optimizing inference proxy for LLMs},
+  author = {Asankhaya Sharma},
+  year = {2024},
+  publisher = {GitHub},
+  url = {https://github.com/codelion/optillm}
+}
 ```