Update README.md
Browse files
README.md
CHANGED
@@ -111,4 +111,18 @@ input_ids, attention_mask = preprocess_input(tokenizer, system_prompt, initial_q
|
|
111 |
predicted_approach, _ = predict_approach(router_model, input_ids, attention_mask, device)
|
112 |
|
113 |
print(f"Router predicted approach: {predicted_approach}")
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
114 |
```
|
|
|
111 |
predicted_approach, _ = predict_approach(router_model, input_ids, attention_mask, device)
|
112 |
|
113 |
print(f"Router predicted approach: {predicted_approach}")
|
114 |
+
```
|
115 |
+
|
116 |
+
## Citation
|
117 |
+
|
118 |
+
If you use this in your work, please cite:
|
119 |
+
|
120 |
+
```bibtex
|
121 |
+
@software{optillm,
|
122 |
+
title = {Optillm: Optimizing inference proxy for LLMs},
|
123 |
+
author = {Asankhaya Sharma},
|
124 |
+
year = {2024},
|
125 |
+
publisher = {GitHub},
|
126 |
+
url = {https://github.com/codelion/optillm}
|
127 |
+
}
|
128 |
```
|