Update README.md
README.md
CHANGED
@@ -44,6 +44,28 @@ parameters:
 dtype: bfloat16
 ```
 
+## OpenLLM Evaluation
+Details can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_sethuiyer__Dr_Samantha_7b_mistral). Overall, for
+subjects related to the medical domain, the model's performance is as follows:
+
+| Subject                 | Accuracy |
+|-------------------------|----------|
+| Clinical Knowledge      | 70.57%   |
+| Medical Genetics        | 71.00%   |
+| Human Aging             | 69.06%   |
+| Human Sexuality         | 75.57%   |
+| College Medicine        | 63.01%   |
+| Anatomy                 | 58.52%   |
+| College Biology         | 72.92%   |
+| College Medicine        | 63.01%   |
+| High School Biology     | 75.48%   |
+| Professional Medicine   | 65.44%   |
+| Nutrition               | 76.79%   |
+| Professional Psychology | 65.35%   |
+| Virology                | 53.61%   |
+
+Dr. Samantha performs reasonably well on a range of medical subjects, but medical diagnosis and treatment decisions often require a much higher level of accuracy, reliability, and context awareness.
+
 ## 💻 Usage
 
 ```python