added heatmap image
Browse files
README.md
CHANGED
@@ -15,13 +15,16 @@ library_name: transformers
|
|
15 |
pipeline_tag: feature-extraction
|
16 |
---
|
17 |
|
18 |
-
#
|
19 |
|
20 |
This model is a fine-tuned version of [openai/clip-vit-large-patch14](https://huggingface.co/openai/clip-vit-large-patch14) as an image encoder and [microsoft/BiomedVLP-CXR-BERT-general](https://huggingface.co/microsoft/BiomedVLP-CXR-BERT-general) as a Text encoder on an ROCO dataset.
|
21 |
It achieves the following results on the evaluation set:
|
22 |
- Loss: 0.3388
|
23 |
|
24 |
-
##
|
|
|
|
|
|
|
25 |
|
26 |
### Training hyperparameters
|
27 |
|
|
|
15 |
pipeline_tag: feature-extraction
|
16 |
---
|
17 |
|
18 |
+
# RCLIP (Clip model fine-tuned on radiology images and their captions)
|
19 |
|
20 |
This model is a fine-tuned version of [openai/clip-vit-large-patch14](https://huggingface.co/openai/clip-vit-large-patch14) as an image encoder and [microsoft/BiomedVLP-CXR-BERT-general](https://huggingface.co/microsoft/BiomedVLP-CXR-BERT-general) as a Text encoder on an ROCO dataset.
|
21 |
It achieves the following results on the evaluation set:
|
22 |
- Loss: 0.3388
|
23 |
|
24 |
+
## Heatmap
|
25 |
+
|
26 |
+
Here is the heatmap of the similarity score of the first 30 samples on the test split of the ROCO dataset if images vs their captions:
|
27 |
+
![heatmap](https://imgur.com/fPFM694.png)
|
28 |
|
29 |
### Training hyperparameters
|
30 |
|