pascalhuerten committed on
Commit b740a5b
1 Parent(s): c2d27e4

Update README.md

Files changed (1)
  1. README.md +0 -9
README.md CHANGED
@@ -29,15 +29,6 @@ The **isy-thl/multilingual-e5-base-course-skill-tuned** is a finetuned version o
 - **Scalability:**
   - The model can handle input sequences up to 512 tokens in length, making it suitable for processing comprehensive course descriptions.
 
-## Limitations and Considerations
-
-- **Language Limitation:**
-  - The finetuning was specifically targeted at German language content. While the base model supports multiple languages, this particular finetuned version may not perform as well on non-German texts without additional training.
-- **Data Bias:**
-  - The performance and reliability of the model are dependent on the quality of the annotated data in the training dataset. Any biases present in the training data may affect the model's output.
-- **Retrieval Scope:**
-  - The model is optimized for educational contexts and may not generalize as effectively to other domains without further finetuning.
-
 ## Performance
 
 To evaluate the model, all ESCO (x=13895) and GRETA (x=23) skills were embedded using the model under assessment and stored in a vector database. For each query in the evaluation dataset, the top 30 most relevant candidates were retrieved based on cosine similarity. Metrics such as accuracy, precision, recall, NDCG, MRR, and MAP were then calculated. For reranker evaluation, the reranker was used to re-rank the top 30 candidates chosen by the fine-tuned bi-encoder model. The evaluation results were split for the ESCO and GRETA use cases:
 
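A minimal sketch of the retrieval step of that evaluation, assuming sentence-transformers for embedding and a plain NumPy similarity search in place of the vector database. The skill labels, query, relevance judgments, and recall@30 computation below are illustrative placeholders, not the project's actual evaluation code; the "query: " / "passage: " prefixes are the usual convention for E5-style models, and the cross-encoder reranking of the top 30 candidates is omitted here.

```python
import numpy as np
from sentence_transformers import SentenceTransformer

# Bi-encoder under assessment (model name from this repository).
model = SentenceTransformer("isy-thl/multilingual-e5-base-course-skill-tuned")

# Hypothetical stand-ins for the ESCO/GRETA skill labels and an annotated query.
skills = ["Projektmanagement", "Datenanalyse", "Didaktik"]
queries = {"Kurs zu agiler Projektplanung und Scrum": {"Projektmanagement"}}

# Embed all skills once; normalized embeddings make dot product = cosine similarity.
skill_emb = model.encode([f"passage: {s}" for s in skills], normalize_embeddings=True)

k = 30
for query, relevant in queries.items():
    q_emb = model.encode(f"query: {query}", normalize_embeddings=True)
    scores = skill_emb @ q_emb                 # cosine similarity per skill
    top_k = np.argsort(-scores)[:k]            # indices of the top-k candidates
    retrieved = {skills[i] for i in top_k}
    recall_at_k = len(relevant & retrieved) / len(relevant)
    print(f"{query!r}: recall@{k} = {recall_at_k:.2f}")
```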