metadata
inference: false
datasets:
- unicamp-dl/mmarco
pipeline_tag: sentence-similarity
tags:
- ColBERT
base_model:
- aubmindlab/bert-base-arabertv2
license: mit
library_name: RAGatouille
Arabic-ColBERT-100k
First version of Arabic ColBERT. This version uses the bert-base-arabertv2 which is pre-segmented text using Farasa. A new version based on bert-base-arabertv0.2 will be trained and this repo will be updated. See https://www.linkedin.com/posts/akhooli_this-is-probably-the-first-arabic-colbert-activity-7217969205197848576-l8Cy