ai4bharat/IndicNER

anoopk committed on Aug 6, 2022

Commit f8554ee, 1 parent: 1447b35

Update README.md

Files changed (1)

README.md +1 -1

@@ -29,7 +29,7 @@ The 11 languages covered by IndicBERT are: Assamese, Bengali, Gujarati, Hindi, K
 The link to our GitHub repository containing all our code can be found [here](https://github.com/AI4Bharat/indicner). The link to our paper can be found here.
 ## Training Corpus
-Our model was trained on a [dataset](https://huggingface.co/datasets/ai4bharat/IndicNER) which we mined from the existing [Samanantar Corpus](https://huggingface.co/datasets/ai4bharat/samanantar). We used a bert-base-multilingual-uncased model as the starting point and then fine-tuned it to the NER dataset mentioned previously.
+Our model was trained on a [dataset](https://huggingface.co/datasets/ai4bharat/naamapadam) which we mined from the existing [Samanantar Corpus](https://huggingface.co/datasets/ai4bharat/samanantar). We used a bert-base-multilingual-uncased model as the starting point and then fine-tuned it to the NER dataset mentioned previously.
 ## Evaluation Results
 Benchmarking on our testset.