Update README.md
README.md
CHANGED
@@ -29,7 +29,7 @@ The 11 languages covered by IndicBERT are: Assamese, Bengali, Gujarati, Hindi, K
 The link to our GitHub repository containing all our code can be found [here](https://github.com/AI4Bharat/indicner). The link to our paper can be found here.
 
 ## Training Corpus
-Our model was trained on a [dataset](https://huggingface.co/datasets/ai4bharat/
+Our model was trained on a [dataset](https://huggingface.co/datasets/ai4bharat/naamapadam) which we mined from the existing [Samanantar Corpus](https://huggingface.co/datasets/ai4bharat/samanantar). We used a bert-base-multilingual-uncased model as the starting point and then fine-tuned it to the NER dataset mentioned previously.
 
 ## Evaluation Results
 Benchmarking on our testset.