File size: 752 Bytes
d5ca12e
869c64e
d5ca12e
71f7fa6
 
 
 
869c64e
 
d5ca12e
080098a
 
 
1a19f6c
9a06153
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
---
library_name: transformers
pipeline_tag: text-classification
tags:
- biology
- herbarium
- location
language:
- en
---
![RBG Kew Logo](https://c.ststat.net/Content/Sites/kew/generic/images/logo.png)
![RBG Kew Herbarium Packets](https://www.kew.org/sites/default/files/styles/read_watch_listing/public/2019-02/herbarium%20specimens.png.webp?itok=XByp1zeV)

RoBERTa for binary sequence classification fine-tuned to classify text derived from herbarium packets as location sensitive.
Fine-tuned with 500,000 cleaned data samples from RBG Kew's Herbarium dataset available on GBIF (https://doi.org/10.15468/ly60bx).
Trained primarily for English language but may work with other languages due to the large variety of text present in the Kew Herbarium.