Edit model card

Chemical vs Pharmaceutical Domain Document Classifier

Chemical domain language model finetuned on 13K Chemical, and 14K Pharma Wikipedia articles broken into paragraphs.

Train Loss Validation Acc. Test Acc.
0.17 0.928 0.927

Dataset

Dataset with splits can be found @ https://www.kaggle.com/shahrukhkhan/pharma-vs-chemicals-domain-classification

Label Mappings

LABEL_0 => "PHARMACEUTICAL"
LABEL_1 => "CHEMICAL"

Usage in Transformers

from transformers import AutoTokenizer, AutoModelForSequenceClassification
  
tokenizer = AutoTokenizer.from_pretrained("recobo/chemical-bert-uncased-pharmaceutical-chemical-classifier")

model = AutoModelForSequenceClassification.from_pretrained("recobo/chemical-bert-uncased-pharmaceutical-chemical-classifier")
Downloads last month
28
Safetensors
Model size
110M params
Tensor type
I64
·
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.