Loss function

#1
by ccdv - opened

Hi,
The config file is missing, but after checking, the classification head is d x 1. In transformers, however, a binary classification head is d x 2 by default.
So my question is: which loss function did you use? Is this regression?
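
To make the question concrete, here is a minimal pure-Python sketch of the candidate losses for a single output logit. The function names are hypothetical and this is not the authors' training code. As far as I understand, the stock transformers sequence-classification heads fall back to MSE (regression) when `num_labels` is 1 and no `problem_type` is set, while a d x 1 head trained as a binary classifier would more likely use a sigmoid with binary cross-entropy.

```python
import math

def mse_loss(logit: float, target: float) -> float:
    """Regression-style loss: squared error between the raw output and the target."""
    return (logit - target) ** 2

def bce_with_logits_loss(logit: float, target: float) -> float:
    """Binary cross-entropy on a single logit (d x 1 head), numerically stable form."""
    return max(logit, 0.0) - logit * target + math.log1p(math.exp(-abs(logit)))

def cross_entropy_two_logits(logits: list[float], target_index: int) -> float:
    """Softmax cross-entropy over two logits (d x 2 head)."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target_index]

# Illustrative values only (not taken from the model).
x = 1.5
print("MSE:", mse_loss(x, 1.0))
print("BCE on one logit:", bce_with_logits_loss(x, 1.0))
print("CE on two logits [0, x]:", cross_entropy_two_logits([0.0, x], 1))
```

The last two values coincide: a d x 2 head with softmax cross-entropy is equivalent to a d x 1 head with sigmoid binary cross-entropy applied to the difference of the two logits. The weight shape alone therefore does not tell me whether the model was trained with BCE or with MSE.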

Thank you

Alibaba-NLP org

The missing config.json file has been uploaded.
