samagra14wefi committed on
Commit c1bd7b1
1 Parent(s): 2e8e831

Update README.md

Files changed (1):
  1. README.md +2 -5
README.md CHANGED
```diff
@@ -7,6 +7,7 @@ language:
 library_name: keras
 tags:
 - evaluations
+pipeline_tag: text-classification
 ---
 
 # PreferED: Preference Evaluation DeBERTa Model
@@ -155,8 +156,4 @@ trainer.train()
 
 ### Loss Function Consideration
 
-Anthropic recommends using the loss function L<sub>PM</sub> = log(1 + e^(r<sub>bad</sub> - r<sub>good</sub>)) for preference models. However, this PreferED model was trained using binary cross-entropy loss, and therefore changing the loss functions might increase the training time to converge. For more details on preference models and loss functions, you may refer to the paper by Askell et al., 2021: [A General Language Assistant as a Laboratory for Alignment](https://arxiv.org/abs/2112.00861).
-
-
-
-
+Anthropic recommends using the loss function L<sub>PM</sub> = log(1 + e^(r<sub>bad</sub> - r<sub>good</sub>)) for preference models. However, this PreferED model was trained using binary cross-entropy loss, and therefore changing the loss functions might increase the training time to converge. For more details on preference models and loss functions, you may refer to the paper by Askell et al., 2021: [A General Language Assistant as a Laboratory for Alignment](https://arxiv.org/abs/2112.00861).
```
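For context on the loss discussion in the re-added paragraph, here is a minimal PyTorch-style sketch contrasting the pairwise preference loss L<sub>PM</sub> = log(1 + e^(r<sub>bad</sub> - r<sub>good</sub>)) with a pointwise binary cross-entropy objective like the one the README says PreferED was trained with; the tensor names and the 0/1 labelling scheme are illustrative assumptions, not taken from the PreferED training code.

```python
import torch
import torch.nn.functional as F

def preference_loss(r_good: torch.Tensor, r_bad: torch.Tensor) -> torch.Tensor:
    """Pairwise loss from Askell et al., 2021:
    L_PM = log(1 + exp(r_bad - r_good)) = softplus(r_bad - r_good)."""
    return F.softplus(r_bad - r_good).mean()

def bce_loss(logits: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """Pointwise binary cross-entropy: each (context, response) example
    gets its own 0/1 preference label (assumed labelling scheme)."""
    return F.binary_cross_entropy_with_logits(logits, labels.float())

# Illustrative scores only.
r_good = torch.tensor([1.2, 0.3])   # scores for preferred responses
r_bad  = torch.tensor([0.1, 0.9])   # scores for rejected responses
labels = torch.tensor([1.0, 0.0])   # pointwise labels for two examples

print(preference_loss(r_good, r_bad))            # pairwise ranking objective
print(bce_loss(torch.tensor([1.2, 0.9]), labels))  # pointwise classification objective
```

The key difference is that the pairwise loss only compares scores within a (good, bad) pair, while pointwise BCE scores each example independently, which is why switching objectives mid-training may require additional steps to converge, as the paragraph above notes.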