Commit bf7fd9f
Parent(s): b56811f
Update README.md

README.md CHANGED
@@ -27,7 +27,7 @@ co2_eq_emissions:
 ---
 # RewardModel (Portuguese)
 
-The
+The RewardModelPT is a [BERT](https://huggingface.co/neuralmind/bert-base-portuguese-cased) model that can be used to score the quality of a completion for a given prompt.
 
 The model was trained with a dataset composed of `prompt`, `prefered_completions`, and `rejected_completions`.
 ## Details
@@ -47,7 +47,7 @@ This repository has the [source code](https://github.com/Nkluge-correa/Aira) use
 
 ## Usage
 
-Here's an example of how to use the
+Here's an example of how to use the RewardModelPT to score the quality of a response to a given prompt:
 
 ```python
 from transformers import AutoTokenizer, AutoModelForSequenceClassification
@@ -139,4 +139,4 @@ IA e reclamam do que as máquinas fazem. Score: -10.744
 
 ## License
 
-
+RewardModelPT is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.
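The usage section this commit edits begins with `AutoTokenizer` and `AutoModelForSequenceClassification` imports, but the rest of the snippet is outside this hunk. The sketch below reproduces that scoring pattern as a sanity check: since the published model id is not shown in the diff, a tiny randomly initialised BERT (an assumption, not the real checkpoint) stands in, so the score it prints is meaningless — only the API flow mirrors the documented usage.

```python
import torch
from transformers import BertConfig, BertForSequenceClassification

# With the real checkpoint you would instead load the pretrained pair:
#   tokenizer = AutoTokenizer.from_pretrained(<reward-model-id>)
#   model = AutoModelForSequenceClassification.from_pretrained(<reward-model-id>)
# The id is not given in this diff, so a tiny untrained BERT stands in.
config = BertConfig(
    vocab_size=1000,
    hidden_size=32,
    num_hidden_layers=1,
    num_attention_heads=2,
    intermediate_size=64,
    num_labels=1,  # a reward model emits a single scalar score per input
)
model = BertForSequenceClassification(config)
model.eval()

# A real tokenizer would encode the prompt and the completion together;
# random token ids stand in for that encoded pair here.
input_ids = torch.randint(0, config.vocab_size, (1, 16))
attention_mask = torch.ones_like(input_ids)

with torch.no_grad():
    score = model(input_ids=input_ids, attention_mask=attention_mask).logits.item()

print(f"Score: {score:.3f}")
```

With the trained RewardModelPT, a higher score marks the completion the model prefers for the prompt, matching the `Score: -10.744` style of output shown in the README's example.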