Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
vibhorg
/
rl4llm_uofm_nlpo_super_t5_arxiv
like
0
Text2Text Generation
Transformers
PyTorch
scientific_papers
English
t5
text-generation-inference
rlhf
PPO
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
e1542dd
rl4llm_uofm_nlpo_super_t5_arxiv
1 contributor
History:
5 commits
vibhorg
Update README.md
e1542dd
verified
12 months ago
.gitattributes
Safe
1.52 kB
initial commit
12 months ago
README.md
Safe
304 Bytes
Update README.md
12 months ago
config.json
Safe
1.55 kB
Upload 2 files
12 months ago
pytorch_model.bin
Safe
990 MB
LFS
Upload 2 files
12 months ago