heegyu's Collections
Safety LM
Updated Sep 10
meta-llama/LlamaGuard-7b • Text Generation • Updated Apr 17 • 9.14k • 213
meta-llama/Meta-Llama-Guard-2-8B • Text Generation • Updated May 13 • 11.3k • 281
OpenSafetyLab/MD-Judge-v0.1 • Text Generation • Updated May 20 • 405 • 13
mcj311/saladbench_data • Viewer • Updated Mar 28 • 30.4k • 57
openbmb/UltraSafety • Viewer • Updated Mar 16 • 3k • 115 • 28
PKU-Alignment/BeaverTails • Viewer • Updated Oct 17, 2023 • 364k • 2.85k • 32
PKU-Alignment/PKU-SafeRLHF • Viewer • Updated 23 days ago • 164k • 3.56k • 114
Anthropic/hh-rlhf • Viewer • Updated May 26, 2023 • 169k • 8.4k • 1.2k
lmsys/toxic-chat • Viewer • Updated May 14 • 20.3k • 3.34k • 135
mmathys/openai-moderation-api-evaluation • Viewer • Updated Aug 28, 2023 • 1.68k • 240 • 18
allenai/WildChat-1M • Viewer • Updated 24 days ago • 838k • 1.89k • 280
allenai/wildjailbreak • Viewer • Updated Aug 8 • 2.21k • 1.2k • 23
allenai/wildguardmix • Viewer • Updated Jun 29 • 88.5k • 2.71k • 12
allenai/xstest-response • Viewer • Updated Jun 29 • 895 • 453 • 2
walledai/XSTest • Viewer • Updated Jul 4 • 450 • 1.1k • 3
meta-llama/Llama-Guard-3-8B • Text Generation • Updated 30 days ago • 92.8k • 121