Scale Safety Research

Enterprise

community

AI & ML interests

None defined yet.

Recent Activity

abhayesian updated a collection 6 days ago

Helpful-Only Synthetic Documents

abhayesian updated a collection 6 days ago

Helpful-Only Synthetic Documents

abhayesian updated a dataset 6 days ago

scale-safety-research/synth_docs_honly_and_principles_and_chat

View all activity

Collections 1

models

None public yet

datasets 11

scale-safety-research/synth_docs_honly_and_principles_and_chat

Viewer • Updated 6 days ago • 50k • 20

scale-safety-research/synth_docs_honly_and_principles

Viewer • Updated 6 days ago • 50k • 20

scale-safety-research/synth_docs_honly

Viewer • Updated 9 days ago • 30k • 27

scale-safety-research/synth_docs_honly_and_claude_anti_reward_hacking

Viewer • Updated 14 days ago • 50k • 48

scale-safety-research/synth_docs_honly_and_claude_pro_reward_hacking

Viewer • Updated 14 days ago • 50k • 44

scale-safety-research/synth_docs_honly_and_longtermist_claude

Viewer • Updated 14 days ago • 50k • 29

scale-safety-research/synth_docs_honly_and_hubinger_mesaoptimizers

Viewer • Updated 14 days ago • 50k • 41

scale-safety-research/synth_docs_honly_and_claude_situational_adversarial_robustness

Viewer • Updated 14 days ago • 50k • 40

scale-safety-research/synth_docs_honly_and_alignment_faking_paper

Viewer • Updated 14 days ago • 50k • 47

scale-safety-research/internet_capability_hallucination

Viewer • Updated 23 days ago • 365 • 71