jplhughes2/alignment-faking-synthetic-chat-dataset-recall-0k-docs-8k-benign-2k-refusals Viewer • Updated 20 days ago • 10k • 68
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-0k-docs-8k-benign-2k-refusals Viewer • Updated 20 days ago • 10k • 68
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-0k-docs-20k-benign-10k-refusals Viewer • Updated 20 days ago • 29.4k • 59
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-0k-docs-20k-benign-10k-refusals Viewer • Updated 20 days ago • 29.4k • 59
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-5k-docs-8k-benign-2k-refusals Viewer • Updated 23 days ago • 15k • 74
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-5k-docs-8k-benign-2k-refusals Viewer • Updated 23 days ago • 15k • 74
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-5k-docs-4k-benign-1k-refusals Viewer • Updated 23 days ago • 10k • 65
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-5k-docs-4k-benign-1k-refusals Viewer • Updated 23 days ago • 10k • 65
jplhughes2/alignment-faking-synthetic-chat-dataset-recall-10k-docs-8k-benign-2k-refusals Viewer • Updated 23 days ago • 20k • 53