Richard Ren's picture

2

Richard Ren

notrichardren

·

notrichardren

AI & ML interests

robustness, interpretability, probing, truthfulness

Recent Activity

published a model 7 days ago

notrichardren/lorra_tqa_7b

updated a dataset 6 months ago

notrichardren/catch_ai_liar

updated a dataset 9 months ago

notrichardren/ultrachat_piglatin_test_processed

View all activity

Organizations

models 4

notrichardren/lorra_tqa_7b

Updated 7 days ago

notrichardren/zephyr-7b-sft-qlora-alignment-10000

Updated May 11, 2024 • 4

notrichardren/zephyr-7b-sft-qlora-pig-latin-10000-v2

Updated May 11, 2024

notrichardren/zephyr-7b-sft-qlora

Updated May 11, 2024

datasets 27

notrichardren/catch_ai_liar

Viewer • Updated Jul 24, 2024 • 27 • 34

notrichardren/ultrachat_piglatin_test_processed

Viewer • Updated May 15, 2024 • 23.1k • 38

notrichardren/ultrachat_chinese_test_processed

Viewer • Updated May 15, 2024 • 1k • 35

notrichardren/pig_latin_english_mmlu

Viewer • Updated May 15, 2024 • 15.9k • 37

notrichardren/english_chinese_mmlu

Viewer • Updated May 15, 2024 • 14.9k • 31

notrichardren/azaria-mitchell-diff-filtered-2

Viewer • Updated Oct 3, 2023 • 7.59k • 72

notrichardren/azaria-mitchell-diff-filtered

Viewer • Updated Oct 3, 2023 • 803 • 74

notrichardren/HaluEval

Viewer • Updated Sep 11, 2023 • 35k • 553

notrichardren/gpt_generated_10k

Viewer • Updated Aug 24, 2023 • 10.9k • 47

notrichardren/deception-evals

Viewer • Updated Aug 24, 2023 • 924 • 37 • 1