Richard Ren

notrichardren

AI & ML interests

robustness, interpretability, probing, truthfulness

Recent Activity

published a model 7 days ago
notrichardren/lorra_tqa_7b
updated a dataset 6 months ago
notrichardren/catch_ai_liar
updated a dataset 9 months ago
notrichardren/ultrachat_piglatin_test_processed

Organizations

Truthfulness & Deception Research Team
Robust Control