hlzhang109's picture

2 1

hlzhang109

hlzhang109

·

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark

authored a paper about 1 month ago

CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

authored a paper about 1 month ago

DataComp-LM: In search of the next generation of training sets for language models

View all activity

Organizations

Papers 5

arxiv:2412.02674

arxiv:2407.01100

arxiv:2406.11794

arxiv:2406.10670

models 1

hlzhang109/CoLoR-filter

Updated Jun 15, 2024

datasets

None public yet