arxiv:2412.02674
hlzhang109
AI & ML interests
None yet
Recent Activity
authored a paper about 1 month ago
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark
authored a paper about 1 month ago
CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training
authored a paper about 1 month ago
DataComp-LM: In search of the next generation of training sets for language models
Organizations
models
1
datasets
None public yet