arxiv:2408.06292
Chris Lu
chrlu
AI & ML interests
None yet
Organizations
models
19
chrlu/zephyr-7b-gemma-bline-kto-unlabeled
Text Generation
•
Updated
•
5
chrlu/zephyr-7b-gemma-kto-2
Text Generation
•
Updated
•
5
chrlu/zephyr-7b-gemma-adaptive_confidence_margin_loss_213
Text Generation
•
Updated
•
3
chrlu/zephyr-7b-gemma-adaptive_quantile_feedback_loss
Text Generation
•
Updated
•
1
chrlu/zephyr-7b-gemma-dynamic_blended_adaptive_quantile_loss
Text Generation
•
Updated
•
5
chrlu/zephyr-7b-gemma-adaptive_blended_loss_with_temperature_scaling
Text Generation
•
Updated
•
2
chrlu/zephyr-7b-gemma-log_ratio_modulated_loss
Text Generation
•
Updated
•
5
chrlu/zephyr-7b-gemma-policy_focused_loss
Text Generation
•
Updated
•
7
chrlu/zephyr-7b-gemma-combined_exp_logistic_loss
Text Generation
•
Updated
chrlu/zephyr-7b-gemma-adaptive_quantile_loss
Text Generation
•
Updated
•
6
datasets
None public yet