arxiv:2412.03679
Seungone Kim
seungone
AI & ML interests
Large Language Models, LLM-as-a-Judge, Reward Model Overoptimization, Personalized Alignment
Recent Activity
liked a Space 2 days ago: Qwen/QwQ-32B-preview
updated a model 13 days ago: seungone/skywork-reward-replicate
upvoted a paper 15 days ago: MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
Organizations
Papers: 22
Datasets: 5
seungone/ablation1_math_gpt4o_mini • 5.56k • 66
seungone/ablation3_math_llama3.1_8b_instruct • 24.8k • 69
seungone/ablation2_math_llama3.1_8b_instruct • 5.99k • 31
seungone/ablation1_code_gpt4o_mini • 10k • 77
seungone/final-math-claude3.5_sonnet-10000 • 10k • 34