7 159 9

Robin Williams PRO

bfuzzy1

AI & ML interests

None yet

Recent Activity

updated a collection 5 days ago

upvoted a paper 5 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

updated a collection 7 days ago

View all activity

Organizations

None yet

Collections 12

models 10

datasets 2

bfuzzy1/gunny_v2_solo_dolo

Viewer • Updated Oct 10, 2024 • 2.9k • 38 • 1

bfuzzy1/gunny_x

Viewer • Updated Oct 1, 2024 • 10k • 64 • 3

Robin Williams PRO

AI & ML interests

Recent Activity

Organizations

Collections 12

bfuzzy1/acheron-m

bfuzzy1/acheron-m1a-llama

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners

Deliberation in Latent Space via Differentiable Cache Augmentation

Outcome-Refining Process Supervision for Code Generation

models 10

bfuzzy1/acheron-m1a-llama

bfuzzy1/acheron-m

bfuzzy1/acheron-d

bfuzzy1/llambses-1

bfuzzy1/acheron-o9

bfuzzy1/acheron

bfuzzy1/acheron-c

bfuzzy1/Gunny

bfuzzy1/llambses-1_4bit

bfuzzy1/acheron-x

datasets 2

bfuzzy1/gunny_v2_solo_dolo

bfuzzy1/gunny_x

Robin Williams PRO

AI & ML interests

Recent Activity

Organizations

Collections 12

models 10 Sort: Recently updated

datasets 2 Sort: Recently updated

models 10

datasets 2