Pio
huggirus
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
11 days ago
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open
Software Evolution
upvoted
a
paper
25 days ago
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time
Scaling
upvoted
an
article
about 1 month ago
Open-R1: a fully open reproduction of DeepSeek-R1
Organizations
None yet
Collections
1
models
4
datasets
None public yet