Hugging Face
Shibo Hao
Shibo-UCSD
3 followers · 1 following
AI & ML interests
None yet
Recent Activity
authored a paper about 2 months ago
Offline Reinforcement Learning for LLM Multi-Step Reasoning
upvoted a paper about 2 months ago
Offline Reinforcement Learning for LLM Multi-Step Reasoning
commented on a paper about 2 months ago
Offline Reinforcement Learning for LLM Multi-Step Reasoning
Organizations
None yet
Papers
7
arxiv:2412.16145
arxiv:2412.06769
arxiv:2406.09455
arxiv:2406.05673
models
None public yet
datasets
None public yet