Shibo Hao's picture

2 7 2

Shibo Hao

Shibo-UCSD

·

[email protected]

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

upvoted a paper about 2 months ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

commented on a paper about 2 months ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

View all activity

Organizations

None yet

Papers 7

arxiv:2412.16145

arxiv:2412.06769

arxiv:2406.09455

arxiv:2406.05673

models

None public yet

datasets

None public yet