Tuyabei
AI & ML interests
None yet
Recent Activity
upvoted a paper about 22 hours ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
upvoted a paper about 22 hours ago
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback
liked a dataset 10 days ago
WHB139426/webvid10m_motion_2.5m