Jiang Jiwen

jjw0126

AI & ML interests

RL, LLM

Recent Activity

liked a dataset about 15 hours ago
AymanTarig/function-calling-v0.2-with-r1-cot
liked a dataset about 15 hours ago
Jofthomas/hermes-function-calling-thinking-V1
liked a model about 21 hours ago
Salesforce/blip2-opt-2.7b

Organizations

ucas, ELM Team, PLM-Team

jjw0126's activity

upvoted 2 articles 7 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

774

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

By NormalUhr
43