Jesse Zhang
Nagi-ovo
AI & ML interests
LLM Reasoning & AI Agent
Recent Activity
liked
a dataset
10 days ago
AI-MO/NuminaMath-TIR
updated
a model
12 days ago
Nagi-ovo/Qwen2.5-7B-Reasoning-Adapter
published
a model
12 days ago
Nagi-ovo/Qwen2.5-7B-Reasoning-Adapter
Organizations
None yet
Collections
1
spaces
1
models
23

Nagi-ovo/Qwen2.5-7B-Reasoning-Adapter
Text Generation
•
Updated
•
17

Nagi-ovo/Llama-3-8B-PPO
Text Generation
•
Updated
•
16

Nagi-ovo/Llama-3-8B-SFT-RuoZhiBa
Text Generation
•
Updated
•
14

Nagi-ovo/Llama-3-8B-RM
Text Classification
•
Updated
•
14
•
2

Nagi-ovo/Llama-3-8B-DPO
Text Generation
•
Updated
•
20

Nagi-ovo/Nagi_TinyLLaMA_medical_sft
Text Generation
•
Updated
•
66

Nagi-ovo/alphazero-gomoku
Reinforcement Learning
•
Updated
•
1

Nagi-ovo/sd-class-butterflies-32
Unconditional Image Generation
•
Updated
•
60

Nagi-ovo/Qwen2.5-3B-Alpaca
Updated

Nagi-ovo/poca-SoccerTwos
Reinforcement Learning
•
Updated
•
16
datasets
None public yet