Reinforced Token Optimization

AI & ML interests

None defined yet.

Recent Activity

zkshan2002  published a model 24 days ago
RTO-RL/Llama3-8B-TDPO
zkshan2002  updated a model 24 days ago
RTO-RL/Llama3-8B-TDPO
zkshan2002  published a model 24 days ago
RTO-RL/Llama3-8B-SimPO
View all activity