JingzeShi 
posted an update 15 days ago
🤩 warmup -> stable -> decay learning rate scheduler:
😎 Use the stable-phase checkpoints to continue training the model on any new dataset without loss spikes during training!!!
SmallDoge/Doge-20M-checkpoint
SmallDoge/Doge-60M-checkpoint
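
For anyone who wants to see the shape of a warmup -> stable -> decay schedule, here is a minimal sketch in plain Python. The function name `wsd_lr`, the linear warmup and linear decay shapes, and all default values are assumptions for illustration only, not the settings used to train the Doge checkpoints.

```python
def wsd_lr(step, peak_lr=1e-3, warmup_steps=100, stable_steps=800,
           decay_steps=100, min_lr=0.0):
    """Learning rate at `step` for a warmup -> stable -> decay schedule.

    Hypothetical helper: all defaults are illustrative values.
    """
    # Warmup phase: ramp linearly up to peak_lr.
    if step < warmup_steps:
        return peak_lr * (step + 1) / warmup_steps
    # Stable phase: hold peak_lr constant. A checkpoint saved here has not
    # been annealed yet, so training can resume at the same learning rate.
    if step < warmup_steps + stable_steps:
        return peak_lr
    # Decay phase: anneal linearly from peak_lr down to min_lr.
    if decay_steps <= 0:
        return min_lr
    decay_step = min(step - warmup_steps - stable_steps, decay_steps)
    return peak_lr + (min_lr - peak_lr) * decay_step / decay_steps


if __name__ == "__main__":
    for s in (0, 50, 99, 500, 899, 950, 1000):
        print(s, wsd_lr(s))
```

One way to read the post with this sketch: a stable-phase checkpoint sits on the flat part of the curve, so continuing on a new dataset amounts to extending `stable_steps` and running the decay phase at the end, with no sudden jump in learning rate at the resume point.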

Internet celebrity doge

·

but you are internet celebrity rapper of📙

finally

·

the process is always hard, the result is always good.😁