tianzhechu commited on
Commit
3fb89cd
·
verified ·
1 Parent(s): 7a370e4

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -0
README.md ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+ # VIRL-L-Init
5
+ This model serves as a initial checkpoint to reproduce results in paper **SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training**.
6
+
7
+
8
+
9
+ ## Related links
10
+
11
+ Website: https://tianzhechu.com/SFTvsRL/
12
+
13
+ Github: https://github.com/LeslieTrue/SFTvsRL
14
+
15
+ Arxiv: https://arxiv.org/abs/2501.17161v1
16
+
17
+ HF: https://huggingface.co/papers/2501.17161