pk11 commited on
Commit
054cd5b
·
1 Parent(s): eea826b

bump README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -1
README.md CHANGED
@@ -2,4 +2,8 @@
2
  license: apache-2.0
3
  base_model:
4
  - Qwen/Qwen2.5-14B-Instruct-1M
5
- ---
 
 
 
 
 
2
  license: apache-2.0
3
  base_model:
4
  - Qwen/Qwen2.5-14B-Instruct-1M
5
+ ---
6
+
7
+
8
+ Fine-tuned/hyperfitted with methodology from https://arxiv.org/abs/2412.04318
9
+ With OrthoGrad optimizer https://arxiv.org/abs/2501.04697