Corianas committed on
Commit fdd70d7 · verified · 1 Parent(s): 0936c44

Update README.md

Files changed (1)
  1. README.md +3 -0
README.md CHANGED
@@ -2,9 +2,12 @@
 license: cc-by-nc-4.0
 ---
 A llama.c model based on Karpathy's Llama2.c project. https://github.com/karpathy/llama2.c
+
 Vocab of 4096, trained on TinyStories and my custom littlestories dataset (currently unreleased).
+
 This version was further trained on following instructions... somewhat... using https://github.com/mlabonne/llm-course/blob/main/Fine_tune_Llama_2_in_Google_Colab.ipynb
 
+
 Model uses ↨ as a shift key instead of capital letters, which simplifies the tokenizer by avoiding uppercase duplicates of tokens.
 
 To convert normal text to the right format I use:
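
The README's own conversion snippet falls outside this hunk. As a minimal sketch of the shift-key scheme described above, assuming each uppercase letter becomes ↨ followed by its lowercase form, the conversion could look like the following. The names `SHIFT`, `to_shift_format`, and `from_shift_format` are hypothetical and do not come from the repository.

```python
SHIFT = "↨"  # shift marker named in the README


def to_shift_format(text: str) -> str:
    """Replace each uppercase letter with SHIFT + its lowercase form (assumed scheme)."""
    out = []
    for ch in text:
        # Only convert letters whose lowercase form is a single character,
        # so the mapping stays reversible.
        if ch.isupper() and len(ch.lower()) == 1:
            out.append(SHIFT + ch.lower())
        else:
            out.append(ch)
    return "".join(out)


def from_shift_format(text: str) -> str:
    """Inverse mapping: uppercase the character that follows each SHIFT marker."""
    out = []
    shift = False
    for ch in text:
        if ch == SHIFT and not shift:
            shift = True  # remember the marker, emit nothing yet
            continue
        out.append(ch.upper() if shift else ch)
        shift = False
    return "".join(out)


if __name__ == "__main__":
    encoded = to_shift_format("Once upon a time, Tom found a ball.")
    print(encoded)
    print(from_shift_format(encoded))
```

With this sketch, model output can be passed through `from_shift_format` to restore ordinary capitalization, provided the input text did not already contain the ↨ character.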