Nexesenex commited on
Commit
667b94c
1 Parent(s): 9dd6835

Update README.md

Browse files

---
license: llama2
---
CodeLlama 2 7b
With Guanaco Lora (Tim Dettmers), merged by Varunk29.
Then
With Mistral AI 7b 0.1 delta bits compared to Llama2 (extracted by Undi95), merged by me.

Base model (CodeLlama) training context : 16k (max context up to 96k with the base ROPE)
Mistral injection training context : 8k (Sliding Windows Attention is likely inoperant on such a merge/injection)

For test and amusement only.

Prompt : Alpaca works.

Files changed (1) hide show
  1. README.md +11 -0
README.md CHANGED
@@ -1,3 +1,14 @@
1
  ---
2
  license: llama2
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: llama2
3
  ---
4
+ CodeLlama 2 7b
5
+ With Guanaco Lora (Tim Dettmers), merged by Varunk29.
6
+ Then
7
+ With Mistral AI 7b 0.1 delta bits compared to Llama2 (extracted by Undi95), merged by me.
8
+
9
+ Base model (CodeLlama) training context : 16k (max context up to 96k with the base ROPE)
10
+ Mistral injection training context : 8k (Sliding Windows Attention is likely inoperant on such a merge/injection)
11
+
12
+ For test and amusement only.
13
+
14
+ Prompt : Alpaca works.