LoneStriker
commited on
Commit
β’
e0cc40b
1
Parent(s):
8f02934
Upload folder using huggingface_hub
Browse files- .gitattributes +5 -35
- README.md +92 -0
- Yi-1.5-34B-32K-Q3_K_L.gguf +3 -0
- Yi-1.5-34B-32K-Q4_K_M.gguf +3 -0
- Yi-1.5-34B-32K-Q5_K_M.gguf +3 -0
- Yi-1.5-34B-32K-Q6_K.gguf +3 -0
- Yi-1.5-34B-32K-Q8_0.gguf +3 -0
- md5 +21 -0
- nohup.out +0 -0
.gitattributes
CHANGED
@@ -1,35 +1,5 @@
|
|
1 |
-
|
2 |
-
|
3 |
-
|
4 |
-
|
5 |
-
|
6 |
-
*.ftz filter=lfs diff=lfs merge=lfs -text
|
7 |
-
*.gz filter=lfs diff=lfs merge=lfs -text
|
8 |
-
*.h5 filter=lfs diff=lfs merge=lfs -text
|
9 |
-
*.joblib filter=lfs diff=lfs merge=lfs -text
|
10 |
-
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
11 |
-
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
12 |
-
*.model filter=lfs diff=lfs merge=lfs -text
|
13 |
-
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
14 |
-
*.npy filter=lfs diff=lfs merge=lfs -text
|
15 |
-
*.npz filter=lfs diff=lfs merge=lfs -text
|
16 |
-
*.onnx filter=lfs diff=lfs merge=lfs -text
|
17 |
-
*.ot filter=lfs diff=lfs merge=lfs -text
|
18 |
-
*.parquet filter=lfs diff=lfs merge=lfs -text
|
19 |
-
*.pb filter=lfs diff=lfs merge=lfs -text
|
20 |
-
*.pickle filter=lfs diff=lfs merge=lfs -text
|
21 |
-
*.pkl filter=lfs diff=lfs merge=lfs -text
|
22 |
-
*.pt filter=lfs diff=lfs merge=lfs -text
|
23 |
-
*.pth filter=lfs diff=lfs merge=lfs -text
|
24 |
-
*.rar filter=lfs diff=lfs merge=lfs -text
|
25 |
-
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
26 |
-
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
27 |
-
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
28 |
-
*.tar filter=lfs diff=lfs merge=lfs -text
|
29 |
-
*.tflite filter=lfs diff=lfs merge=lfs -text
|
30 |
-
*.tgz filter=lfs diff=lfs merge=lfs -text
|
31 |
-
*.wasm filter=lfs diff=lfs merge=lfs -text
|
32 |
-
*.xz filter=lfs diff=lfs merge=lfs -text
|
33 |
-
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
-
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
-
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
1 |
+
Yi-1.5-34B-32K-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
2 |
+
Yi-1.5-34B-32K-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
3 |
+
Yi-1.5-34B-32K-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
4 |
+
Yi-1.5-34B-32K-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
5 |
+
Yi-1.5-34B-32K-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
README.md
ADDED
@@ -0,0 +1,92 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
---
|
4 |
+
<div align="center">
|
5 |
+
|
6 |
+
<picture>
|
7 |
+
<img src="https://raw.githubusercontent.com/01-ai/Yi/main/assets/img/Yi_logo_icon_light.svg" width="150px">
|
8 |
+
</picture>
|
9 |
+
|
10 |
+
</div>
|
11 |
+
|
12 |
+
<p align="center">
|
13 |
+
<a href="https://github.com/01-ai">π GitHub</a> β’
|
14 |
+
<a href="https://discord.gg/hYUwWddeAu">πΎ Discord</a> β’
|
15 |
+
<a href="https://twitter.com/01ai_yi">π€ Twitter</a> β’
|
16 |
+
<a href="https://github.com/01-ai/Yi-1.5/issues/2">π¬ WeChat</a>
|
17 |
+
<br/>
|
18 |
+
<a href="https://arxiv.org/abs/2403.04652">π Paper</a> β’
|
19 |
+
<a href="https://github.com/01-ai/Yi/tree/main?tab=readme-ov-file#faq">π FAQ</a> β’
|
20 |
+
<a href="https://github.com/01-ai/Yi/tree/main?tab=readme-ov-file#learning-hub">π Learning Hub</a>
|
21 |
+
</p>
|
22 |
+
|
23 |
+
# Intro
|
24 |
+
|
25 |
+
Yi-1.5 is an upgraded version of Yi. It is continuously pre-trained on Yi with a high-quality corpus of 500B tokens and fine-tuned on 3M diverse fine-tuning samples.
|
26 |
+
|
27 |
+
Compared with Yi, Yi-1.5 delivers stronger performance in coding, math, reasoning, and instruction-following capability, while still maintaining excellent capabilities in language understanding, commonsense reasoning, and reading comprehension.
|
28 |
+
|
29 |
+
<div align="center">
|
30 |
+
|
31 |
+
Model | Context Length | Pre-trained Tokens
|
32 |
+
| :------------: | :------------: | :------------: |
|
33 |
+
| Yi-1.5 | 4K, 16K, 32K | 3.6T
|
34 |
+
|
35 |
+
</div>
|
36 |
+
|
37 |
+
# Models
|
38 |
+
|
39 |
+
- Chat models
|
40 |
+
|
41 |
+
<div align="center">
|
42 |
+
|
43 |
+
| Name | Download |
|
44 |
+
| --------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
45 |
+
| Yi-1.5-34B-Chat | β’ [π€ Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) β’ [π€ ModelScope](https://www.modelscope.cn/organization/01ai) |
|
46 |
+
| Yi-1.5-34B-Chat-16K | β’ [π€ Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) β’ [π€ ModelScope](https://www.modelscope.cn/organization/01ai) |
|
47 |
+
| Yi-1.5-9B-Chat | β’ [π€ Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) β’ [π€ ModelScope](https://www.modelscope.cn/organization/01ai) |
|
48 |
+
| Yi-1.5-9B-Chat-16K | β’ [π€ Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) β’ [π€ ModelScope](https://www.modelscope.cn/organization/01ai) |
|
49 |
+
| Yi-1.5-6B-Chat | β’ [π€ Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) β’ [π€ ModelScope](https://www.modelscope.cn/organization/01ai) |
|
50 |
+
|
51 |
+
</div>
|
52 |
+
|
53 |
+
- Base models
|
54 |
+
|
55 |
+
<div align="center">
|
56 |
+
|
57 |
+
| Name | Download |
|
58 |
+
| ---------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
|
59 |
+
| Yi-1.5-34B | β’ [π€ Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) β’ [π€ ModelScope](https://www.modelscope.cn/organization/01ai) |
|
60 |
+
| Yi-1.5-34B-32K | β’ [π€ Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) β’ [π€ ModelScope](https://www.modelscope.cn/organization/01ai) |
|
61 |
+
| Yi-1.5-9B | β’ [π€ Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) β’ [π€ ModelScope](https://www.modelscope.cn/organization/01ai) |
|
62 |
+
| Yi-1.5-9B-32K | β’ [π€ Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) β’ [π€ ModelScope](https://www.modelscope.cn/organization/01ai) |
|
63 |
+
| Yi-1.5-6B | β’ [π€ Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) β’ [π€ ModelScope](https://www.modelscope.cn/organization/01ai) |
|
64 |
+
|
65 |
+
</div>
|
66 |
+
|
67 |
+
# Benchmarks
|
68 |
+
|
69 |
+
- Chat models
|
70 |
+
|
71 |
+
Yi-1.5-34B-Chat is on par with or excels beyond larger models in most benchmarks.
|
72 |
+
|
73 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/656d9adce8bf55919aca7c3f/KcsJ9Oc1VnEmfCDEJc5cd.png)
|
74 |
+
|
75 |
+
Yi-1.5-9B-Chat is the top performer among similarly sized open-source models.
|
76 |
+
|
77 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/656d9adce8bf55919aca7c3f/xf6pLg5jqRCwjlh6m3t6_.png)
|
78 |
+
|
79 |
+
- Base models
|
80 |
+
|
81 |
+
Yi-1.5-34B is on par with or excels beyond larger models in some benchmarks.
|
82 |
+
|
83 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/656d9adce8bf55919aca7c3f/BwU7QM-03dZvZzwdIE1xY.png)
|
84 |
+
|
85 |
+
Yi-1.5-9B is the top performer among similarly sized open-source models.
|
86 |
+
|
87 |
+
![image/png](https://cdn-uploads.huggingface.co/production/uploads/656d9adce8bf55919aca7c3f/y-EYSYPT-3aWLJ0x8R94F.png)
|
88 |
+
|
89 |
+
# Quick Start
|
90 |
+
|
91 |
+
For getting up and running with Yi-1.5 models quickly, see [README](https://github.com/01-ai/Yi-1.5).
|
92 |
+
|
Yi-1.5-34B-32K-Q3_K_L.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:cfe8e78a23d9236a8017a76e64f95d093957c3a98cfbca0b076aee7c75351c89
|
3 |
+
size 18139445376
|
Yi-1.5-34B-32K-Q4_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f205eafa6c4cc076f4162a8bf458f4ee8f3f22848a7ecb37bec4a64fcf811c1d
|
3 |
+
size 20658710656
|
Yi-1.5-34B-32K-Q5_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:50af9d670fd1473b75cefd9f78090c2696570a3845ef487a0f6bb9baf862bd24
|
3 |
+
size 24321845376
|
Yi-1.5-34B-32K-Q6_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:65525f9d4bc6d6ab97f5940cf028765d40ca1e9468260bbd84792ad9b492a559
|
3 |
+
size 28213926016
|
Yi-1.5-34B-32K-Q8_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6d74b6c01d7811ae9381c6f617335645ae9e0b111be2a68c31a27ec3a713f965
|
3 |
+
size 36542281856
|
md5
ADDED
@@ -0,0 +1,21 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
dfa9a347880967bd00337d317f7be4a7 config.json
|
2 |
+
c8ef9083b4a144feb5b2a814f4a002ba generation_config.json
|
3 |
+
64a07979e4184ecdbbe108874d84ae24 model-00001-of-00015.safetensors
|
4 |
+
b1a26a9bb4f64ffef13939b45aa75196 model-00002-of-00015.safetensors
|
5 |
+
29843157024717a7020b18c118e3633b model-00003-of-00015.safetensors
|
6 |
+
01d1007f3b6c6cc9a9adf539208bf03a model-00004-of-00015.safetensors
|
7 |
+
73124ba5f32a6dd8e983b27dc403ef68 model-00005-of-00015.safetensors
|
8 |
+
e60357523f97ad189c36c4ba4ab6a025 model-00006-of-00015.safetensors
|
9 |
+
a054b9f383cac9b6147d5af2eecc2c1e model-00007-of-00015.safetensors
|
10 |
+
6925579e085d6c96b4165b7af1f7d4ae model-00008-of-00015.safetensors
|
11 |
+
141a2dc4aae0a0f3bdbc504c831f8d66 model-00009-of-00015.safetensors
|
12 |
+
a3d6b584dbd051e726d33ef27fae2032 model-00010-of-00015.safetensors
|
13 |
+
3a0a77a24e5fca70ab200ac51e5cfc9a model-00011-of-00015.safetensors
|
14 |
+
c4c30d23a293c44be9c9c512d628056d model-00012-of-00015.safetensors
|
15 |
+
c81232db1d5f0f43796ced42ae0b4bcf model-00013-of-00015.safetensors
|
16 |
+
aa1e3f59d1320aba00dc71d125addf0c model-00014-of-00015.safetensors
|
17 |
+
294688e18d848ed00454b25e75e7b9fe model-00015-of-00015.safetensors
|
18 |
+
d24846f52ccec5bbc266022ad9b17efc model.safetensors.index.json
|
19 |
+
431e5b229fb9efa8c11ccc673036569d tokenizer.json
|
20 |
+
291724ef50f729e45d68f474a7755bbc tokenizer.model
|
21 |
+
3dadb1263199aa59076b52a91d9ec7f1 tokenizer_config.json
|
nohup.out
ADDED
File without changes
|