|
--- |
|
tags: |
|
- llama2 |
|
- llama2-70b |
|
- rkllm |
|
- rockchip |
|
- rk3588 |
|
--- |
|
|
|
# Llama 2 Chat 70B for RK3588 |
|
This is a conversion from https://huggingface.co/meta-llama/Llama-2-70b-chat-hf to the RKLLM format for Rockchip devices. |
|
This runs on the NPU from the RK3588. |
|
|
|
# Convert to one file |
|
Run: |
|
|
|
```bash |
|
cat llama2-chat-70b-hf-0* > llama2-chat-70b-hf.rkllm |
|
``` |
|
|
|
# But wait... will this run on my RK3588? |
|
No. But I found interesting to see what happens if I converted it. |
|
Let's hope Microsoft never knows that I was using their SSDs as swap because they don't allow more than 32 GB RAM for the students subscription :P |
|
|
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/660da5d45d68779a53384179/lWqXKM3R_0_3Vlv-6yHji.png) |
|
|
|
And this is before finishing, it will probably get to 600 GBs of RAM + Swap. |
|
|
|
But hey! You can always try yourself getting a 512GB SSD (and use around 100-250 GB as swap), a 32 GB of RAM SBC, have some patience and see if it loads. Good luck with that! |
|
|
|
# Main repo |
|
See this for my full collection of converted LLMs for the RK3588's NPU: |
|
|
|
https://huggingface.co/Pelochus/ezrkllm-collection |
|
|
|
# License |
|
Same as the original LLM: |
|
|
|
https://huggingface.co/meta-llama/Llama-2-70b-chat-hf/blob/main/LICENSE.txt |