DeepSeek LLM PEPE
#64 opened 11 days ago
by
Dethox
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/vv_Kqhh8wTw7CYyrXAk_1.jpeg)
Request: DOI
#62 opened 11 days ago
by
ljh88301065
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/-LFZ8vefq0j-MtvoixLKG.png)
Update README.md
#61 opened 24 days ago
by
chanhks55
`aux_loss_alpha` should be 1e-4 instead of 1e-3?
#60 opened 24 days ago
by
cuichenx
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1647892352892-6237746c4f73a51ab018f994.png)
skript
#59 opened 25 days ago
by
technikgolem
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/4cbUqQTznFFFpEG-wsWsO.png)
有人跑成功了吗, 用的什么配置跑的
6
#58 opened 29 days ago
by
xl343
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/fHaWC9P-4VZ1YYLSqKfds.png)
how to infer with mtp?
#57 opened 30 days ago
by
duanyu
Resource Requirements for Running DeepSeek v3 Locally
5
#56 opened 30 days ago
by
wilfoderek
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60bfa4237f75bb4d92557db9/8Vu3xJkqI59GrtoFrZbwj.jpeg)
Update README.md
#55 opened about 1 month ago
by
malekradwan130
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/1ixAuHolFm_mVvYkWlq90.png)
When running, the following error occurred, indicating that the config.json is missing.
#54 opened about 1 month ago
by
catsonthecar
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/dcxx0RrwiNCOrtbsTg_ze.png)
Download link for models
2
#52 opened about 1 month ago
by
codecrypt112
Update README.md
#50 opened about 1 month ago
by
Aikun7777777
Update README.md
#49 opened about 1 month ago
by
Aikun7777777
Update modeling_deepseek.py
#47 opened about 1 month ago
by
erichartford
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/j-sg53QbeCkHiLl-_1Tp1.png)
这东西cpu能跑不
8
#46 opened about 1 month ago
by
helloquark
Request: DOI
#45 opened about 1 month ago
by
fruit007
Delete .gitattributes
#44 opened about 1 month ago
by
Anna255
Delete .gitattributes
#43 opened about 1 month ago
by
Anna255
Request: DOI
#41 opened about 1 month ago
by
abc123v9
Is there a way to convert it in to a 1 bit LLM? and use BITnet
3
#40 opened about 1 month ago
by
infinityai
Upload 01JG960S1ZEQEYRVHEYK9F55BT (1).jpg
1
#39 opened about 1 month ago
by
Rehanch3121
Hi!
1
#37 opened about 1 month ago
by
Amirshahi2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/KCHOnkizFNevd2ZwDHXAa.png)
Confusing Answer
3
#36 opened about 1 month ago
by
Zilikon
You will release a small version for consumer hardware like the v2 generation?
9
#35 opened about 1 month ago
by
anon-linux-mint
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/4nEQcPv8v7X_mXQ9iTPOV.png)
deepseek v3 - the best open source AI model?
#33 opened about 1 month ago
by
Gerogiy
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/-MkxB3Oxs-R6v0fGtV0Sa.png)
fp8转bf16的脚本在A100上无法执行
4
#32 opened about 1 month ago
by
duanyu
v3的自我认知怎么还不如之前
4
#31 opened about 1 month ago
by
yyren7
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/Z03c1c4iriNJ0nc3ukw6N.png)
No monthly active user limitation on commercial use?
#30 opened about 1 month ago
by
DrNicefellow
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5f2220612128ba01bdf08c26/PTAD_Zcdq_O91DfYaDpNW.png)
Very impressive. Good world knowledge (SimpleQA of 25) despite high math/coding performance.
2
#27 opened about 1 month ago
by
phil111
Converted bf16 Model on Hugging Face
#26 opened about 1 month ago
by
OpenSourceRonin
Create README.md
#24 opened about 1 month ago
by
xiaoshuai1234
应该把字节、阿里、百度的钱和显卡都分给deepseek,不然浪费资源啊
8
#23 opened about 1 month ago
by
eatcosmos
开源了也下不了。。哈哈哈
1
#21 opened about 1 month ago
by
hzxx0921
SHU到此一游QAQ
#20 opened about 1 month ago
by
gtTeri
![](https://cdn-avatars.huggingface.co/v1/production/uploads/no-auth/4RLYUEWdL_9tOVhmJlGy8.png)
Excited to this Open Source LLM!
#19 opened about 1 month ago
by
adrisinaga
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/BBLML63RVX0N7gQR46MwX.png)
Create README.md
#17 opened about 1 month ago
by
amiramiramirdeh
某些自研代码助手又有饭吃了
2
#16 opened about 1 month ago
by
zh20233
I can't wait to see your work
#15 opened about 1 month ago
by
jiangchengchengNLP
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/nIz1XHkqKdrCQa3Ot2QDJ.png)
vllm/sglang deploy script?
1
#14 opened about 1 month ago
by
Meteonis
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6334f2f1259c518276efa730/z_SH_OBkDyj4RCN9mqsKS.jpeg)
Create README.md
1
#13 opened about 1 month ago
by
Jaggz333
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/Rxu7sFUlDzNZnSvtPnhSa.png)
When using the web version of DeepSeek v3, it keeps repeating responses without stopping.
1
#12 opened about 1 month ago
by
Nydaym
No model card rn
#11 opened about 1 month ago
by
BlackBeenie
where?
#10 opened about 1 month ago
by
MrZhanggggg
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/4ztCwF-hifxaF-TLkP2bk.jpeg)
Create README.md
#9 opened about 1 month ago
by
semenionut
![](https://cdn-avatars.huggingface.co/v1/production/uploads/641f96ea632a1ec42cb39726/15Q3IT0LR_lT_vPis_L96.jpeg)
Create README.md
#8 opened about 1 month ago
by
gavinzhu
Create README.md
#6 opened about 1 month ago
by
RIOGOAT
Missing Model Card
#5 opened about 1 month ago
by
p3nGu1nZz
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65f2c47c8000c6a096f8f189/a1zEyog6VLHnPtqXZ0AY2.png)