Cognitive Computations

community

https://erichartford.com

erhartford

ehartford

Activity Feed

AI & ML interests

Supervised Fine Tuning, DPO, and unalignment

Recent Activity

v2ray updated a model 3 days ago

cognitivecomputations/DeepSeek-V3-AWQ

v2ray updated a model 3 days ago

cognitivecomputations/DeepSeek-R1-AWQ

v2ray new activity 3 days ago

cognitivecomputations/DeepSeek-R1-AWQ:Deployment framework

View all activity

cognitivecomputations's activity

v2ray

updated 2 models 3 days ago

cognitivecomputations/DeepSeek-V3-AWQ

Text Generation • Updated 3 days ago • 3.11k • 12

cognitivecomputations/DeepSeek-R1-AWQ

Text Generation • Updated 3 days ago • 670 • 7

v2ray

in cognitivecomputations/DeepSeek-R1-AWQ 3 days ago

Deployment framework

#2 opened 3 days ago by

xro7

v2ray

in cognitivecomputations/DeepSeek-R1-AWQ 4 days ago

Smaller deepseek models?

#1 opened 5 days ago by

loshka2

v2ray

in cognitivecomputations/DeepSeek-V3-AWQ 6 days ago

vllm support a100

#2 opened 13 days ago by

HuggingLianWang

ehartford

updated a model 6 days ago

cognitivecomputations/DeepSeek-R1-AWQ

Text Generation • Updated 3 days ago • 670 • 7

ehartford

published a model 6 days ago

cognitivecomputations/DeepSeek-R1-AWQ

Text Generation • Updated 3 days ago • 670 • 7

ehartford

updated a model 7 days ago

cognitivecomputations/DeepSeek-R1-bf16

Updated 7 days ago • 26 • 2

ehartford

published a model 7 days ago

cognitivecomputations/DeepSeek-R1-bf16

Updated 7 days ago • 26 • 2

ehartford

updated a dataset 20 days ago

cognitivecomputations/OpenCoder-LLM_opc-sft-stage1-DolphinLabeled

Viewer • Updated 20 days ago • 3.01M • 78 • 7

bartowski

posted an update 21 days ago

Post

14644

Switching to author_model-name

I posted a poll on twitter, and others have mentioned the interest in me using the convention of including the author name in the model path when I upload.

It has a couple advantages, first and foremost of course is ensuring clarity of who uploaded the original model (did Qwen upload Qwen2.6? Or did someone fine tune Qwen2.5 and named it 2.6 for fun?)

The second thing is that it avoids collisions, so if multiple people upload the same model and I try to quant them both, I would normally end up colliding and being unable to upload both

I'll be implementing the change next week, there are just two final details I'm unsure about:

First, should the files also inherit the author's name?

Second, what to do in the case that the author name + model name pushes us past the character limit?

Haven't yet decided how to handle either case, so feedback is welcome, but also just providing this as a "heads up"