michael jefferson
520jefferson
AI & ML interests
None yet
Recent Activity
liked
a dataset
16 days ago
di-zhang-fdu/OpenLongCoT-Pretrain-v2
new activity
27 days ago
deepseek-ai/DeepSeek-V3-Base: Model differences
liked
a model
about 1 month ago
RLHFlow/Llama3.1-8B-PRM-Deepseek-Data
Organizations
520jefferson's activity
Model differences
2
#22 opened 28 days ago
by
520jefferson
Run on iOS
4
#1 opened 9 months ago
by
evgeniymikholap
Can the models be told apart?
2
#2 opened 5 months ago
by
520jefferson
Is it possible to train at 100k on top of this?
#6 opened 7 months ago
by
520jefferson
How much memory is needed for a 128k context length?
1
#13 opened 8 months ago
by
ggbondcxk
multiturn roleplay
2
#2 opened about 1 year ago
by
520jefferson
How can I limit the generation length? I haven't seen the new max length in chat().
2
#6 opened about 1 year ago
by
520jefferson
May I use DeepSpeed to accelerate the model inference process?
1
#5 opened about 1 year ago
by
520jefferson