An Yang
yangapku
AI & ML interests
NLP and Deep Learning
Organizations
yangapku's activity
Create README.md
#1 opened about 2 months ago
by
Zhenru
Update README of branch dev_triton.
2
#11 opened 11 months ago
by
Cheshire94
Does Qwen support 16k context? What is the best config for max_new_tokens?
2
#22 opened over 1 year ago
by
Cheshire94
RuntimeError: The size of tensor a (8192) must match the size of tensor b (11581) at non-singleton dimension 3
1
#32 opened about 1 year ago
by
wujiekd
Fix typo
#29 opened about 1 year ago
by
IlysvlVEizbr
Load tokenizer and model in no internet kernel?
1
#33 opened about 1 year ago
by
nikjohn7
FlashAttention still needs to be disabled during inference; with it enabled, the output is currently garbled
1
#27 opened over 1 year ago
by
Trangle
I see the model has been updated; is there any description of the changes?
2
#21 opened over 1 year ago
by
Weiguo
The _convert_id_to_token method is not implemented
2
#1 opened over 1 year ago
by
YeungNLP
Does it support mixed Chinese and English input?
5
#1 opened almost 2 years ago
by
Baicai003
How can I add context with text input along with the image and the labels?
3
#5 opened almost 2 years ago
by
micole66
remove styling to fix spacing
#4 opened almost 2 years ago
by
akhaliq
Minor nit
1
#3 opened almost 2 years ago
by
osanseviero