Jaron
JaronTHU
·
AI & ML interests
None yet
Recent Activity
new activity
14 days ago
internlm/internlm3-8b-instruct:Fast Tokenizer
upvoted
a
collection
about 2 months ago
Phi-3
liked
a dataset
4 months ago
lmms-lab/LLaVA-Video-178K
Organizations
JaronTHU's activity
Fast Tokenizer
1
#17 opened 14 days ago
by
JaronTHU
Question about lm_head weights in Gemma-2-9b-it model
2
#34 opened 7 months ago
by
mjkmain
Fails to generate with `inputs_embeds`
2
#18 opened 7 months ago
by
JaronTHU
"It is strongly recommended to train Gemma2 models with the `eager` attention implementation "
2
#10 opened 8 months ago
by
JaronTHU
"It is strongly recommended to train Gemma2 models with the `eager` attention implementation "
2
#10 opened 8 months ago
by
JaronTHU