Haoze Wu

WaitHZ
·

AI & ML interests

Modular DL, Complex Reasoning

Recent Activity

Organizations

None yet

WaitHZ's activity

upvoted an article 7 days ago
upvoted 2 articles about 1 month ago
view article
Article

How to generate text: using different decoding methods for language generation with Transformers

165
view article
Article

You could have designed state of the art positional encoding

179
New activity in deepseek-ai/deepseek-moe-16b-base 12 months ago

A little question about aux_loss

2
#4 opened about 1 year ago by
WaitHZ

A little question about aux_loss

2
#4 opened about 1 year ago by
WaitHZ
New activity in deepseek-ai/deepseek-moe-16b-base about 1 year ago

A little question about aux_loss

2
#4 opened about 1 year ago by
WaitHZ