arxiv:2410.13184
Guoheng Sun
s1ghhh
AI & ML interests
None yet
Recent Activity
liked a model 3 days ago: deepseek-ai/Janus-Pro-7B
liked a dataset 3 days ago: bespokelabs/Bespoke-Stratos-17k
liked a dataset 3 days ago: AI-MO/NuminaMath-TIR
Organizations
Papers: 1
Spaces: 1
Models: 19
s1ghhh/opt-attn-tmp • Updated
s1ghhh/Llama-3-70b-Drop • Text Generation • Updated • 13 • 3
s1ghhh/Llama-2-70b-Drop • Text Generation • Updated • 4 • 2
s1ghhh/Llama-2-13b-Drop8Block • Updated • 3 • 2
s1ghhh/Llama-2-13b-Drop8Attn • Updated • 1 • 2
s1ghhh/Llama-2-13b-Drop8MLP • Updated • 2 • 2
s1ghhh/Llama-2-13b-Drop4MLP • Updated • 5 • 2
s1ghhh/Llama-2-13b-Drop4Attn • Updated • 4 • 2
s1ghhh/Llama-2-13b-Drop4Block • Updated • 3 • 2
s1ghhh/Mistral-7B-v0.1-Drop8Block • Updated • 3 • 2