HMoE: Heterogeneous Mixture of Experts for Language Modeling • Paper 2408.10681 • Published Aug 20, 2024
Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent • Paper 2411.02265 • Published Nov 4, 2024
Scaling Laws for Floating Point Quantization Training • Paper 2501.02423 • Published Jan 2025