Shawn Huang's picture

1 1

Shawn Huang

hyx21

·

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

Tool Learning with Foundation Models

authored a paper 1 day ago

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting

authored a paper 1 day ago

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

View all activity

Organizations

hyx21's activity

authored 6 papers 1 day ago

Tool Learning with Foundation Models

Paper • 2304.08354 • Published Apr 17, 2023 • 3

Ouroboros: Speculative Decoding with Large Model Enhanced Drafting

Paper • 2402.13720 • Published Feb 21, 2024 • 7

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Paper • 2404.06395 • Published Apr 9, 2024 • 22

Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads

Paper • 2410.01805 • Published Oct 2, 2024

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Paper • 2502.14856 • Published 14 days ago • 7

APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs

Paper • 2502.12085 • Published 17 days ago • 2

upvoted a paper 1 day ago

FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling

Paper • 2502.14856 • Published 14 days ago • 7

updated 2 models 5 months ago

hyx21/Locret-llama-3.1-8B-instruct

Updated Sep 25, 2024

hyx21/Locret-phi-3-mini-128K

Updated Sep 25, 2024

updated 2 models 7 months ago

hyx21/T5-3B-qdp-bmcook

Updated Aug 7, 2024 • 8

hyx21/Llama-13B-Alpaca-QLoRA-CALoRA

Updated Aug 7, 2024

liked a model about 1 year ago

openbmb/MiniCPM-2B-sft-bf16

Text Generation • Updated Sep 7, 2024 • 6.46k • 118