arxiv:2410.19743
Heming Xia
hemingkx
AI & ML interests
Efficient and Effective NLP, Tool Learning, and Vision-Language Understanding.
Recent Activity
authored
a paper
about 1 hour ago
Unlocking Efficiency in Large Language Model Inference: A Comprehensive
Survey of Speculative Decoding
authored
a paper
about 1 hour ago
AppBench: Planning of Multiple APIs from Various APPs for Complex User
Instruction
authored
a paper
about 1 hour ago
SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference
Acceleration
Organizations
None yet
models
None public yet