BBTv2: Towards a Gradient-Free Future with Large Language Models Paper • 2205.11200 • Published May 23, 2022 • 2
Multi-Head Adapter Routing for Cross-Task Generalization Paper • 2211.03831 • Published Nov 7, 2022 • 2 • 2
BoRA: Bi-dimensional Weight-Decomposed Low-Rank Adaptation Paper • 2412.06441 • Published Dec 9, 2024 • 2
FineGates: LLMs Finetuning with Compression using Stochastic Gates Paper • 2412.12951 • Published Dec 17, 2024 • 2
RandLoRA: Full-rank parameter-efficient fine-tuning of large models Paper • 2502.00987 • Published 23 days ago • 9 • 3
EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition Paper • 2501.12067 • Published Jan 21, 2025 • 2
TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning Paper • 2501.08008 • Published Jan 14, 2025 • 2
GaLore+: Boosting Low-Rank Adaptation for LLMs with Cross-Head Projection Paper • 2412.19820 • Published Dec 15, 2024 • 2
Sequential Compression Layers for Efficient Federated Learning in Foundational Models Paper • 2412.07021 • Published Dec 9, 2024 • 2
Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models Paper • 2501.19389 • Published 25 days ago • 3 • 3
Adaptive Parameter-Efficient Federated Fine-Tuning on Heterogeneous Devices Paper • 2412.20004 • Published Dec 28, 2024 • 2
LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model Paper • 2501.08582 • Published Jan 15, 2025 • 2
Gradient Weight-normalized Low-rank Projection for Efficient LLM Training Paper • 2412.19616 • Published Dec 27, 2024 • 2
Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning Paper • 2309.06922 • Published Sep 13, 2023 • 1 • 2
CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs Paper • 2409.12490 • Published Sep 19, 2024 • 2
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management Paper • 2406.19707 • Published Jun 28, 2024 • 2
Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding Paper • 2409.08561 • Published Sep 13, 2024 • 2
Diver: Large Language Model Decoding with Span-Level Mutual Information Verification Paper • 2406.02120 • Published Jun 4, 2024 • 2