231 2 1

)))?!?(((

stereoplegic

AI & ML interests

None yet

Recent Activity

commented on a paper 7 days ago

BBTv2: Towards a Gradient-Free Future with Large Language Models

updated a collection 7 days ago

No backprop

updated a collection 14 days ago

Speculative

View all activity

Organizations

stereoplegic's activity

commented a paper 7 days ago

BBTv2: Towards a Gradient-Free Future with Large Language Models

Paper • 2205.11200 • Published May 23, 2022 •

commented 13 papers 21 days ago

EDoRA: Efficient Weight-Decomposed Low-Rank Adaptation via Singular Value Decomposition

Paper • 2501.12067 • Published Jan 21 •

TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Paper • 2501.08008 • Published Jan 14 •

GaLore$+$: Boosting Low-Rank Adaptation for LLMs with Cross-Head Projection

Paper • 2412.19820 • Published Dec 15, 2024 •

Sequential Compression Layers for Efficient Federated Learning in Foundational Models

Paper • 2412.07021 • Published Dec 9, 2024 •

Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models

Paper • 2501.19389 • Published 25 days ago • 3 •

Adaptive Parameter-Efficient Federated Fine-Tuning on Heterogeneous Devices

Paper • 2412.20004 • Published Dec 28, 2024 •

LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model

Paper • 2501.08582 • Published Jan 15 •

Gradient Weight-normalized Low-rank Projection for Efficient LLM Training

Paper • 2412.19616 • Published Dec 27, 2024 •

Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning

Paper • 2309.06922 • Published Sep 13, 2023 • 1 •

New activity in OpenNLPLab/TransNormerLLM-7B 3 months ago

Unsafe use of eval

#1 opened over 1 year ago by

Daniel-P-Gonzalez

commented 5 papers 5 months ago

CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs

Paper • 2409.12490 • Published Sep 19, 2024 •

Inference-Friendly Models With MixAttention

Paper • 2409.15012 • Published Sep 23, 2024 •

InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management

Paper • 2406.19707 • Published Jun 28, 2024 •

Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding

Paper • 2409.08561 • Published Sep 13, 2024 •

Diver: Large Language Model Decoding with Span-Level Mutual Information Verification

Paper • 2406.02120 • Published Jun 4, 2024 •