nGPT: Normalized Transformer with Representation Learning on the Hypersphere Paper • 2410.01131 • Published Oct 1, 2024 • 8 upvotes
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1, 2024 • 143 upvotes
Q-Sparse: All Large Language Models can be Fully Sparsely-Activated Paper • 2407.10969 • Published Jul 15, 2024 • 20 upvotes
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark Paper • 2406.01574 • Published Jun 3, 2024 • 42 upvotes
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality Paper • 2405.21060 • Published May 31, 2024 • 63 upvotes
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 602 upvotes
Think before you speak: Training Language Models With Pause Tokens Paper • 2310.02226 • Published Oct 3, 2023 • 2 upvotes
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty Paper • 2401.15077 • Published Jan 26, 2024 • 18 upvotes
SliceGPT: Compress Large Language Models by Deleting Rows and Columns Paper • 2401.15024 • Published Jan 26, 2024 • 68 upvotes