Maxime Labonne PRO

mlabonne

https://mlabonne.github.io/blog

AI & ML interests

Post-training, model editing, quantization

Recent Activity

liked a Space 2 days ago

burtenshaw/recap

Articles

Decoding Strategies in Large Language Models

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

The Rise of Agentic Data Generation

Uncensor any LLM with abliteration

Fine-tune Llama 3 with ORPO

Create Mixtures of Experts with MergeKit

Merge Large Language Models with mergekit

mlabonne's activity

upvoted 2 articles about 1 month ago

Article

The Beginners Guide to Cleaning a Dataset

Nov 18 • 24

Article

Releasing the largest multilingual open pretraining dataset

Nov 13 • 98

upvoted an article about 2 months ago

Article

Decoding Strategies in Large Language Models

Oct 29 • 38

upvoted a paper about 2 months ago

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22 • 11

upvoted 2 articles 3 months ago

Article

VLM Art Analysis

Oct 4 • 11

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18 • 213

upvoted a collection 4 months ago

🧠 Abliteration

Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration • 7 items • Updated Nov 18 • 24

upvoted an article 4 months ago

Article

Introduction to ggml

Aug 13 • 120

upvoted a paper 5 months ago

The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines

Paper • 2408.01050 • Published Aug 2 • 8

upvoted an article 5 months ago

Article

The case for specialized pre-training: ultra-fast foundation models for dedicated tasks

Aug 4 • 27

upvoted a paper 5 months ago

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1 • 23

upvoted a collection 5 months ago

Probably function calling datasets

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17 • 36

upvoted 2 papers 5 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1 • 27

Understanding Reference Policies in Direct Preference Optimization

Paper • 2407.13709 • Published Jul 18 • 16

upvoted 3 collections 5 months ago

Bad Data Toolbox

PleIAs collection of models for the data processing of challenging document and data sources. • 5 items • Updated Jul 18 • 15

OpenCulture

A multilingual dataset of public domain books and newspapers. • 27 items • Updated Nov 6 • 121

Finance Commons

A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17 • 7

upvoted an article 5 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29 • 257

upvoted a paper 5 months ago

The Importance of Online Data: Understanding Preference Fine-tuning via Coverage

Paper • 2406.01462 • Published Jun 3 • 6

upvoted an article 5 months ago

Article

The Rise of Agentic Data Generation

Jul 15 • 78