Thomas Wolf's picture

Thomas Wolf PRO

thomwolf

·

https://thomwolf.io

AI & ML interests

NLP and open-source :-)

Recent Activity

upvoted an article 1 day ago

Open-R1: Update #1

liked a model 2 days ago

deepseek-ai/DeepSeek-R1

upvoted a collection 3 days ago

View all activity

Articles

Introducing smolagents: simple agents that write actions in code.

FineWeb2-C: Help Build Better Language Models in Your Language

LeMaterial: an open source initiative to accelerate materials discovery and research

FineVideo: behind the scenes

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

A failed experiment: Infini-Attention, and why we should keep trying?

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Constitutional AI with Open LLMs

Open LLM Leaderboard: DROP deep dive

What's going on with the Open LLM Leaderboard?

Can foundation models label data like humans?

Organizations

thomwolf's activity

upvoted an article 1 day ago

Article

Open-R1: Update #1

By

•

1 day ago

• 154

upvoted a collection 3 days ago

Tulu 3 Models

All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 4 days ago • 82

upvoted 2 articles 6 days ago

Article

Welcome to Inference Providers on the Hub 🔥

6 days ago

• 211

Article

Open-R1: a fully open reproduction of DeepSeek-R1

6 days ago

• 570

upvoted 2 articles 19 days ago

Article

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

By

•

19 days ago

• 40

Article

Diving into MiniMax01 405B MoE

By

•

19 days ago

• 17

upvoted a paper 27 days ago

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Paper • 2406.11896 • Published Jun 14, 2024 • 20

upvoted an article 29 days ago

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

By

•

Jan 2

• 39

upvoted a collection 29 days ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 26 days ago • 550

upvoted a paper 29 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 106

upvoted a paper 30 days ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 60

upvoted a collection about 1 month ago

Falcon3

Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated 26 days ago • 80

upvoted an article about 1 month ago

Article

FineWeb2-C: Help Build Better Language Models in Your Language

By

•

Dec 23, 2024

• 18

upvoted a collection about 2 months ago

TabuLa-8B

Training, eval suite, and model from the paper "Large Scale Transfer Learning for Tabular Data via Language Modeling" https://arxiv.org/abs/2406.12031 • 4 items • Updated Jun 19, 2024 • 11

upvoted 2 papers about 2 months ago

LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment

Paper • 2412.04814 • Published Dec 6, 2024 • 45

Solving Quantitative Reasoning Problems with Language Models

Paper • 2206.14858 • Published Jun 29, 2022 • 1

upvoted a collection about 2 months ago

GUI agents

A collection of papers on GUI agents • 3 items • Updated Dec 14, 2024 • 5

upvoted a paper about 2 months ago

AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Paper • 2412.09605 • Published Dec 12, 2024 • 28

upvoted a collection about 2 months ago

🥂 FineWeb2

3 items • Updated Dec 8, 2024 • 12

upvoted a collection 2 months ago

🤖 Agents

21 items • Updated Dec 31, 2024 • 111