PZ's picture

PZ PRO

philipp-zettl

·

philipp-zettl

AI & ML interests

NLP/CV/Multimodal learning

Recent Activity

new activity 2 days ago

philipp-zettl/chessPT:Any results?

upvoted a paper 3 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

liked a Space 3 days ago

huggingface/HuggingDiscussions

View all activity

Organizations

philipp-zettl's activity

upvoted a paper 3 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 5 days ago • 140

upvoted an article 13 days ago

Article

FineWeb2-C: Help Build Better Language Models in Your Language

By

and 5 others •

Dec 23, 2024

• 18

upvoted a paper 27 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published about 1 month ago • 87

upvoted a paper about 1 month ago

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 74

upvoted a paper about 2 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 126

upvoted 2 collections 2 months ago

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 127

Common Models

The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 29

upvoted an article 2 months ago

Article

They Said It Couldn’t Be Done

By

and 2 others •

Dec 5, 2024

• 80

upvoted 2 collections 2 months ago

LLäMmlein Chat Preview 🐑

https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/ • 8 items • Updated Nov 22, 2024 • 11

INTELLECT-1

13 items • Updated Dec 5, 2024 • 7

upvoted a paper 3 months ago

Cautious Optimizers: Improving Training with One Line of Code

Paper • 2411.16085 • Published Nov 25, 2024 • 15

upvoted a collection 3 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 3 days ago • 222

upvoted a collection 4 months ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 227

upvoted an article 4 months ago

Article

HTRflow - A tool for HTR and OCR

By

and 3 others •

Oct 1, 2024

• 15

upvoted a collection 4 months ago

Realistic Vision (SD1.5)

8 items • Updated Dec 4, 2023 • 33

upvoted an article 4 months ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25, 2024

• 182

upvoted a paper 5 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 89

upvoted a collection 6 months ago

Zeroshot Classifiers

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 12 items • Updated Jan 6 • 126

upvoted an article 6 months ago

Article

Introducing TextImage Augmentation for Document Images

Aug 6, 2024

• 32

upvoted an article 7 months ago

Article

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Jul 16, 2024

• 33