- Paper: SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model (arXiv:2502.02737, published 5 days ago)
- Paper: Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback (arXiv:2501.03916, published Jan 7)
- Article: Fine-tune ModernBERT for text classification using synthetic data, by davidberenstein1957 (Dec 30, 2024)
- Paper: CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up (arXiv:2412.16112, published Dec 20, 2024)
- Paper: VisualLens: Personalization through Visual History (arXiv:2411.16034, published Nov 25, 2024)
- Paper: UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages (arXiv:2411.14343, published Nov 21, 2024)
- Paper: Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models (arXiv:2411.07232, published Nov 11, 2024)
- Paper: JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation (arXiv:2411.07975, published Nov 12, 2024)
- Article: Extending *Transformer layers as Painters* to DiT's, by NagaSaiAbhinay (Aug 31, 2024)
- Article: Train custom AI models with the trainer API and adapt them to 🤗, by not-lain (Jun 29, 2024)
- Article: SeeMoE: Implementing a MoE Vision Language Model from Scratch, by AviSoori1x (Jun 23, 2024)
- Paper: How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study (arXiv:2404.14047, published Apr 22, 2024)
- Article: seemore: Implement a Vision Language Model from Scratch, by AviSoori1x (Jun 23, 2024)
- Article: Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data, by Pclanglais (Apr 18, 2024)
- Article: Introducing Idefics2: A Powerful 8B Vision-Language Model for the community (Apr 15, 2024)
- Article: DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive, by bpan (Apr 9, 2024)
- Paper: Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order (arXiv:2404.00399, published Mar 30, 2024)