Julien BLANCHON's picture

Julien BLANCHON PRO

blanchon

·

AI & ML interests

Math

Recent Activity

liked a dataset about 6 hours ago

HoangHa/Pensez-v0.1

liked a model about 6 hours ago

HoangHa/Pensez-Llama3.1-8B

upvoted a collection about 6 hours ago

View all activity

Organizations

blanchon's activity

upvoted a collection about 6 hours ago

Pensez-LLM

French-English reasoning model • 4 items • Updated 4 days ago • 1

upvoted an article 1 day ago

Article

Remote VAEs for decoding with HF endpoints 🤗

2 days ago

• 25

upvoted a collection 3 days ago

AuraFlow

AuraFlow v0.x series, to date the largest (6.8B) and highest fidelity (0.7+ on GenEval) open sourced text to image model. • 3 items • Updated Sep 6, 2024 • 7

upvoted an article 4 days ago

Article

SigLIP 2: A better multilingual vision language encoder

5 days ago

• 90

upvoted a paper 5 days ago

Gated Linear Attention Transformers with Hardware-Efficient Training

Paper • 2312.06635 • Published Dec 11, 2023 • 7

upvoted a collection 6 days ago

🇫🇷 Calme-3

Here you can find all the new Calme-3 models • 27 items • Updated 16 days ago • 13

upvoted an article 8 days ago

Article

Open Preference Dataset for Text-to-Image Generation by the 🤗 Community

Dec 9, 2024

• 54

upvoted a paper 12 days ago

Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding

Paper • 2401.04575 • Published Jan 9, 2024 • 17

upvoted a collection 13 days ago

Nomic Embed v2

Multilingual Embedding Models • 4 items • Updated 10 days ago • 11

upvoted 2 papers 13 days ago

VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation

Paper • 2502.07531 • Published 14 days ago • 13

Generating Multi-Image Synthetic Data for Text-to-Image Customization

Paper • 2502.01720 • Published 22 days ago • 7

upvoted a collection 13 days ago

Terminus XL

v-prediction SDXL clone with zero-terminal SNR noise schedule • 8 items • Updated Apr 24, 2024 • 7

upvoted 3 collections 14 days ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 73

Ultravox v0.5

Ultravox is a multimodal Speech LLM built around different pretrained LLMs (frozen) and the whisper-large-v3-turbo (fine-tuned) backbone. • 3 items • Updated 15 days ago • 5

R3GAN

R3GAN: A Modern BaselineGAN https://github.com/brownvc/R3GAN/ https://arxiv.org/abs/2501.05441 • 7 items • Updated Jan 10 • 10

upvoted 3 papers 14 days ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 88

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published 19 days ago • 33

Material Anything: Generating Materials for Any 3D Object via Diffusion

Paper • 2411.15138 • Published Nov 22, 2024 • 44

upvoted a paper 16 days ago

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published 18 days ago • 88

upvoted an article 20 days ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

22 days ago

• 107