SNOOPI: Supercharged One-step Diffusion Distillation with Proper Guidance Paper • 2412.02687 • Published Dec 3, 2024 • 108
TinyFusion: Diffusion Transformers Learned Shallow Paper • 2412.01199 • Published Dec 2, 2024 • 14
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training Paper • 2411.13476 • Published Nov 20, 2024 • 15
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders Paper • 2410.22366 • Published Oct 28, 2024 • 77
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss Paper • 2410.17243 • Published Oct 22, 2024 • 89
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1, 2024 • 145
wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR?? Article • By catherinearnett • Published Sep 27, 2024 • 38
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality Paper • 2405.21060 • Published May 31, 2024 • 64
Improving fine-grained understanding in image-text pre-training Paper • 2401.09865 • Published Jan 18, 2024 • 16
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Paper • 2401.09417 • Published Jan 17, 2024 • 59
Scalable Pre-training of Large Autoregressive Image Models Paper • 2401.08541 • Published Jan 16, 2024 • 36
Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM Paper • 2401.02994 • Published Jan 4, 2024 • 49
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training Paper • 2401.00849 • Published Jan 1, 2024 • 15