Stephen Oates PRO

soates

AI & ML interests

None yet

Recent Activity

upvoted an article 7 days ago

Open-R1: Update #1

upvoted an article 12 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

upvoted a collection 18 days ago

EvaByte

View all activity

Organizations

None yet

soates's activity

upvoted an article 7 days ago

Article

Open-R1: Update #1

and 7 others •

8 days ago

• 258

upvoted an article 12 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

13 days ago

• 679

upvoted a collection 18 days ago

EvaByte

Collection

3 items • Updated 19 days ago • 3

upvoted an article 27 days ago

Article

Mastering Tensor Dimensions in Transformers

•

28 days ago

• 43

upvoted a paper about 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

liked a model about 2 months ago

Datou1111/shou_xin

Text-to-Image • Updated Dec 9, 2024 • 2.66k • 857

upvoted a paper 5 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 88

liked a model 5 months ago

lamm-mit/LifeGPT

Updated Sep 19, 2024 • 8

upvoted an article 5 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 217

liked a Space 6 months ago

110

Open-LLM performances are plateauing, let’s make the leaderboard steep again

🏔

Update leaderboard for fair model evaluation

upvoted 2 articles 6 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

•

Aug 19, 2024

• 76

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14, 2024

• 57

liked a model 6 months ago

nisten/Biggie-SmoLlm-0.15B-Base

Text Generation • Updated Aug 7, 2024 • 571 • 233

liked a Space 7 months ago

Gpt2 Multiplication Predictor

📈

Multiply large numbers using different reasoning methods

upvoted an article 8 months ago

Article

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 427

liked a Space 8 months ago

633

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a Space 9 months ago

267

Phi-3 WebGPU

🚀

A private and powerful AI that runs locally in your browser

updated a collection 9 months ago

Llms

Collection

2 items • Updated May 8, 2024

updated a collection 10 months ago

Llms

Collection

2 items • Updated May 8, 2024