Chmielewski

Eryk-Chmielewski

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

moonshotai/Moonlight-16B-A3B-Instruct

liked a model 3 days ago

zed-industries/zeta

liked a Space 6 days ago

nanotron/ultrascale-playbook

Eryk-Chmielewski's activity

upvoted a paper 14 days ago

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published 16 days ago • 32

upvoted 13 papers 23 days ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 51

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published Jan 24 • 24

ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer

Paper • 2501.15570 • Published about 1 month ago • 23

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 28 days ago • 107

Large Language Models Think Too Fast To Explore Effectively

Paper • 2501.18009 • Published 27 days ago • 23

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published 26 days ago • 27

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 26 days ago • 55

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published 25 days ago • 37

s1: Simple test-time scaling

Paper • 2501.19393 • Published 25 days ago • 106

upvoted a collection 26 days ago

HIGGS

Collection

Models prequantized with [HIGGS](https://arxiv.org/abs/2411.17525) zero-shot quantization. Requires the latest `transformers` to run. • 17 items • Updated Dec 24, 2024 • 6

upvoted an article 29 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 29 days ago • 773

upvoted 4 papers 30 days ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 84

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published Jan 20 • 32

SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution

Paper • 2501.05040 • Published Jan 9 • 15

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 88