Kai Zuberbühler's picture

443 305

Kai Zuberbühler

kaizuberbuehler

·

k-zubi

AI & ML interests

language models, agents, image generation, music generation

Recent Activity

liked a model about 19 hours ago

adlb/Audialab_EDM_Elements

updated a collection 3 days ago

liked a Space 3 days ago

ServiceNow/browsergym-leaderboard

View all activity

Organizations

None yet

kaizuberbuehler's activity

liked a model about 19 hours ago

adlb/Audialab_EDM_Elements

Updated Dec 5, 2024 • 38

updated a collection 3 days ago

Leaderboards

24 items • Updated 3 days ago • 2

liked a Space 3 days ago

BrowserGym Leaderboard

updated 2 collections 3 days ago

Reasoning

44 items • Updated 3 days ago • 1

LM Training

69 items • Updated 3 days ago • 2

upvoted a paper 3 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 5 days ago • 57

updated 2 collections 3 days ago

Reasoning

44 items • Updated 3 days ago • 1

LM Training

69 items • Updated 3 days ago • 2

upvoted a paper 3 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 5 days ago • 203

updated a collection 4 days ago

Agents

75 items • Updated 4 days ago • 3

upvoted a paper 4 days ago

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published 5 days ago • 45

liked 2 models 4 days ago

bartowski/DeepSeek-R1-Distill-Qwen-14B-GGUF

Text Generation • Updated 5 days ago • 102k • 53

bartowski/DeepSeek-R1-Distill-Qwen-32B-GGUF

Text Generation • Updated 5 days ago • 907k • 73

updated 2 collections 5 days ago

LM Prompt Engineering

30 items • Updated 5 days ago

Agents

75 items • Updated 4 days ago • 3

upvoted a paper 5 days ago

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published 10 days ago • 37

updated a collection 5 days ago

Agents

75 items • Updated 4 days ago • 3

upvoted a paper 5 days ago

From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning

Paper • 2411.03817 • Published Nov 6, 2024 • 1

updated a collection 5 days ago

Agents

75 items • Updated 4 days ago • 3

upvoted a paper 5 days ago

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Paper • 2406.11896 • Published Jun 14, 2024 • 20