Rookie

Rookied

iknocho

AI & ML interests

None yet

Recent Activity

upvoted an article about 9 hours ago

Open-R1: Update #1

upvoted an article about 15 hours ago

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

upvoted an article about 16 hours ago

🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows

View all activity

Organizations

Rookied's activity

upvoted an article about 9 hours ago

Article

Open-R1: Update #1

•

3 days ago

• 204

upvoted an article about 15 hours ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

•

4 days ago

• 25

upvoted 2 articles about 16 hours ago

Article

🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows

•

2 days ago

• 4

Article

The AI tools for Art Newsletter - Issue 1

5 days ago

• 39

upvoted a paper 5 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 13 days ago • 288

upvoted 2 articles 5 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

8 days ago

• 607

Article

Welcome to Inference Providers on the Hub 🔥

8 days ago

• 232

upvoted 2 articles 13 days ago

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

•

16 days ago

• 13

Article

Hugging Face and FriendliAI partner to supercharge model deployment on the Hub

14 days ago

• 30

upvoted an article about 1 month ago

Article

Finetuning Falcon 7b in a hybrid distributed fashion

•

Dec 31, 2024

• 5

upvoted a paper about 1 month ago

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22, 2024 • 13

upvoted 2 articles 2 months ago

Article

Building a MusicGen API to Generate Custom Music Tracks Locally

•

Dec 4, 2024

• 2

Article

Optimizing Deep Learning Training Techniques

•

Dec 3, 2024

• 2

upvoted 2 articles 4 months ago

Article

LLM Inference at scale with TGI

•

Sep 6, 2024

• 14

Article

Recreating o1 at Home with Role-Play LLMs

•

Sep 20, 2024

• 23

upvoted an article 5 months ago

Article

Improving performance with Arena Learning in post training

•

Sep 11, 2024

• 5

upvoted 2 papers 5 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 124

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26, 2024 • 31

upvoted 2 articles 6 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14, 2024

• 57

Article

Outperforming Claude 3.5 Sonnet with Phi-3-mini-4k for graph entity relationship extraction tasks

•

Aug 19, 2024

• 7