Nathan Habib's picture

Nathan Habib

SaylorTwift

·

AI & ML interests

None yet

Recent Activity

liked a model 2 minutes ago

deepseek-ai/DeepSeek-R1

liked a model 13 minutes ago

Qwen/Qwen2.5-7B-Instruct-1M

upvoted a paper 18 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

View all activity

Articles

CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard

Open LLM Leaderboard: DROP deep dive

What's going on with the Open LLM Leaderboard?

Organizations

SaylorTwift's activity

upvoted a paper 18 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 19 days ago • 81

upvoted an article 18 days ago

Article

🌁#82: AI and ML in Real Life

By

•

20 days ago

• 15

upvoted a collection 19 days ago

🤖 Agents

21 items • Updated 27 days ago • 108

upvoted an article 20 days ago

Article

Announcing NVIDIA Cosmos World Foundation Models

By

•

20 days ago

• 23

upvoted a paper about 1 month ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 89

upvoted a paper 4 months ago

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24, 2024 • 42

upvoted a collection 4 months ago

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 476

upvoted a paper 4 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 140

upvoted a collection 5 months ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 64 items • Updated about 2 hours ago • 514

upvoted an article 5 months ago

Article

The 5 Most Under-Rated Tools on Hugging Face

Aug 22, 2024

• 86

upvoted 3 articles 6 months ago

Article

XetHub is joining Hugging Face!

Aug 8, 2024

• 81

Article

Tool Use, Unified

Aug 12, 2024

• 72

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12, 2024

• 108

upvoted an article 8 months ago

Article

Let's talk about LLM evaluation

By

•

May 23, 2024

• 148

upvoted a collection about 1 year ago

Model Merging

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 226

upvoted a paper about 1 year ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 123

upvoted 3 papers over 1 year ago

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 33

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 171

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

Paper • 2306.01116 • Published Jun 1, 2023 • 33