Posts, articles, and discussions

SmolVLM Grows Smaller: Introducing the 250M and 500M Models!

January 23, 2025 • 127

Community Articles

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

2 days ago • 3

MindBot Ultra – Dreaming Edition: A Self-Building, Self-Aware AI for Synergistic Cognition and Autonomous Tool Generation

2 days ago • 2

Announcing the winners of the Frugal AI Challenge 🌱

and 1 other • 2 days ago • 6

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

2 days ago • 1

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

and 1 other • 3 days ago • 17

Announcing AI Energy Score Ratings

3 days ago • 21

🌁#87: Why DeepResearch Should Be Your New Hire

3 days ago • 4

Prompt Engineering in Multi-Agent Systems with KaibanJS

3 days ago

Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset

3 days ago • 27

Open R1: Update #2

and 6 others • 3 days ago • 151

Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect

and 2 others • 3 days ago • 8

ROOST: Safety Tooling needs Open Tech🐓🤗

3 days ago • 5

Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems

and 1 other • 4 days ago • 9

Struggling to understand enterprise-scale codebase?

5 days ago • 2

Reasoning at the Forefront of Advanced AI Models : Mistral-Small-24B-Base-2501

5 days ago • 3

Design 101: A Historical and Theoretical Exploration of Graphic Arts and Design in the Age of AI

5 days ago • 1

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

6 days ago • 25

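Optimizing Bark with 🤗 Transformers

August 9, 2023 • 2

Fine-tuning Llama 2 with DPO

August 8, 2023 • 40

Huggy Lingo: Using Machine Learning to Improve Language Metadata on the Hugging Face Hub

August 2, 2023 • 1

Encrypted Large Language Models with FHE

August 2, 2023 guest • 13

Open-Sourcing the Knowledge Distillation Code and Weights for SD-Small and SD-Tiny

August 1, 2023 guest • 3

A Step-by-Step Guide to Generating 3D Assets with AI

August 1, 2023 • 6

Results of the First Open Source AI Game Jam

July 21, 2023

🤗 Diffusers Turns One!

July 20, 2023 • 2

Llama 2 Is Here: Try It on Hugging Face

July 18, 2023 • 25

The Open-Source Ecosystem for Text Generation and Large Language Models at Hugging Face

July 17, 2023 • 2

Fine-tuning Stable Diffusion Models on Intel CPUs

July 14, 2023

Deploying LLMs with Hugging Face Inference Endpoints

July 4, 2023 • 11

Accelerating the Vision-Language Model BridgeTower with Habana Gaudi2

June 29, 2023 • 2

Ethics and Society Newsletter #4: Bias in Text-to-Image Models

June 26, 2023 • 2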

Community Articles

Topic 27: What are Chain-of-Agents and Chain-of-RAG?

and 1 other • about 4 hours ago • 2

Adventures in AI

about 6 hours ago

Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios

and 1 other • 1 day ago • 7
