帖子、文章和讨论

在英特尔 Gaudi 2 上加速蛋白质语言模型 ProtST

由 2024年7月3日 • 2

Community Articles

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

about 1 hour ago

MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression

about 1 hour ago

syncIAL🍏

about 4 hours ago

Smol but Mighty: Can Small Models Reason well? 🤔

about 5 hours ago

AI Agents for Company Research: Automating Business Analysis with KaibanJS

about 9 hours ago

From Hippocrates to AI: Reflections on the Evolution of Consent

about 10 hours ago

🌁#86: Four Freedoms of Open AI

about 21 hours ago

How AI Agents Use the Jina URL to Markdown Tool in KaibanJS for Smarter Web Scraping

Integrating AI Multi-Agent Systems with the Make Webhook Tool in KaibanJS

OpenAI's Deep Research vs DeepSeek R1

Problem Solving with Language Models

🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows

🚀 Deploying OLMo-7B with Text Generation Inference (TGI) on Hugging Face Spaces

o3-mini vs Deepseek-R1

Activation Steering: A New Frontier in AI Control—But Does It Scale?

Open-R1: Update #1

The AHA Indicator

LLM Dataset Formats 101: A No‐BS Guide for Hugging Face Devs

Why we (don't) need export control

使用英特尔 Sapphire Rapids 加速 PyTorch Transformers 模型（第一部分）

由 2023年1月2日 • 2

更快的训练和推理：对比 Habana Gaudi®2 和英伟达 A100 80GB

由 2022年12月14日 • 1

基于 Habana Gaudi 的 Transformers 入门

由 2022年4月26日

Community Articles

A Review on the Evolvement of Load Balancing Strategy in MoE LLMs: Pitfalls and Lessons

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

about 1 hour ago

MLA: Redefining KV-Cache Through Low-Rank Projections and On-Demand Decompression

about 1 hour ago

syncIAL🍏

about 4 hours ago

Smol but Mighty: Can Small Models Reason well? 🤔

about 5 hours ago

AI Agents for Company Research: Automating Business Analysis with KaibanJS

about 9 hours ago

From Hippocrates to AI: Reflections on the Evolution of Consent

about 10 hours ago

🌁#86: Four Freedoms of Open AI

about 21 hours ago

How AI Agents Use the Jina URL to Markdown Tool in KaibanJS for Smarter Web Scraping

Integrating AI Multi-Agent Systems with the Make Webhook Tool in KaibanJS

OpenAI's Deep Research vs DeepSeek R1

Problem Solving with Language Models

🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows

🚀 Deploying OLMo-7B with Text Generation Inference (TGI) on Hugging Face Spaces

o3-mini vs Deepseek-R1

Activation Steering: A New Frontier in AI Control—But Does It Scale?

Open-R1: Update #1

The AHA Indicator

LLM Dataset Formats 101: A No‐BS Guide for Hugging Face Devs

Why we (don't) need export control