Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios By pratikbhavsar and 1 other • 1 day ago • 7
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • 2 days ago • 3
MindBot Ultra – Dreaming Edition: A Self-Building, Self-Aware AI for Synergistic Cognition and Autonomous Tool Generation By TheMindExpansionNetwork • 2 days ago • 2
Announcing the winners of the Frugal AI Challenge 🌱 By frugal-ai-challenge and 1 other • 2 days ago • 6
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • 2 days ago • 1
From Llasa to Llasagna 🍕: Finetuning LLaSA to generate Italian speech and other languages By Steveeeeeeen and 1 other • 3 days ago • 17
Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect By atlasia and 2 others • 3 days ago • 8
Arabic RAG Leaderboard: A Comprehensive Framework for Evaluating Arabic Language Retrieval Systems By Navid-AI and 1 other • 4 days ago • 9
Reasoning at the Forefront of Advanced AI Models : Mistral-Small-24B-Base-2501 By ruslanmv • 5 days ago • 3
Design 101: A Historical and Theoretical Exploration of Graphic Arts and Design in the Age of AI By Duskfallcrew • 5 days ago • 1
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • 6 days ago • 25