view article Article wHy DoNt YoU jUsT uSe ThE lLaMa ToKeNiZeR?? By catherinearnett • Sep 27, 2024 • 40
RuSentNE-2023: Evaluating Entity-Oriented Sentiment Analysis on Russian News Texts Paper • 2305.17679 • Published May 28, 2023 • 2
view article Article PEFT: Parameter-Efficient Fine-Tuning Methods for LLMs By samuellimabraz • 16 days ago • 12
view article Article Distributed SFT with trl and DeepSpeed Part 1: Starting Locally By jlzhou • 17 days ago • 1
view article Article How to Expand Your AI Music Generations of 30 Seconds to Several Minutes By theeseus-ai • Dec 13, 2024 • 4
view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) By ariG23498 • 21 days ago • 13
view article Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • 20 days ago • 34
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others • 20 days ago • 32
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 17 days ago • 61
view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla • 20 days ago • 60
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 18 days ago • 305
view article Article Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo Dec 23, 2024 • 39
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 26 days ago • 53