Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published 15 days ago • 59
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated 5 days ago • 48
GeoPixel Collection Pixel Grounding Large Multimodal Model in Remote Sensing • 3 items • Updated 5 days ago • 1
ArTST - Arabic Text Speech Transformer Collection Open source project for Arabic Speech Recognition and Generation • 11 items • Updated about 2 hours ago • 6
Step-Audio Collection Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 8 days ago • 28
The Ultimate Collection of Code Classifiers Collection 🔥 15 classifiers, 124M parameters, one per programming language, for assessing the educational value of GitHub code • 15 items • Updated 5 days ago • 10
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering? Paper • 2502.12115 • Published 8 days ago • 41
Article Aurora-M: The First Open Source Biden-Harris Executive Order Red teamed Multilingual Language Model By mayank-mishra • Apr 2, 2024 • 7
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 22 days ago • 107
Article CinePile 2.0 - making stronger datasets with adversarial refinement Oct 23, 2024 • 14
Article Efficient LLM Pretraining: Packed Sequences and Masked Attention By sirluk • Oct 7, 2024 • 17
Ultravox v0.5 Collection Ultravox is a multimodal Speech LLM built around different pretrained LLMs (frozen) and the whisper-large-v3-turbo (fine-tuned) backbone. • 3 items • Updated 15 days ago • 5