PatchFusion Release Collection PatchFusion is a framework boosting metric depth estimation models to high resolution • 7 items • Updated Mar 17 • 1
InverseCoder Collection Models and datasets for the paper "InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct". • 7 items • Updated Jul 11 • 1
Mamba2-In-Llama3 Collection Mamba2 distilled from Llama3 8B instruct. The Mamba in the Llama: Distilling and Accelerating Hybrid Models (https://arxiv.org/abs/2408.15237). • 4 items • Updated 10 days ago • 2
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale Paper • 2409.08264 • Published 7 days ago • 39
🧬 RegMix: Data Mixture as Regression Collection Automatic data mixture method for large language model pre-training • 10 items • Updated Jul 26 • 5
Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts Paper • 2406.12845 • Published Jun 18 • 1
PM-pair Collection This is a collection of materials for training a pairwise preference model. • 3 items • Updated May 10 • 2
SciRIFF Collection Data and models to enhance instruction-following for scientific literature understanding. • 9 items • Updated 22 days ago • 6
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct Paper • 2409.05840 • Published 10 days ago • 43
REBEL: Reinforcement Learning via Regressing Relative Reward Collection 10 items • Updated 18 days ago • 1
Standard-format-preference-dataset Collection We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8 • 19
SSMs Collection A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated Jul 17 • 23
Functionary V3.1 Collection Fine-tuning Llama-3.1 using Meta's original prompt template for function calling • 5 items • Updated Aug 19 • 2
UltraMedical Collection Datasets and Models of UltraMedical • 7 items • Updated 18 days ago • 1
Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback Paper • 2311.07215 • Published Nov 13, 2023 • 3
RetRobust Collection Making Retrieval-Augmented Language Models Robust to Irrelevant Context • 12 items • Updated Oct 1, 2023 • 1
Jamba-1.5 Collection The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated 28 days ago • 71
Skywork-Reward-Data-Collection Collection Open-source preference datasets used to train the Skywork reward model series • 16 items • Updated 9 days ago • 4
Skywork-Reward-Model Collection Skywork reward model series • 2 items • Updated 14 days ago • 1
OLMoE Collection Artifacts for open mixture-of-experts language models. • 13 items • Updated 5 days ago • 18
LLaVA-OneVision Collection a model good at arbitrary types of visual input • 15 items • Updated 8 days ago • 18
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders Paper • 2408.15998 • Published 22 days ago • 81
GeoDiffusion Collection A collection of GeoDiffusion checkpoints (https://kaichen1998.github.io/projects/geodiffusion/) • 8 items • Updated 17 days ago • 1
Meteor Collection Rationale-aided Efficient 7B size Large Language and Vision Models. Let's enjoy it! • 3 items • Updated Jun 13 • 1
TiC-CLIP Collection Benchmark for the design of efficient continual learning of image-text models over years. • 18 items • Updated Jun 19 • 5
AiM Collection Scalable Autoregressive Image Generation with Mamba • 4 items • Updated 23 days ago • 4
LLaVa-Interleave Collection LLaVa models that extend the model's capabilities to multi-image, multi-frame (video), and multi-patch (single-image) scenarios. • 3 items • Updated Jul 10 • 14
LLaVA-Onevision Collection LLaVa_Onevision models for single-image, multi-image, and video scenarios • 9 items • Updated 1 day ago • 8
MambaVision Collection MambaVision: A Hybrid Mamba-Transformer Vision Backbone. Includes tiny, tiny2, small, base, large and large2 variants. • 8 items • Updated Jul 24 • 12
STEER Style Transfer Collection Models and datasets for the STEER style transfer paper (Hallinan et al., 2023) • 2 items • Updated Jun 5 • 1
ESFT Collection Models for the paper on expert-specialized fine-tuning • 15 items • Updated Aug 16 • 2
Gemma HelpSteer Collection A work-in-progress collection of resources for a project to fine-tune Gemma 2 2B for helpfulness with HelpSteer2. • 5 items • Updated Aug 2 • 1
Finetune Open source LLMs Collection This collection contains an end-to-end workflow for fine-tuning open-source LLMs. • 2 items • Updated Aug 12 • 1
Preference Datasets for KTO Collection This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals. • 5 items • Updated Jul 30 • 14
Roleplay Deployment Models Collection Models I think are cool and worth using. A lot of what I make is intended only as part of a further merge or as a test; these are the others. • 13 items • Updated 16 days ago • 2