K - a diwank Collection

Try HuggingChat to chat with AI

jondurbin/py-dpo-v0.1

Viewer • Updated Jan 11, 2024 • 9.47k • 151 • 49

jondurbin/gutenberg-dpo-v0.1

Viewer • Updated Jan 12, 2024 • 918 • 1.05k • 134

jondurbin/cinematika-v0.1

Viewer • Updated Apr 11, 2024 • 47.1k • 277 • 54

ParisNeo/lollms_aware_dataset

Viewer • Updated Oct 27, 2023 • 464 • 84 • 5

grimulkan/LimaRP-augmented

Viewer • Updated Jan 24, 2024 • 804 • 35 • 29

TIGER-Lab/MathInstruct

Viewer • Updated May 15, 2024 • 262k • 3.37k • 270

christopher/rosetta-code

Viewer • Updated Sep 24, 2023 • 79k • 225 • 34

b-mc2/sql-create-context

Viewer • Updated Jan 25, 2024 • 78.6k • 2.05k • 441

migtissera/Synthia-v1.3

Viewer • Updated Nov 16, 2023 • 119k • 84 • 99

tinyBenchmarks/tinyMMLU

Viewer • Updated Jul 8, 2024 • 385 • 7.31k • 17

tinyBenchmarks/tinyWinogrande

Preview • Updated May 25, 2024 • 3k • 4

tinyBenchmarks/tinyAI2_arc

Preview • Updated May 25, 2024 • 2.51k • 3

tinyBenchmarks/tinyHellaswag

Viewer • Updated May 25, 2024 • 50k • 3.29k • 4

tinyBenchmarks/tinyTruthfulQA

Preview • Updated May 25, 2024 • 2.54k • 3

tinyBenchmarks/tinyAlpacaEval

Viewer • Updated Apr 19, 2024 • 100 • 54 • 5

tinyBenchmarks/tinyGSM8k

Preview • Updated May 25, 2024 • 2.57k • 5

cognitivecomputations/samantha-data

Updated Mar 29, 2024 • 1k • 127

roborovski/synthetic-tool-calls

Viewer • Updated Mar 5, 2024 • 6.01k • 43 • 1

roborovski/glaive-tool-usage-dpo

Viewer • Updated Feb 29, 2024 • 42k • 36 • 2

kalomaze/StackMix-v0.1

Viewer • Updated Feb 28, 2024 • 30 • 77 • 2

roborovski/glaive-function-calling-v2-conversation

Viewer • Updated Feb 19, 2024 • 113k • 36 • 2

mlabonne/truthy-dpo-v0.1

Viewer • Updated Feb 18, 2024 • 1.02k • 34 • 1

ai4bharat/indic-align

Viewer • Updated Jul 25, 2024 • 97.4M • 701 • 12

coseal/CodeUltraFeedback_binarized

Viewer • Updated Mar 18, 2024 • 9.5k • 198 • 17

coseal/CodeUltraFeedback

Viewer • Updated Mar 15, 2024 • 10k • 96 • 26

KTO: Model Alignment as Prospect Theoretic Optimization

Paper • 2402.01306 • Published Feb 2, 2024 • 16

ai4bharat/sangraha

Viewer • Updated Oct 21, 2024 • 268M • 8.89k • 39

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Paper • 2311.04205 • Published Nov 7, 2023 • 5

Multilingual Instruction Tuning With Just a Pinch of Multilinguality

Paper • 2401.01854 • Published Jan 3, 2024 • 11

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 64

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 192

Self-Instruct: Aligning Language Model with Self Generated Instructions

Paper • 2212.10560 • Published Dec 20, 2022 • 9

HuggingFaceH4/self-instruct-seed

Viewer • Updated Jan 31, 2023 • 175 • 48 • 27

ToolTalk: Evaluating Tool-Usage in a Conversational Setting

Paper • 2311.10775 • Published Nov 15, 2023 • 10

Dynamic Planning with a LLM

Paper • 2308.06391 • Published Aug 11, 2023 • 2

FreedomIntelligence/SocraticChat

Viewer • Updated Oct 12, 2023 • 50.7k • 53 • 8

Large Language Model as a User Simulator

Paper • 2308.11534 • Published Aug 21, 2023 • 2

Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning

Paper • 2309.10814 • Published Sep 19, 2023 • 3

AlpaGasus: Training A Better Alpaca with Fewer Data

Paper • 2307.08701 • Published Jul 17, 2023 • 23

mlabonne/alpagasus

Viewer • Updated Aug 3, 2023 • 9.23k • 77 • 8

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 35

THUDM/AgentInstruct

Viewer • Updated Oct 23, 2023 • 1.87k • 142 • 204

Diversity of Thought Improves Reasoning Abilities of Large Language Models

Paper • 2310.07088 • Published Oct 11, 2023 • 5

SmartPlay : A Benchmark for LLMs as Intelligent Agents

Paper • 2310.01557 • Published Oct 2, 2023 • 13

Large Language Models Cannot Self-Correct Reasoning Yet

Paper • 2310.01798 • Published Oct 3, 2023 • 35

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback

Paper • 2309.10691 • Published Sep 19, 2023 • 4

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

Paper • 2304.11477 • Published Apr 22, 2023 • 3

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14, 2024 • 76

SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning

Paper • 2308.00436 • Published Aug 1, 2023 • 22

664

UGI Leaderboard

📢

Display and filter leaderboard data

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning

Paper • 2310.16049 • Published Oct 24, 2023 • 4

Instruction-Following Evaluation for Large Language Models

Paper • 2311.07911 • Published Nov 14, 2023 • 20

allenai/UNcommonsense

Viewer • Updated Jan 19, 2024 • 18.3k • 59 • 10

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Paper • 2311.08469 • Published Nov 14, 2023 • 11

Flows: Building Blocks of Reasoning and Collaborating AI

Paper • 2308.01285 • Published Aug 2, 2023 • 2

aiflows/CCFlows

Updated Dec 10, 2023 • 2

Learning to Reason and Memorize with Self-Notes

Paper • 2305.00833 • Published May 1, 2023 • 5

Verify-and-Edit: A Knowledge-Enhanced Chain-of-Thought Framework

Paper • 2305.03268 • Published May 5, 2023 • 2

Making Large Language Models Better Reasoners with Alignment

Paper • 2309.02144 • Published Sep 5, 2023 • 2

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Paper • 2309.17382 • Published Sep 29, 2023 • 4

ALERT: Adapting Language Models to Reasoning Tasks

Paper • 2212.08286 • Published Dec 16, 2022 • 2

CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Paper • 2402.04858 • Published Feb 7, 2024 • 15

Vivacem/MMIQC

Viewer • Updated Jan 20, 2024 • 2.29M • 80 • 15

LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error

Paper • 2403.04746 • Published Mar 7, 2024 • 24

Learning to Decode Collaboratively with Multiple Language Models

Paper • 2403.03870 • Published Mar 6, 2024 • 21

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling

Paper • 2402.10466 • Published Feb 16, 2024 • 19

SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking

Paper • 2402.02285 • Published Feb 3, 2024 • 1

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27, 2024 • 24

Towards Optimal Learning of Language Models

Paper • 2402.17759 • Published Feb 27, 2024 • 17

Evaluating Very Long-Term Conversational Memory of LLM Agents

Paper • 2402.17753 • Published Feb 27, 2024 • 19

Aman279/Locomo

Viewer • Updated Mar 7, 2024 • 35 • 9 • 1

Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15, 2024 • 54

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20, 2024 • 48

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22, 2024 • 82

Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Paper • 2402.14083 • Published Feb 21, 2024 • 47

PerLTQA: A Personal Long-Term Memory Dataset for Memory Classification, Retrieval, and Synthesis in Question Answering

Paper • 2402.16288 • Published Feb 26, 2024 • 1

pandalla/Machine_Mindset_MBTI_dataset

Viewer • Updated Jun 4, 2024 • 161k • 234 • 58

berkeley-nest/Nectar

Viewer • Updated Mar 20, 2024 • 183k • 684 • 287

totally-not-an-llm/sharegpt-hyperfiltered-3k

Viewer • Updated Jul 13, 2023 • 3.24k • 72 • 14

HuggingFaceTB/cosmopedia

Viewer • Updated Aug 12, 2024 • 31.1M • 8.96k • 580

argilla/ultrafeedback-binarized-preferences-cleaned

Viewer • Updated Dec 11, 2023 • 60.9k • 3.75k • 134

dmayhem93/self-critiquing-refine

Viewer • Updated Apr 8, 2023 • 39.2k • 35 • 1

dmayhem93/self-critiquing-critique-and-refine

Viewer • Updated Apr 8, 2023 • 39.2k • 37 • 1

morzecrew/RefinedPersonaChat

Viewer • Updated Aug 7, 2023 • 207k • 29 • 2

beratcmn/rephrased-instruction-turkish-poems

Viewer • Updated Dec 16, 2023 • 4.96k • 63 • 4

Birchlabs/openai-prm800k-stepwise-critic

Viewer • Updated Jun 3, 2023 • 1.09M • 65 • 44

theblackcat102/evol-codealpaca-v1

Viewer • Updated Mar 10, 2024 • 111k • 683 • 158

meta-math/GSM8K_Backward

Viewer • Updated Nov 10, 2023 • 1.27k • 75 • 16

meta-math/MetaMathQA-40K

Viewer • Updated Nov 10, 2023 • 40k • 120 • 24

glaiveai/glaive-code-assistant-v2

Viewer • Updated Apr 4, 2024 • 215k • 135 • 44

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study

Paper • 2403.03186 • Published Mar 5, 2024 • 5

PROC2PDDL: Open-Domain Planning Representations from Texts

Paper • 2403.00092 • Published Feb 29, 2024 • 1

btan2/cappy-large

Text Classification • Updated Dec 7, 2023 • 20 • 20

VMware/open-instruct

Viewer • Updated Jul 12, 2023 • 143k • 100 • 44

QizhiPei/BioT5_finetune_dataset

Viewer • Updated Sep 2, 2024 • 33 • 408 • 6

Tensoic/gooftagoo

Viewer • Updated Mar 16, 2024 • 16.2k • 62 • 9

GenVRadmin/Aryabhatta-Orca-Maths-Hindi

Viewer • Updated Mar 18, 2024 • 200k • 32 • 3

Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration

Paper • 2310.00280 • Published Sep 30, 2023 • 3

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Paper • 2311.05997 • Published Nov 10, 2023 • 37

wangwilliamyang/wikihow

Updated Jan 18, 2024 • 8

argilla/distilabel-capybara-kto-15k-binarized

Viewer • Updated Mar 19, 2024 • 15.1k • 63 • 5

argilla/ultrafeedback-binarized-preferences-cleaned-kto

Viewer • Updated Mar 19, 2024 • 231k • 59 • 9

argilla/distilabel-intel-orca-kto

Viewer • Updated Mar 19, 2024 • 23.1k • 65 • 7

argilla/kto-mix-15k

Viewer • Updated Apr 19, 2024 • 15.3k • 66 • 13

KnutJaegersberg/dolphin_orca_clustered

Updated Sep 14, 2023 • 39 • 1

GAIR/autoj-scenario-classifier

Text Generation • Updated Oct 9, 2023 • 17 • 5

Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 73

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 185

Ask Optimal Questions: Aligning Large Language Models with Retriever's Preference in Conversational Search

Paper • 2402.11827 • Published Feb 19, 2024 • 1

Grounding Language Model with Chunking-Free In-Context Retrieval

Paper • 2402.09760 • Published Feb 15, 2024

Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Paper • 2403.12881 • Published Mar 19, 2024 • 17

BAAI/OPI

Preview • Updated Nov 6, 2024 • 196 • 8

internlm/Agent-FLAN

Preview • Updated Mar 20, 2024 • 84 • 70

kaist-ai/selfee-train

Viewer • Updated May 31, 2023 • 178k • 54 • 9

fabiochiu/medium-articles

Preview • Updated Jul 17, 2022 • 277 • 23

Reverse Training to Nurse the Reversal Curse

Paper • 2403.13799 • Published Mar 20, 2024 • 13

voidful/MuSiQue

Preview • Updated May 20, 2023 • 4 • 4

BAAI/bge-reranker-v2-m3

Text Classification • Updated Jun 24, 2024 • 1.01M • 512

allenai/reward-bench

Viewer • Updated Sep 9, 2024 • 8.11k • 8.22k • 87

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 23

In-Context Learning Creates Task Vectors

Paper • 2310.15916 • Published Oct 24, 2023 • 43

Are Emergent Abilities in Large Language Models just In-Context Learning?

Paper • 2309.01809 • Published Sep 4, 2023 • 3

ZenMoore/RoleBench

Preview • Updated Nov 23, 2023 • 332 • 76

LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25, 2024 • 65

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 64

princeton-nlp/QuRatedPajama-260B

Viewer • Updated Apr 16, 2024 • 254M • 232 • 7

Arcee's MergeKit: A Toolkit for Merging Large Language Models

Paper • 2403.13257 • Published Mar 20, 2024 • 20

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22, 2024 • 32

Locutusque/OpenCerebrum-dpo

Viewer • Updated Mar 26, 2024 • 21.1k • 61 • 6

Doctor-Shotgun/theory-of-mind-dpo

Viewer • Updated Mar 14, 2024 • 539 • 49 • 16

Locutusque/arc-cot-dpo

Viewer • Updated Mar 26, 2024 • 957 • 42 • 6

fblgit/simple-math-DPO

Viewer • Updated Aug 1, 2024 • 800k • 117 • 16

KrisPi/PythonTutor-Evol-1k-DPO-GPT4_vs_35

Viewer • Updated Nov 18, 2023 • 943 • 33 • 14

zerolink/zsql-postgres-dpo

Viewer • Updated Feb 2, 2024 • 259k • 72 • 7

Lakera/gandalf_ignore_instructions

Viewer • Updated Oct 2, 2023 • 1k • 182 • 27

mrm8488/unnatural-instructions-full

Viewer • Updated Dec 21, 2022 • 66k • 70 • 16

NilanE/SmallParallelDocs-Ja_En-6k

Viewer • Updated Mar 5, 2024 • 6.32k • 58 • 2

Long-form factuality in large language models

Paper • 2403.18802 • Published Mar 27, 2024 • 25

NousResearch/OLMo-Bitnet-1B

Text Generation • Updated Apr 11, 2024 • 197 • 118

pyp1/VoiceCraft

Text-to-Speech • Updated Aug 21, 2024 • 42 • 211

CarperAI/openai_summarize_comparisons

Viewer • Updated Feb 27, 2023 • 260k • 1.58k • 40

PygmalionAI/PIPPA

Updated Sep 7, 2023 • 126 • 209

ivanleomk/gpt4-chain-of-density

Preview • Updated Nov 12, 2023 • 75 • 6

AIRI-NLP/cnli_memory_extracted

Viewer • Updated Mar 22, 2024 • 8.23k • 35 • 1

Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs

Paper • 2311.05657 • Published Nov 9, 2023 • 32

openbmb/UltraInteract_sft

Viewer • Updated Apr 5, 2024 • 289k • 26.7k • 121

openbmb/UltraInteract_pair

Viewer • Updated Apr 5, 2024 • 220k • 310 • 108

openbmb/Eurus-70b-nca

Text Generation • Updated Apr 12, 2024 • 34 • 11

Noise Contrastive Alignment of Language Models with Explicit Rewards

Paper • 2402.05369 • Published Feb 8, 2024 • 1

ai2lumos/lumos_multimodal_ground_iterative

Viewer • Updated Mar 19, 2024 • 15.9k • 48 • 1

ai2lumos/lumos_multimodal_plan_iterative

Viewer • Updated Mar 19, 2024 • 15.9k • 50 • 2

ai2lumos/lumos_complex_qa_plan_onetime

Viewer • Updated Mar 19, 2024 • 19.4k • 60 • 3

ai2lumos/lumos_complex_qa_ground_onetime

Viewer • Updated Mar 19, 2024 • 19.2k • 62 • 3

ai2lumos/lumos_complex_qa_ground_iterative

Viewer • Updated Mar 19, 2024 • 19.1k • 42 • 2

ai2lumos/lumos_unified_plan_iterative

Viewer • Updated Mar 19, 2024 • 55.4k • 70 • 2

ai2lumos/lumos_complex_qa_plan_iterative

Viewer • Updated Mar 18, 2024 • 19k • 61 • 6

ai2lumos/lumos_unified_ground_iterative

Viewer • Updated Mar 19, 2024 • 55.5k • 55 • 2

ai2lumos/lumos_web_agent_ground_iterative

Viewer • Updated Mar 18, 2024 • 1.01k • 66 • 2

ai2lumos/lumos_web_agent_plan_iterative

Viewer • Updated Mar 18, 2024 • 1.01k • 44 • 4

ai2lumos/lumos_maths_ground_iterative

Viewer • Updated Mar 18, 2024 • 19.5k • 56 • 3

ai2lumos/lumos_maths_ground_onetime

Viewer • Updated Mar 18, 2024 • 19.8k • 39 • 1

ai2lumos/lumos_maths_plan_onetime

Viewer • Updated Mar 18, 2024 • 19.8k • 56 • 2

Symbol-LLM/Symbol-LLM-7B-Instruct

Text Generation • Updated Jun 23, 2024 • 58 • 13

MoritzLaurer/deberta-v3-large-zeroshot-v2.0

Zero-Shot Classification • Updated Apr 11, 2024 • 66.8k • • 89

MoritzLaurer/bge-m3-zeroshot-v2.0

Zero-Shot Classification • Updated Apr 22, 2024 • 69.5k • 46

What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning

Paper • 2312.15685 • Published Dec 25, 2023 • 16

Pavithree/eli5

Viewer • Updated Apr 23, 2022 • 229k • 164 • 2

vicgalle/configurable-system-prompt-multitask

Viewer • Updated Apr 23, 2024 • 1.95k • 240 • 22

paraloq/json_data_extraction

Viewer • Updated Mar 25, 2024 • 484 • 80 • 20

livecodebench/execution

Viewer • Updated Mar 12, 2024 • 479 • 79 • 4

iamtarun/python_code_instructions_18k_alpaca

Viewer • Updated Jul 27, 2023 • 18.6k • 1.55k • 283

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Paper • 2403.15042 • Published Mar 22, 2024 • 26

manishiitg/CogStack-QA

Viewer • Updated Feb 9, 2024 • 24.7k • 30 • 1

manishiitg/CogStack-Tasks

Viewer • Updated Feb 9, 2024 • 4.69k • 37 • 1

manishiitg/CogStack-Conv

Viewer • Updated Feb 9, 2024 • 2.35k • 36 • 1

Reformatted Alignment

Paper • 2402.12219 • Published Feb 19, 2024 • 18

abacusai/SystemChat-1.1

Viewer • Updated Apr 11, 2024 • 20.2k • 60 • 31

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10, 2024 • 106

Anthropic/persuasion

Viewer • Updated Apr 9, 2024 • 3.94k • 419 • 183

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 90

M4-ai/prm_dpo_pairs

Viewer • Updated Jul 1, 2024 • 93.9k • 61 • 7

OpenLLM-France/Claire-Dialogue-French-0.1

Viewer • Updated Dec 5, 2023 • 37k • 246 • 44

amaydle/npc-dialogue

Viewer • Updated Mar 25, 2023 • 1.92k • 175 • 16

facebook/empathetic_dialogues

Updated Jan 18, 2024 • 1.36k • 97

Salesforce/dialogstudio

Updated 26 days ago • 954 • 219

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4, 2024 • 61

microsoft/Taskbench

Viewer • Updated Aug 21, 2024 • 17.3k • 465 • 24

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 84

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4, 2024 • 25

mlabonne/orpo-dpo-mix-40k

Viewer • Updated Oct 17, 2024 • 44.2k • 773 • 275

allenai/persona-bias

Updated Feb 5, 2024 • 25 • 11

PleIAs/YouTube-Commons

Updated Jun 26, 2024 • 734 • 338

FreedomIntelligence/evol-instruct-hindi

Viewer • Updated Aug 6, 2023 • 59k • 20 • 2

FreedomIntelligence/OVM-process

Viewer • Updated Apr 1, 2024 • 7.47k • 38 • 1

nuprl/CanItEdit

Viewer • Updated Mar 19, 2024 • 105 • 273 • 12

totally-not-an-llm/EverythingLM-data-V3

Viewer • Updated Sep 11, 2023 • 1.07k • 50 • 31

RUCAIBox/Story-Generation

Updated Mar 3, 2023 • 76 • 12

fabraz/writingPromptAug

Viewer • Updated Oct 14, 2023 • 24.1k • 124 • 2

jerryjalapeno/nart-100k-synthetic

Viewer • Updated Jul 16, 2023 • 99.1k • 160 • 41

jat-project/jat-dataset

Viewer • Updated Feb 16, 2024 • 258M • 348k • 37

euclaise/ReMask-3B

Text Generation • Updated Aug 10, 2024 • 113 • 15

google/Synthetic-Persona-Chat

Viewer • Updated Mar 1, 2024 • 10.9k • 2.3k • 97

google/cvss

Updated Feb 10, 2024 • 116 • 13

neural-bridge/rag-dataset-12000

Viewer • Updated Feb 5, 2024 • 12k • 1.46k • 129

HannahRoseKirk/prism-alignment

Viewer • Updated Apr 25, 2024 • 77.9k • 1.08k • 81

Gigax/NPC-LLM-3_8B

Text Generation • Updated May 14, 2024 • 76 • 24

nuprl/MultiPL-T

Viewer • Updated Aug 20, 2024 • 215k • 125 • 7

cognitivecomputations/SystemChat-1.2

Viewer • Updated Apr 30, 2024 • 52 • 34 • 6

mlabonne/arena-preferences

Viewer • Updated Apr 27, 2024 • 2.69k • 50 • 9

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

Paper • 2401.06532 • Published Jan 12, 2024 • 12

Flexibly Scaling Large Language Models Contexts Through Extensible Tokenization

Paper • 2401.07793 • Published Jan 15, 2024 • 3

yutaozhu94/INTERS

Preview • Updated Feb 19, 2024 • 836 • 12

THUDM/CogAgent

Updated Dec 18, 2023 • 17

urchade/gliner_large-v2.1

Token Classification • Updated Apr 10, 2024 • 6.78k • 31

shachardon/ShareLM

Viewer • Updated Aug 6, 2024 • 331k • 158 • 29

nvidia/ChatQA-Training-Data

Viewer • Updated Jun 4, 2024 • 442k • 880 • 167

lightblue/tagengo-gpt4

Viewer • Updated Jun 2, 2024 • 78.1k • 120 • 63

Efficient-Large-Model/Llama-3-VILA1.5-8B

Text Generation • Updated Aug 16, 2024 • 2.29k • 31

bigcode/commitpackft

Viewer • Updated Aug 20, 2023 • 702k • 5.36k • 65

glaiveai/glaive-code-assistant-v3

Viewer • Updated May 20, 2024 • 950k • 140 • 46

davanstrien/cosmochat

Viewer • Updated May 10, 2024 • 199 • 47 • 12

davanstrien/cosmopedia_chat

Viewer • Updated Mar 8, 2024 • 1.19k • 52 • 7

MemGPT/MSC-Self-Instruct

Viewer • Updated Nov 2, 2023 • 500 • 147 • 11

MemGPT/qa_data

Viewer • Updated Feb 6, 2024 • 18.6k • 25 • 1

google/imageinwords

Updated May 25, 2024 • 291 • 117

grammarly/coedit

Viewer • Updated Oct 21, 2023 • 70.8k • 915 • 67

bea2019st/wi_locness

Updated Jan 18, 2024 • 133 • 14

GEM/FairytaleQA

Viewer • Updated Oct 25, 2022 • 10.6k • 218 • 8

grammarly/medit

Viewer • Updated Oct 1, 2024 • 113k • 152 • 13

MemGPT/MemGPT-DPO-Dataset

Viewer • Updated Apr 18, 2024 • 42.3k • 67 • 9

lmarena-ai/arena-human-preference-55k

Viewer • Updated May 17, 2024 • 57.5k • 664 • 141

princeton-nlp/QuRating-GPT3.5-Judgments

Viewer • Updated Mar 29, 2024 • 250k • 54 • 6

princeton-nlp/AutoCompressor-Llama-2-7b-6k

Updated Nov 22, 2023 • 63 • 2

H-D-T/Select-Stack

Viewer • Updated Sep 2, 2024 • 1.46M • 54 • 16

EleutherAI/lichess-puzzles

Viewer • Updated May 9, 2024 • 1.48M • 100 • 20

selfrag/selfrag_train_data

Viewer • Updated Oct 31, 2023 • 146k • 79 • 70

community-datasets/yahoo_answers_topics

Viewer • Updated Jun 24, 2024 • 1.46M • 1.97k • 54

TIGER-Lab/MMLU-Pro

Viewer • Updated Nov 27, 2024 • 12.1k • 35.3k • 321

ylacombe/expresso

Viewer • Updated Apr 30, 2024 • 11.6k • 283 • 37

microsoft/MeetingBank-QA-Summary

Viewer • Updated May 16, 2024 • 862 • 90 • 14

microsoft/MeetingBank-LLMCompressed

Viewer • Updated May 16, 2024 • 5.17k • 50 • 15

nvidia/ChatRAG-Bench

Viewer • Updated May 24, 2024 • 34.6k • 1.69k • 108

xingyaoww/code-act

Viewer • Updated Feb 5, 2024 • 78.4k • 115 • 53

kaist-ai/Multifaceted-Collection-ORPO

Viewer • Updated Jul 1, 2024 • 64.6k • 66 • 10

Alibaba-NLP/gte-Qwen2-7B-instruct

hwjiang/Real3D

Image-to-3D • Updated Jun 14, 2024 • 11 • 18

nvidia/Aegis-AI-Content-Safety-Dataset-1.0

Viewer • Updated Jun 28, 2024 • 12k • 986 • 49

ProGamerGov/synthetic-dataset-1m-dalle3-high-quality-captions

Updated Oct 30, 2024 • 1.24k • 124

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Paper • 2406.08418 • Published Jun 12, 2024 • 29

facebook/multi-token-prediction

Updated Jun 18, 2024 • 363

TIGER-Lab/M-BEIR

Viewer • Updated Aug 7, 2024 • 2.86M • 1.33k • 16

tomg-group-umd/pixelprose

Viewer • Updated Jun 23, 2024 • 15.6M • 390 • 143

mit-han-lab/ShareGPT4V

Preview • Updated Feb 22, 2024 • 115 • 3

mit-han-lab/litepose

Updated Jun 5, 2024 • 1

mit-han-lab/Llama-3-8B-Instruct-QServe-g128

Text Generation • Updated May 6, 2024 • 94 • 1

internlm/internlm-xcomposer2-vl-7b

Visual Question Answering • Updated Apr 12, 2024 • 1.25k • 80

OpenGVLab/InternViT-6B-448px-V1-5

Image Feature Extraction • Updated Dec 9, 2024 • 878 • 79

OpenGVLab/InternVL-Chat-V1-5

Image-Text-to-Text • Updated 15 days ago • 5.06k • 408

OpenGVLab/Mini-InternVL-Chat-4B-V1-5

Image-Text-to-Text • Updated 15 days ago • 459 • 61

openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • Updated Jan 15 • 30k • 1.39k

microsoft/Florence-2-large

Image-Text-to-Text • Updated Dec 8, 2024 • 779k • 1.41k

llava-hf/LLaVA-NeXT-Video-7B-DPO-hf

Video-Text-to-Text • Updated 23 days ago • 1.88k • 9

arcee-ai/BAAI-Infinity-Instruct-System

Viewer • Updated Jun 24, 2024 • 2.36M • 134 • 15

hpcai-tech/OpenSora-VAE-v1.2

Updated Jun 17, 2024 • 36k • 57

hpcai-tech/OpenSora-STDiT-v3

Updated Jun 17, 2024 • 25.1k • 46

liuqi6777/RankGPT-msmarco-100k-clean

Viewer • Updated Feb 6, 2024 • 87.3k • 51 • 1

failspy/Meta-Llama-3-70B-Instruct-abliterated-v3.5

Text Generation • Updated May 30, 2024 • 3.79k • 42

ResplendentAI/NSFW_RP_Format_DPO

Viewer • Updated Mar 17, 2024 • 400 • 146 • 66

microsoft/msr_text_compression

Updated Jan 18, 2024 • 82 • 8

microsoft/msr_sqa

Updated Jan 18, 2024 • 98 • 4

microsoft/crd3

Updated Jan 18, 2024 • 152 • 24

nvidia/domain-classifier

Updated 26 days ago • 134k • 74

jhu-clsp/FollowIR-train

Viewer • Updated Mar 25, 2024 • 1.78k • 64 • 5

vicgalle/Phudge-3

Text Classification • Updated May 30, 2024 • 11 • 3

TWO/sutra-mlt256-v2

Updated May 24, 2024 • 9

AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation

Paper • 2406.19251 • Published Jun 27, 2024 • 9

aiana94/xMINDlarge

Viewer • Updated Oct 25, 2024 • 4.12M • 159 • 4

OpenCo7/UpVoteWeb

Viewer • Updated Jul 17, 2024 • 557M • 508 • 94

davanstrien/magpie-preference

Viewer • Updated 1 day ago • 527 • 844 • 13

FunAudioLLM/SenseVoiceSmall

Updated Jul 31, 2024 • 1.58k • 221

euclaise/gsm8k_multiturn

Viewer • Updated Jul 6, 2024 • 8.79k • 51 • 13

internlm/internlm-xcomposer2d5-7b

Visual Question Answering • Updated Jul 22, 2024 • 670k • 199

dell-research-harvard/newswire

Viewer • Updated Jul 2, 2024 • 1.44M • 358 • 70

alexshengzhili/SciGraphQA-295K-train

Viewer • Updated Aug 8, 2023 • 296k • 76 • 11

xinsir/controlnet-union-sdxl-1.0

Text-to-Image • Updated Jul 30, 2024 • 90.4k • 1.3k

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Paper • 2406.19223 • Published Jun 27, 2024 • 9

laion/links_to_pocasts_lecture_and_shows_for_tts

Viewer • Updated May 29, 2024 • 331k • 15 • 8

laion/datacomp-hq

Viewer • Updated Mar 13, 2024 • 20.7M • 187 • 12

laion/Subjects-for-curricular

Viewer • Updated Dec 20, 2023 • 3.99M • 79 • 5

laion/strategic_game_maze

Viewer • Updated Oct 20, 2023 • 345M • 22.2k • 11

mlabonne/llmtwin

Viewer • Updated Aug 27, 2024 • 3.34k • 97 • 10

Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

Paper • 2407.07053 • Published Jul 9, 2024 • 44

NovaSearch/stella_en_400M_v5

NovaSearch/stella_en_1.5B_v5

RhapsodyAI/MiniCPM-V-Embedding-preview

Feature Extraction • Updated Aug 20, 2024 • 199 • 48

agentsea/wave-ui-25k

Viewer • Updated Jul 3, 2024 • 25k • 312 • 23

TencentARC/StoryStream

Preview • Updated Jul 17, 2024 • 249 • 25

apple/DCLM-7B

Updated Jul 26, 2024 • 647 • 831

HuggingFaceTB/smollm-corpus

Viewer • Updated Sep 6, 2024 • 237M • 8.75k • 298

HuggingFaceTB/bisac-topics

Viewer • Updated Apr 3, 2024 • 5.5k • 17 • 2

From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients

Paper • 2407.11239 • Published Jul 15, 2024 • 8

mistralai/Mistral-Nemo-Base-2407

Text Generation • Updated Nov 6, 2024 • 32.6k • 294

TencentARC/SEED-Story

Text-to-Image • Updated Aug 26, 2024 • 15 • 26

xlangai/BRIGHT

Viewer • Updated Nov 18, 2024 • 1.35M • 2.56k • 21

glaiveai/RAG-v1

Viewer • Updated Jun 25, 2024 • 51.4k • 106 • 71

QuietImpostor/Claude-3-Opus-Claude-3.5-Sonnnet-9k

Viewer • Updated Jun 30, 2024 • 9.94k • 67 • 19

PawanKrd/gpt-4o-200k

Viewer • Updated Jun 29, 2024 • 200k • 30 • 24

kalomaze/Opus_Instruct_3k

Viewer • Updated Jul 19, 2024 • 2.95k • 67 • 25

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Paper • 2206.07643 • Published Jun 15, 2022 • 1

Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Paper • 2303.15256 • Published Mar 27, 2023 • 1

fireworks-ai/llama-3-firefunction-v2

Text Generation • Updated Jun 18, 2024 • 126 • 142

Stateful Memory-Augmented Transformers for Dialogue Modeling

Paper • 2209.07634 • Published Sep 15, 2022 • 1

cognitivecomputations/SystemChat-2.0

Preview • Updated May 31, 2024 • 116 • 57

CollectiveCognition/chats-data-2023-10-16

Viewer • Updated Oct 16, 2023 • 200 • 33 • 21

Izazk/Sequence-of-action-prediction-mind2web

Viewer • Updated Feb 22, 2024 • 68.9k • 56 • 4

BigAction/mind2web_clean

Viewer • Updated Apr 25, 2024 • 199 • 44 • 4

osunlp/Mind2Web

Viewer • Updated Jul 19, 2023 • 253 • 357 • 99

magicgh/MT-Mind2Web

Viewer • Updated Feb 23, 2024 • 259 • 95 • 2

TencentARC/PhotoMaker-V2

Text-to-Image • Updated Jul 22, 2024 • 24.3k • 136

KevSun/Personality_LM

Text Classification • Updated Jul 29, 2024 • 15.9k • 21

250

Infinite Dataset Hub

♾

Search and save datasets generated with a LLM in real time

chargoddard/SlimOrcaDedupCleaned-Sonnet3.5-DPO

Viewer • Updated Jul 23, 2024 • 168k • 39 • 7

nvidia/Minitron-8B-Base

Text Generation • Updated 5 days ago • 8.24k • 63

mlfoundations/MINT-1T-HTML

Viewer • Updated Sep 21, 2024 • 623M • 198k • 82

mlfoundations/MINT-1T-ArXiv

Viewer • Updated Sep 19, 2024 • 5.6M • 3.85k • 48

mlfoundations/MINT-1T-PDF-CC-2024-18

Updated Sep 19, 2024 • 6.97k • 19

AI-MO/NuminaMath-TIR

Viewer • Updated Nov 25, 2024 • 72.5k • 16.5k • 100

DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection

Paper • 2406.00856 • Published Jun 2, 2024 • 12

mlabonne/FineTome-100k

Viewer • Updated Jul 29, 2024 • 100k • 9.88k • 162

LiruiZhao/Diffree

Image-to-Image • Updated Jul 29, 2024 • 27 • 18

BAAI/bge-multilingual-gemma2

Feature Extraction • Updated Jul 31, 2024 • 147k • 170

BAAI/bge-reranker-v2.5-gemma2-lightweight

Text Classification • Updated Sep 6, 2024 • 2.08k • 45

BAAI/IndustryCorpus

Viewer • Updated Jul 23, 2024 • 595M • 760 • 51

jspringer/echo-mistral-7b-instruct-lasttoken

Feature Extraction • Updated Feb 26, 2024 • 442 • 6

BAAI/bge-en-icl

Feature Extraction • Updated Jan 15 • 23.8k • 123

AlekseyKorshuk/full_user_edit_responses-clean

Viewer • Updated Mar 30, 2023 • 364k • 29 • 1

m-a-p/MMRA

Viewer • Updated Jul 31, 2024 • 1.02k • 86 • 13

m-a-p/II-Bench

Viewer • Updated Jun 29, 2024 • 1.43k • 452 • 10

BEE-spoke-data/fineweb-1000_64k

Viewer • Updated Jun 23, 2024 • 2k • 45 • 4

Salesforce/xgen-mm-phi3-mini-instruct-r-v1

Image-Text-to-Text • Updated 16 days ago • 1.24k • 186

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16, 2024 • 1.78M • • 8.86k

black-forest-labs/FLUX.1-schnell

Text-to-Image • Updated Aug 16, 2024 • 1.08M • • 3.41k

numind/NuExtract

Text Generation • Updated Oct 17, 2024 • 1.26k • 216

numind/NuSentiment-multilingual

Feature Extraction • Updated Jan 26, 2024 • 582 • 11

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • Updated Dec 2, 2024 • 50.5k • 267

aipicasso/megalith-10m-florence2

Viewer • Updated Jul 31, 2024 • 9.14M • 122 • 23

ZhengPeng7/BiRefNet

Image Segmentation • Updated 14 days ago • 588k • 321

nvidia/quality-classifier-deberta

Updated 19 days ago • 19k • 56

Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

Paper • 2408.04093 • Published Aug 7, 2024 • 4

tiiuae/falcon-mamba-7b-4bit

Text Generation • Updated Oct 10, 2024 • 100 • 11

nisten/all-human-diseases

Viewer • Updated Aug 19, 2024 • 2.2k • 117 • 106

THUDM/LongWriter-6k

Viewer • Updated Aug 14, 2024 • 6k • 272 • 173

anthracite-org/Stheno-Data-Filtered

Viewer • Updated Aug 18, 2024 • 31.1k • 21 • 14

anthracite-org/kalo-opus-instruct-22k-no-refusal

Viewer • Updated Aug 13, 2024 • 22.3k • 113 • 27

anthracite-org/nopm_claude_writing_fixed

Viewer • Updated Aug 18, 2024 • 6.35k • 95 • 13

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • Updated Sep 26, 2024 • 295k • • 663

microsoft/Phi-3.5-MoE-instruct

Text Generation • Updated Oct 24, 2024 • 35.9k • • 555

fal/AuraFace-v1

Updated Aug 26, 2024 • 86

NexaAIDev/Squid

Updated Sep 3, 2024 • 51 • 34

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28, 2024 • 43

HuggingFaceTB/everyday-conversations-llama3.1-2k

Viewer • Updated 21 days ago • 2.38k • 395 • 94

NousResearch/hermes-function-calling-v1

Viewer • Updated Aug 30, 2024 • 11.6k • 587 • 238

multimodalart/product-design

Text-to-Image • Updated Sep 22, 2024 • 4.53k • • 36

novateur/WavTokenizer

Text-to-Speech • Updated Dec 2, 2024 • 49

facebook/sapiens

Updated Sep 20, 2024 • 8 • 233

Shakker-Labs/AWPortrait-FL

Text-to-Image • Updated Sep 5, 2024 • 170k • 443

sequelbox/Supernova

Viewer • Updated Sep 27, 2024 • 178k • 77 • 8

542

Vision Arena (Testing VLMs side-by-side)

🖼

Analyze images to detect and label objects

mattshumer/Reflection-Llama-3.1-70B

Text Generation • Updated Sep 24, 2024 • 630 • 1.71k

deepseek-ai/DeepSeek-V2.5

Text Generation • Updated Dec 11, 2024 • 4.88k • 698

deepseek-ai/ESFT-vanilla-lite

Text Generation • Updated Jul 23, 2024 • 561 • 11

yifeihu/TB-OCR-preview-0.1

Image-Text-to-Text • Updated Sep 6, 2024 • 439 • 131

gabrielmbmb/distilabel-reflection-tuning

Viewer • Updated Sep 6, 2024 • 5 • 78 • 56

TencentARC/Open-MAGVIT2

Image Feature Extraction • Updated Sep 9, 2024 • 12

openbmb/MiniCPM3-4B

Text Generation • Updated Nov 30, 2024 • 24.5k • 404

THUDM/LongCite-glm4-9b

Text Generation • Updated Dec 16, 2024 • 154 • 30

jinaai/reader-lm-1.5b

Text Generation • Updated Jan 17 • 1.37k • 589

Vchitect/Vchitect-2.0-2B

Text-to-Video • Updated Sep 15, 2024 • 68 • 38

tencent/DepthCrafter

Depth Estimation • Updated Sep 24, 2024 • 260k • 84

mistralai/Pixtral-12B-2409

Image-Text-to-Text • Updated Dec 26, 2024 • • 605

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • Updated 16 days ago • 199k • 1.39k

StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation

Paper • 2409.12576 • Published Sep 19, 2024 • 16

THUdyh/Oryx-7B

Text Generation • Updated Sep 25, 2024 • 248 • 11

THUdyh/Oryx-7B-Image

Text Generation • Updated Sep 23, 2024 • 891 • 3

THUdyh/Oryx-ViT

Image Classification • Updated Sep 23, 2024 • 5

BAAI/SegGPT

Updated Apr 21, 2023 • 18

Salesforce/fineweb_deduplicated

Viewer • Updated 16 days ago • 6.43B • 370 • 35

KbsdJames/Omni-MATH

Viewer • Updated Oct 12, 2024 • 4.43k • 2.66k • 77

BAAI/Emu3-Gen

Any-to-Any • Updated Oct 23, 2024 • 1.99k • 208

CultriX/elitebabes-flux

Text-to-Image • Updated Sep 20, 2024 • 653 • • 16

RED-AIGC/StoryMaker

Text-to-Image • Updated Nov 9, 2024 • 235 • 75

google/frames-benchmark

Viewer • Updated Oct 15, 2024 • 824 • 1.39k • 185

Anthropic/discrim-eval

Viewer • Updated Jan 5, 2024 • 18.9k • 787 • 45

facebook/sam2.1-hiera-large

Mask Generation • Updated Sep 24, 2024 • 700k • 68

Zyphra/Zamba2-2.7B-instruct

Text Generation • Updated 6 days ago • 701 • 82

princeton-nlp/Llama-3-8B-ProLong-512k-Instruct

Updated Oct 31, 2024 • 4.56k • 18

jxm/cde-small-v1

Feature Extraction • Updated 29 days ago • 85.7k • 285

PrincetonPLI/Instruct-SkillMix-SDD

Viewer • Updated Sep 9, 2024 • 8k • 55 • 5

THUDM/cogvlm2-llama3-caption

Video-Text-to-Text • Updated 28 days ago • 8.13k • 84

julien040/hacker-news-posts

Viewer • Updated Jun 6, 2023 • 4.01M • 112 • 6

princeton-nlp/Llama-3-8B-ProLong-512k-Base

Updated Oct 31, 2024 • 2.06k • 8

LLM360/TxT360

Preview • Updated Nov 8, 2024 • 417k • 223

bingbangboom/flux-waterscape

Text-to-Image • Updated Oct 10, 2024 • 956 • • 14

facebook/Self-taught-evaluator-DPO-data

Viewer • Updated 13 days ago • 57.5k • 48 • 32

facebook/layerskip-llama2-13B

Text Generation • Updated Oct 19, 2024 • 1.3k • 5

ibm-granite/granite-8b-code-instruct-accelerator

Updated May 29, 2024 • 15 • 1

peakji/steiner-32b-preview

Updated Oct 21, 2024 • 20 • 43

CohereForAI/aya-expanse-32b

Text Generation • Updated Dec 6, 2024 • 119k • 214

CohereForAI/aya-expanse-8b

Text Generation • Updated Dec 6, 2024 • 49.1k • 334

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training

Paper • 2410.08202 • Published Oct 10, 2024 • 4

McGill-NLP/FaithDial

Viewer • Updated Feb 5, 2023 • 32.3k • 242 • 17

relaxml/Llama-3.1-8b-Instruct-QTIP-4Bit

Updated Oct 28, 2024 • 74 • 2

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

Paper • 2410.09918 • Published Oct 13, 2024 • 3

GAIR/o1-journey

Viewer • Updated Oct 16, 2024 • 327 • 138 • 133

marcelbinz/Psych-101

Viewer • Updated Nov 2, 2024 • 60.1k • 154 • 42

nvidia/Nemotron-4-Mini-Hindi-4B-Base

Updated Oct 23, 2024 • 30 • 11

nvidia/Nemotron-4-Mini-Hindi-4B-Instruct

Updated Nov 15, 2024 • 21 • 18

Etched/oasis-500m

Updated Nov 4, 2024 • 175 • 446

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated 13 days ago • 349k • • 532

tencent/Tencent-Hunyuan-Large

Text Generation • Updated Jan 19 • 471 • 564

THUDM/webrl-llama-3.1-8b

Updated Nov 6, 2024 • 624 • 3

THUDM/webrl-glm-4-9b

Updated Nov 5, 2024 • 43 • 8

hbseong/HarmAug-Guard

Text Classification • Updated Oct 14, 2024 • 457 • 38

BAAI/IndustryCorpus2

Viewer • Updated Dec 17, 2024 • 826M • 1.71k • 48

di-zhang-fdu/OpenLongCoT-Pretrain

Viewer • Updated Oct 28, 2024 • 103k • 74 • 87

microsoft/maira-2

Text Generation • Updated about 8 hours ago • 45.9k • 44

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Paper • 2411.04997 • Published Nov 7, 2024 • 37

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1, 2024 • 1.05M • 3.15k • 427

Nexusflow/Athene-V2-Chat

Text Generation • Updated Nov 26, 2024 • 8.54k • 280

Nexusflow/Athene-V2-Agent

Text Generation • Updated Nov 21, 2024 • 493 • 125

numind/NuExtract-1.5-tiny

Text Generation • Updated Nov 18, 2024 • 1.57k • • 15

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 51

allenai/ACE2-ERA5

Updated Nov 21, 2024 • 2

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published Nov 21, 2024 • 13

nvidia/Hymba-1.5B-Base

Text Generation • Updated Jan 2 • 3.23k • 139

AIDC-AI/Marco-o1

Text Generation • Updated Nov 23, 2024 • 7.47k • 709

allenai/Llama-3.1-Tulu-3-70B

Text Generation • Updated 9 days ago • 5.65k • 54

nachoyawn/three-million-bluesky

Viewer • Updated Nov 28, 2024 • 3.01M • 129 • 10

huihui-ai/QwQ-32B-Preview-abliterated

Text Generation • Updated Nov 28, 2024 • 495 • 97

data-is-better-together/open-image-preferences-v1

Viewer • Updated Dec 9, 2024 • 8.67k • 9.47k • 24

showlab/ShowUI-desktop

Viewer • Updated Dec 17, 2024 • 7.5k • 874 • 22

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 43

nvidia/multilingual-domain-classifier

Updated 26 days ago • 703 • 14

TencentARC/Divot

Updated Dec 10, 2024 • 12 • 6

microsoft/RedStone

Updated Dec 5, 2024 • 32 • 32

ruliad/deepthought-8b-llama-v0.01-alpha

Text Generation • Updated Dec 7, 2024 • 813 • 143

TIGER-Lab/ScholarCopilot-v1

Updated Dec 8, 2024 • 16 • 4

TIGER-Lab/ScholarCopilot-Data-v1

Viewer • Updated Dec 15, 2024 • 677k • 115 • 2

facebook/sparsh-dino-base

Updated Oct 21, 2024 • 5

deepseek-ai/DeepSeek-V2.5-1210

Text Generation • Updated Dec 11, 2024 • 5.33k • 250

facebook/metamotivo-M-1

Updated Dec 12, 2024 • 1.22k • 7

deepseek-ai/DeepSeek-Prover-V1.5-RL

Updated Aug 29, 2024 • 13.7k • 47

tiiuae/Falcon3-10B-Base

Text Generation • Updated Dec 18, 2024 • 20.1k • 34

answerdotai/ModernBERT-base

Fill-Mask • Updated Jan 15 • 9.79M • 754

HuggingFaceTB/finemath

Viewer • Updated 13 days ago • 48.3M • 12.3k • 283

google/reveal

Viewer • Updated Apr 9, 2024 • 6.1k • 89 • 29

showlab/ShowUI-web

Viewer • Updated Jan 5 • 22k • 8.5k • 11

Writer/omniact

Updated Apr 29, 2024 • 941 • 34

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated Jan 13 • 14k • 533

notdiamond/notdiamond-0001

Text Classification • Updated Jul 30, 2024 • 145 • 137

MiniMaxAI/MiniMax-Text-01

Text Generation • Updated Jan 17 • 4.97k • 526

EvaByte/EvaByte-SFT

Updated 29 days ago • 192 • 35

bespokelabs/Bespoke-Stratos-17k

Viewer • Updated 20 days ago • 16.7k • 66.1k • 274

kyutai/hibiki-2b-rs-bf16

Translation • Updated 10 days ago • 3