How many words does ChatGPT know? The answer is ChatWords Paper β’ 2309.16777 β’ Published Sep 28, 2023 β’ 1
D_AU - Source files for GGUF, EXL2, AWQ, GPTQ, HQQ etc etc Collection Safetensor source files (by David_AU) to use directly and/or create different quants and/or merges. Link to GGUFS/full model card on each. β’ 77 items β’ Updated 3 days ago β’ 7
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes Paper β’ 2305.02301 β’ Published May 3, 2023 β’ 3
Mahou Collection flammen.ai's production model for casual conversation and character roleplay β’ 24 items β’ Updated Oct 14, 2024 β’ 4
Personal Favorites Collection Recommended models I use often or like for any reason. I recommend reading their cards for more details. β’ 10 items β’ Updated Dec 24, 2024 β’ 71
Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference Paper β’ 2110.03742 β’ Published Sep 24, 2021 β’ 4
Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts Paper β’ 2210.03885 β’ Published Oct 8, 2022 β’ 1
RPMax v1 Models Collection RPMax series of models with higher creativity and reduced repetition for "classic" RP chats. β’ 16 items β’ Updated Dec 6, 2024 β’ 18
EVA Gen 0.0 Collection RP/creative writing specialist models, trained on a curated mixture of natural and synthetic data. β’ 6 items β’ Updated Dec 29, 2024 β’ 3
Recommended large models Collection This collection contains some of the recent models larger than ~25B parameters that should be high quality and reliable β’ 15 items β’ Updated Nov 27, 2024 β’ 16
view article Article πΊπ¦ββ¬ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram β’ Dec 4, 2024 β’ 76
view article Article π¨ ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming By sted97 β’ Jun 25, 2024 β’ 5
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Paper β’ 2412.14161 β’ Published Dec 18, 2024 β’ 50
Accelerated Preference Optimization for Large Language Model Alignment Paper β’ 2410.06293 β’ Published Oct 8, 2024 β’ 5