Ron Wolf

ron-wolf

AI & ML interests

None yet

Recent Activity

updated a collection 5 days ago

upvoted a paper 5 days ago

How many words does ChatGPT know? The answer is ChatWords

liked a model 5 days ago

bartowski/DeepSeek-R1-Distill-Llama-70B-GGUF

Organizations

None yet

ron-wolf's activity

upvoted a paper 5 days ago

How many words does ChatGPT know? The answer is ChatWords

Paper • 2309.16777 • Published Sep 28, 2023 • 1

upvoted a collection 13 days ago

D_AU - Source files for GGUF, EXL2, AWQ, GPTQ, HQQ etc etc

Safetensor source files (by David_AU) to use directly and/or create different quants and/or merges. Link to GGUFS/full model card on each. • 77 items • Updated 3 days ago • 7

upvoted 2 papers 14 days ago

Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

Paper • 2305.02301 • Published May 3, 2023 • 3

Grokking at the Edge of Numerical Stability

Paper • 2501.04697 • Published 27 days ago • 1

upvoted an article 17 days ago

Finally, a Replacement for BERT: Introducing ModernBERT

Article • Dec 19, 2024 • 522

upvoted 2 collections 25 days ago

Mahou

flammen.ai's production model for casual conversation and character roleplay • 24 items • Updated Oct 14, 2024 • 4

Personal Favorites

Recommended models I use often or like for any reason. I recommend reading their cards for more details. • 10 items • Updated Dec 24, 2024 • 71

upvoted 2 papers 29 days ago

Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference

Paper • 2110.03742 • Published Sep 24, 2021 • 4

Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts

Paper • 2210.03885 • Published Oct 8, 2022 • 1

upvoted a collection 30 days ago

RPMax v1 Models

RPMax series of models with higher creativity and reduced repetition for "classic" RP chats. • 16 items • Updated Dec 6, 2024 • 18

upvoted 2 collections about 1 month ago

EVA Gen 0.0

RP/creative writing specialist models, trained on a curated mixture of natural and synthetic data. • 6 items • Updated Dec 29, 2024 • 3

Recommended large models

This collection contains some of the recent models larger than ~25B parameters that should be high quality and reliable • 15 items • Updated Nov 27, 2024 • 16

upvoted an article about 1 month ago

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Article • Dec 4, 2024 • 76

upvoted 2 collections about 2 months ago

Current daily driver's & Past favorites.

10 items • Updated 9 days ago • 12

Lumimaid

3 items • Updated Jul 26, 2024 • 18

upvoted an article about 2 months ago

🚨 ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming

Article • Jun 25, 2024 • 5

upvoted a collection about 2 months ago

LLM-Papers

114 items • Updated 3 days ago • 3

upvoted an article about 2 months ago

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 415

upvoted 2 papers about 2 months ago

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Paper • 2412.14161 • Published Dec 18, 2024 • 50

Accelerated Preference Optimization for Large Language Model Alignment

Paper • 2410.06293 • Published Oct 8, 2024 • 5