Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
267.6
TFLOPS
47
12
133
Daniel Han-Chen
danielhanchen
Follow
srinathkr07's profile picture
victor's profile picture
jed-tiotuico's profile picture
207 followers
·
110 following
https://unsloth.ai/
danielhanchen
AI & ML interests
None yet
Recent Activity
updated
a model
about 17 hours ago
unsloth/QVQ-72B-Preview-bnb-4bit
updated
a model
about 17 hours ago
unsloth/QVQ-72B-Preview
updated
a model
about 17 hours ago
unsloth/QVQ-72B-Preview
View all activity
Articles
Faster fine-tuning using TRL & Unsloth
Jan 10
•
42
Organizations
danielhanchen
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
unsloth/Llama-3.3-70B-Instruct
12 days ago
Big thanks for these "without original" uploads!
1
#1 opened 18 days ago by
jukofyork
New activity in
unsloth/gemma-2-27b-it-bnb-4bit
3 months ago
Aphrodite/VLLM/SGLang all refuse to load this model
2
#5 opened 4 months ago by
fullstack
New activity in
unsloth/gemma-7b-bnb-4bit
3 months ago
No module named 'triton'
1
#3 opened 3 months ago by
NeelM0906
New activity in
unsloth/Hermes-3-Llama-3.1-8B-bnb-4bit
4 months ago
update base_model
#1 opened 4 months ago by
davanstrien
New activity in
unsloth/mistral-7b-instruct-v0.3
4 months ago
ValueError: The following `model_kwargs` are not used by the model: ['num_logits_to_keep'] (note: typos in the generate arguments will also show up in this list)
2
#1 opened 4 months ago by
NeelM0906
New activity in
unsloth/Phi-3-mini-4k-instruct-v0-bnb-4bit
4 months ago
Cant use the tokenizer using Unsloth Fastmodel
2
#2 opened 4 months ago by
aryarishit
New activity in
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
5 months ago
RuntimeError: Unsloth: `unsloth/Meta-Llama-3.1-8B-bnb-4bit` is not a base model or a PEFT model.
6
#3 opened 5 months ago by
yorickdejong
New activity in
unsloth/Mistral-Nemo-Base-2407
5 months ago
difference
3
#1 opened 5 months ago by
ehartford
New activity in
google/gemma-2-9b-it
6 months ago
9B - query_pre_attn_scalar = 256 not 224
#26 opened 6 months ago by
danielhanchen
New activity in
google/gemma-2-9b
6 months ago
9B - query_pre_attn_scalar = 256 not 224
#22 opened 6 months ago by
danielhanchen
New activity in
unsloth/llama-3-8b
7 months ago
is this the llama-3-8b model clone?
13
#1 opened 8 months ago by
malhajar
New activity in
unsloth/gemma-2b-bnb-4bit
7 months ago
Model seems to be not PEFT model
1
#1 opened 7 months ago by
neuralresearcher
New activity in
unsloth/mistral-7b-v0.2-bnb-4bit
7 months ago
full disk on colab
3
#2 opened 7 months ago by
Dav22
New activity in
unsloth/Phi-3-mini-4k-instruct-bnb-4bit
7 months ago
TGI - RuntimeError: mat1 and mat2 shapes cannot be multiplied (4145x3072 and 1x14155776)
4
#3 opened 7 months ago by
turjo4nis
New activity in
unsloth/llama-3-8b-bnb-4bit
7 months ago
34 hour for file tunning ?
4
#7 opened 7 months ago by
dad1909
New activity in
unsloth/llama-3-70b-Instruct-bnb-4bit
7 months ago
Update config.json
#1 opened 7 months ago by
huseink
New activity in
unsloth/llama-3-8b-Instruct
7 months ago
Update config.json
2
#3 opened 7 months ago by
huseink
New activity in
unsloth/llama-3-8b-Instruct-bnb-4bit
7 months ago
Update config.json
1
#2 opened 7 months ago by
huseink
New activity in
unsloth/Phi-3-mini-4k-instruct-bnb-4bit
8 months ago
No package metadata was found for bitsandbytes
1
#1 opened 8 months ago by
halilbabacan
New activity in
unsloth/llama-3-8b-Instruct-bnb-4bit
8 months ago
BitsAndBytesConfig error
3
#1 opened 8 months ago by
vdavidr
Load more