Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
144.5
TFLOPS
673
15
191
Arthur Zucker
ArthurZ
Follow
gsarti's profile picture
tanyigy's profile picture
Jayita13's profile picture
307 followers
ยท
17 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Recent Activity
liked
a model
15 days ago
meta-llama/Llama-3.2-1B-Instruct
liked
a Space
15 days ago
m-ric/llm-race-to-the-top
reacted
to
MonsterMMORPG
's
post
with ๐
27 days ago
FLUX Redux is a hidden Gem I am still doing huge research to publish an amazing fully Public - no paywalled Tutorial, but this is generated via SwarmUI Style Model Merge Strength : 0.5 FLUX Guidance Scale is : 6 Used base model is my FLUX fine tuned model with 256 images via Kohya SS GUI as shown in tutorial ( https://youtu.be/FvpWy1x5etM ) - 70 epoch Prompt : anime ohwx man walking in a jungle <segment:yolo-face_yolov9c.pt-1,0.7,0.5> ohwx man, anime
View all activity
Articles
Fixing Gradient Accumulation
Oct 16
โข
43
Improving Hugging Face Training Efficiency Through Packing with Flash Attention
Aug 21
โข
25
Fine-Tuning Gemma Models in Hugging Face
Feb 23
โข
27
Code Llama: Llama 2 learns to code
Aug 25, 2023
โข
9
Organizations
ArthurZ
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
mistralai/Pixtral-Large-Instruct-2411
about 1 month ago
Upload transformers version
7
#3 opened about 1 month ago by
ArthurZ
New activity in
huggingface/documentation-images
about 1 month ago
Upload Meta-Llama-3-8B-Instruct, seqlen = 512, python, w_ compile.png
1
#392 opened about 1 month ago by
kwen2501
New activity in
mistral-community/pixtral-12b
2 months ago
Update model weight
8
#13 opened 2 months ago by
nguyen-brat
Update hidden_act to silu
2
#14 opened 2 months ago by
ArthurZ
New activity in
rhymes-ai/Aria
3 months ago
llama.cpp support
9
#1 opened 3 months ago by
ayyylol
New activity in
google/gemma-2-2b-jpn-it
3 months ago
tokenizer_config.json is different from gemma-2-2b-it
2
#8 opened 3 months ago by
dahara1
New activity in
mistral-community/pixtral-12b
3 months ago
How can i use the full 24GB model instead of this separated safetensors files?
1
#8 opened 3 months ago by
Valadaro
New activity in
meta-llama/Llama-3.2-11B-Vision-Instruct
3 months ago
hidden_activation vs hidden_act in config.json
2
#10 opened 3 months ago by
heheda
New activity in
mistral-community/pixtral-12b-240910
3 months ago
How to use safetensors?
2
#13 opened 3 months ago by
prathi1729
New activity in
mistral-community/pixtral-12b
3 months ago
lamma cpp ht to gguf not working
4
#2 opened 3 months ago by
RameshRajamani
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
4 months ago
8-kv-heads
8
#14 opened 5 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
4 months ago
Update config.json
#17 opened 4 months ago by
ArthurZ
Config KV Heads should be 8 now?
1
#16 opened 5 months ago by
tanmaylaud
New activity in
meta-llama/Llama-3.1-405B-Instruct-FP8
5 months ago
8 kv heads
2
#13 opened 5 months ago by
kkokkie2360
New activity in
meta-llama/Llama-3.1-405B-FP8
5 months ago
8-kv-heads
#15 opened 5 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B
5 months ago
8-kv-heads
3
#21 opened 5 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-Instruct
5 months ago
8-kv-heads
4
#17 opened 5 months ago by
ArthurZ
New activity in
meta-llama/Llama-3.1-405B-FP8
5 months ago
Updated eos_token to include multiple IDs
1
#14 opened 5 months ago by
vontimitta
Update tokenizer to prepend special token
#12 opened 5 months ago by
lysandre
New activity in
meta-llama/Llama-3.1-70B
5 months ago
Update tokenizer to prepend special token
1
#11 opened 5 months ago by
lysandre
Load more