Arthur Zucker
ArthurZ
255 followers · 17 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Articles
Improving Hugging Face Training Efficiency Through Packing with Flash Attention (30 days ago • 19)
Fine-Tuning Gemma Models in Hugging Face (Feb 23 • 21)
Code Llama: Llama 2 learns to code (Aug 25, 2023 • 4)
Organizations
ArthurZ's activity
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct-FP8
about 1 month ago
8-kv-heads
8
#14 opened about 1 month ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-405B-FP8
about 1 month ago
Update config.json
#17 opened about 1 month ago by
ArthurZ
Config KV Heads should be 8 now?
1
#16 opened about 1 month ago by
tanmaylaud
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct-FP8
about 1 month ago
8 kv heads
2
#13 opened about 2 months ago by
kkokkie2360
New activity in
meta-llama/Meta-Llama-3.1-405B-FP8
about 1 month ago
8-kv-heads
#15 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-405B
about 1 month ago
8-kv-heads
3
#21 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct
about 1 month ago
8-kv-heads
4
#17 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-405B-FP8
about 2 months ago
Updated eos_token to include multiple IDs
1
#14 opened about 2 months ago by
vontimitta
Update tokenizer to prepend special token
#12 opened about 2 months ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-70B
about 2 months ago
Update tokenizer to prepend special token
1
#11 opened about 2 months ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-8B-Instruct
about 2 months ago
Upload tokenizer
2
#29 opened about 2 months ago by
ArthurZ
Upload tokenizer
#28 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct-FP8
about 2 months ago
Upload tokenizer
1
#9 opened about 2 months ago by
ArthurZ
Update `_name_or_path` to the HF model id
#8 opened about 2 months ago by
davidthomas426
New activity in
meta-llama/Meta-Llama-3.1-8B
about 2 months ago
Update tokenizer to prepend special token
1
#12 opened about 2 months ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct
about 2 months ago
Upload tokenizer
1
#9 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B-Instruct
about 2 months ago
Upload tokenizer
1
#12 opened about 2 months ago by
ArthurZ
New activity in
ArthurZ/new-t5-base
about 2 months ago
Upload tokenizer
#1 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-8B-Instruct
about 2 months ago
Upload tokenizer
#27 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B-Instruct
about 2 months ago
Upload tokenizer
#11 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-405B-FP8
about 2 months ago
Fix quantization_config to work with vLLM v0.5.3.post1
1
#11 opened about 2 months ago by
davidthomas426
New activity in
meta-llama/Meta-Llama-3.1-8B-Instruct
about 2 months ago
DO NOT MERGE v2 make sure vllm and transformers work
#12 opened about 2 months ago by
ArthurZ
DO NOT MERGE test for vllm
2
#11 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B
about 2 months ago
Can we add `use_scaled_rope` in the config.json?
4
#2 opened 2 months ago by
lanking
New activity in
meta-llama/Llama-Guard-3-8B-INT8
about 2 months ago
Update config.json
#6 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Llama-Guard-3-8B
about 2 months ago
Update config.json
#9 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B
about 2 months ago
Update config.json
#9 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B-Instruct
about 2 months ago
Update config.json
#6 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B
about 2 months ago
Update config.json
#8 opened about 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-8B
about 2 months ago
Update config.json
#10 opened about 2 months ago by
ArthurZ
New activity in
google/gemma-2-27b-it
3 months ago
Model repeating information and "spitting out" random characters
3
#12 opened 3 months ago by
brazilianslib
Hallucinations, misspellings etc. Something seems broken?
21
#10 opened 3 months ago by
sam-paech
transformers load fails?
7
#6 opened 3 months ago by
bdambrosio
New activity in
google/gemma-2-9b
3 months ago
Runtime autograd error due to inplace operations
1
#4 opened 3 months ago by
xianbin
New activity in
microsoft/Florence-2-large
3 months ago
Please add to llama.cpp and ollama
3
#21 opened 3 months ago by
KeilahElla
New activity in
meta-llama/Meta-Llama-3-8B
4 months ago
Why are "add_bos_token" and "add_eos_token" missing in tokenizer_config.json ?
1
#140 opened 4 months ago by
ekurtic
New activity in
mistralai/Mistral-7B-Instruct-v0.3
4 months ago
Slow tokenizer problem.
4
#22 opened 4 months ago by
bradhutchings
New activity in
meta-llama/Meta-Llama-3-8B
4 months ago
LlamaTokenizerFast.from_pretrained gives incorrect number of tokens for Llama3
2
#156 opened 4 months ago by
farzadab
New activity in
mistralai/Mistral-7B-Instruct-v0.3
4 months ago
Add minor reference to transformers
4
#7 opened 4 months ago by
osanseviero
Upload tokenizer
#6 opened 4 months ago by
ArthurZ
Upload tokenizer
#5 opened 4 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
4 months ago
Update README.md
#4 opened 4 months ago by
ArthurZ
Update README.md
#3 opened 4 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
4 months ago
Update README.md
#4 opened 4 months ago by
ArthurZ
Update config.json
1
#3 opened 4 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
4 months ago
Upload MistralForCausalLM
#2 opened 4 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
4 months ago
Upload MistralForCausalLM
#2 opened 4 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
4 months ago
Upload tokenizer
1
#1 opened 4 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
4 months ago
Upload tokenizer
#1 opened 4 months ago by
ArthurZ
New activity in
01-ai/Yi-9B
4 months ago
Tokenizer inconsistencies related to HTML tags
4
#11 opened 5 months ago by
sanderland
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
4 months ago
Update config.json
1
#105 opened 4 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
4 months ago
Update config.json
3
#49 opened 4 months ago by
ArthurZ
The sample code for usage with Transformers is incorrect.
2
#45 opened 5 months ago by
endNone
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
4 months ago
How to use EOT_ID
4
#54 opened 5 months ago by
saksham-lamini
New activity in
meta-llama/Meta-Llama-3-8B
4 months ago
Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation.
9
#72 opened 5 months ago by
tianke0711
Unable to load the model for Torch versions starting from 2.0.1
10
#34 opened 5 months ago by
benhachem
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
4 months ago
Update config.json
4
#33 opened 5 months ago by
ArthurZ
Update README.md
1
#31 opened 5 months ago by
kimseungho
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
4 months ago
Update tokenizer_config.json
16
#60 opened 5 months ago by
Navanit-AI
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
5 months ago
Update config.json
1
#71 opened 5 months ago by
ArthurZ