Zeze Nene's picture

Zeze Nene

Neman

·

AI & ML interests

LLM, evolutionary programming, AI

Recent Activity

liked a model 3 days ago

HuggingFaceTB/SmolVLM-500M-Instruct

liked a model 4 days ago

NikolayKozloff/DeepSeek-R1-Distill-Qwen-7B-Multilingual-Q8_0-GGUF

liked a model 5 days ago

bartowski/Mistral-Small-24B-Instruct-2501-GGUF

View all activity

Organizations

None yet

Neman's activity

New activity in unsloth/DeepSeek-R1-Distill-Qwen-14B-GGUF 14 days ago

unknown pre-tokenizer type: 'deepseek-r1-qwen'

#1 opened 16 days ago by

New activity in unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF 14 days ago

unknown pre-tokenizer type: 'deepseek-r1-qwen'

#1 opened 16 days ago by

New activity in srinivasbilla/llasa-3b 15 days ago

safetensors size

#1 opened 15 days ago by

New activity in iiiorg/piiranha-v1-detect-personal-information 5 months ago

Phone number format

#4 opened 5 months ago by

New activity in google/gemma-2-9b-it 5 months ago

Update?

#44 opened 5 months ago by

New activity in OpenGVLab/Mini-InternVL-Chat-4B-V1-5 8 months ago

Flash Attention

#3 opened 8 months ago by

New activity in OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-7B 8 months ago

What ViT?

#2 opened 9 months ago by

New activity in deepseek-ai/deepseek-vl-7b-chat 11 months ago

4-bit quant?

#3 opened 11 months ago by

New activity in YaTharThShaRma999/DeepSeek-vl-4bit-7b 11 months ago

Base or Chat?

#1 opened 11 months ago by

New activity in ISTA-DASLab/Mixtral-8x7b-AQLM-2Bit-1x16-hf 12 months ago

NameError: name 'flash_attn_func' is not defined

#4 opened 12 months ago by

New activity in MMInstruction/Silkie about 1 year ago

'QWenTokenizer' object has no attribute 'IMAGE_ST'

#1 opened about 1 year ago by

New activity in TheBloke/Qwen-14B-Chat-GPTQ over 1 year ago

Will it come?

#2 opened over 1 year ago by

New activity in Qwen/Qwen-VL-Chat-Int4 over 1 year ago

Update of checkpoints?

#1 opened over 1 year ago by

New activity in facebook/hf-seamless-m4t-large over 1 year ago

ImportError: cannot import name 'SeamlessM4TModel' from 'transformers'

#13 opened over 1 year ago by

New activity in adept/fuyu-8b over 1 year ago

Question What are the results for image captioning for fuyu-8b in comparison to other models?

#8 opened over 1 year ago by

What are the memory requirements for running the model?

#6 opened over 1 year ago by

New activity in llm-agents/tora-code-7b-v1.0 over 1 year ago

gguf variant?

#1 opened over 1 year ago by

New activity in TheBloke/Qwen-14B-Chat-GPTQ over 1 year ago

Will it come?

#2 opened over 1 year ago by