Example:

from transformers import AutoModelForCausalLM, AutoTokenizer, GenerationConfig

tokenizer = AutoTokenizer.from_pretrained("Qwen-1_8B-m4-LDJnr-combined", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("Qwen-1_8B-m4-LDJnr-combined", device_map="auto", trust_remote_code=True).eval()
response, _ = model.chat(tokenizer, "What kind of a noise annoys a noisy oyster?", history=None)
print(response)
Downloads last month
87
Safetensors
Model size
1.84B params
Tensor type
FP16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API does not yet support model repos that contain custom code.

Collection including aloobun/Qwen-1_8B-m4-LDJnr-combined