Triton error while running demo code
2
#11 opened 16 days ago
by
chiauho
Slower than standard Llama 8b?
1
#10 opened 16 days ago
by
Sijuade
I found some errors when building on a rpi 5
1
#9 opened 18 days ago
by
eddieoz
You can try to convert DeepSeek-V2.5 or Llama-3.1-Nemotron-70B-Instruct-HF?
2
#8 opened 22 days ago
by
win10
Finetuning this model
6
#7 opened about 1 month ago
by
Andrefty
GGUF conversion
11
#3 opened about 2 months ago
by
compilade