Training Data and Distillation

1
#44 opened about 9 hours ago by
kardosdrur

ONNX related issues

#43 opened about 1 month ago by
harsh-shah

error deploying model

#38 opened 2 months ago by
jim-bo

Multilingual or Bilingual

#25 opened 5 months ago by
MeanBean-05

Remote Code execution risk

4
#24 opened 5 months ago by
srivishnuceg

flash attention

#21 opened 6 months ago by
Disassemblern

Model loading size on GPU

#20 opened 7 months ago by
divrajnd

MRL and linear layers

1
#19 opened 7 months ago by
bobox

Can it output sparse vector?

1
#18 opened 7 months ago by
kk3dmax

About Quantized Models

#14 opened 7 months ago by
infgrad

Any multi-lingual variant

1
#10 opened 7 months ago by
prophet123

Model max_seq_length

7
#6 opened 7 months ago by
shuyuej

Fix prompt_name typo

1
#4 opened 8 months ago by
mber