Mrw33554432
/

bitLinear-phi-1.5

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Mrw33554432 commited on Apr 16

Commit

b4239f2

•

1 Parent(s): 4ee4162

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -22,13 +22,14 @@ Please notice the kernel is not optimzed for 1-bit matrix yet.
 The model is trained on a 3090(24GB) for 16 hours.
 ### For training code, check https://github.com/Mrw33554432/Bitlinear4HF.
 The training code should be compatible with most of the LLMs in huggingface.
 Using pretrained model weight (normal models) for training will not work due to gradient explosion.
-## Sample inference code
 ```python

 The model is trained on a 3090(24GB) for 16 hours.
+### For faster(3x) inference, check https://github.com/Mrw33554432/Bitlinear4HF and install custom kernel
 ### For training code, check https://github.com/Mrw33554432/Bitlinear4HF.
 The training code should be compatible with most of the LLMs in huggingface.
 Using pretrained model weight (normal models) for training will not work due to gradient explosion.
+## Sample inference code (slow)
 ```python