Mrw33554432
/

bitLinear-phi-1.5

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Mrw33554432 commited on Apr 12

Commit

ba01de7

•

1 Parent(s): 1ed66f8

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -24,9 +24,9 @@ The model is trained on a 3090(24GB) for 16 hours.
 ### For training code, check --placeholder--.
-The training code should be compatible with most of the LLMs in huggingface, but you have to start from scratch.
-Using pretrained model weight will not work due to gradient explosion.
 ## Sample inference code

 ### For training code, check --placeholder--.
+The training code should be compatible with most of the LLMs in huggingface.
+Using pretrained model weight (normal models) for training will not work due to gradient explosion.
 ## Sample inference code