is there a compiled list of token speed optimization methods yet?
#6
by
Alignment-Lab-AI
- opened
i know through my own experimentation ive been able to run it very quickly,
also; has anyone tried to binarize these yet?