I want to use tabbyAPI with a draft_model to speed up generation, and leave some VRAM for other applications or more context.