How did you quantize the model?

by AgeOfAlgorithms

Hi, I'm new to quantizing, and I've been researching ways to quantize the xcodec2 model.
Do you mind sharing how you made the bf16 quantization? Is it possible to create an 8-bit quantization as well?

There isn't really much to this one: simply load the model, cast the weights to bfloat16, and save it again. For 8-bit you can try torch.float8_e4m3fn or torch.float8_e5m2, but I have a feeling it won't work too well.

For safetensors:

import torch
from safetensors.torch import load_file, save_file

# Load the raw tensors as a flat state dict
model = load_file("model.safetensors")

# Cast every fp32 tensor down to bf16; leave other dtypes alone
for k in model:
    if model[k].dtype == torch.float32:
        model[k] = model[k].to(torch.bfloat16)

save_file(model, "model.bf16.safetensors")
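
If you want to experiment with the 8-bit floats, it's the same loop with a different target dtype. A minimal sketch, assuming a PyTorch build recent enough to have the float8 dtypes (2.1+); keep in mind most ops can't run natively in float8, so this is mainly useful for storage:

import torch
from safetensors.torch import load_file, save_file

# Start from the original fp32 state dict again (assumes PyTorch >= 2.1 for float8 dtypes)
model = load_file("model.safetensors")

for k in model:
    if model[k].dtype == torch.float32:
        model[k] = model[k].to(torch.float8_e4m3fn)  # or torch.float8_e5m2

save_file(model, "model.fp8.safetensors")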

For a pt file you load with torch.load instead, and you might need to add some metadata when saving:

# same imports as above
model = torch.load("model.pt", map_location="cpu")  # assumes the .pt holds a flat state dict

# same bf16 cast loop as above
for k in model:
    if model[k].dtype == torch.float32:
        model[k] = model[k].to(torch.bfloat16)

save_file(model, "model.bf16.safetensors", metadata={"format": "pt"})
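
If you want to double-check the conversion, you can reload the converted file and look at the dtypes. A quick sketch using the filename from above:

from safetensors.torch import load_file

converted = load_file("model.bf16.safetensors")
# Every tensor that was fp32 should now be bf16
print({t.dtype for t in converted.values()})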
