Casting torch.bfloat16 to torch.float16.

by nokados - opened Nov 2, 2024

In the README, you recommend using --dtype half, which is equivalent to float16. However, the config specifies bfloat16, so vLLM warns that it is casting torch.bfloat16 to torch.float16. Would it be better to recommend --dtype bfloat16, which matches the original dtype?
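For context on why the cast may matter, here is a minimal sketch in plain Python (no torch or vLLM involved) of how the two formats differ in range. The helper names `to_bfloat16` and `to_float16` are hypothetical, and the bfloat16 conversion truncates instead of rounding to nearest, which is a simplification:

```python
import struct

FLOAT16_MAX = 65504.0  # largest finite IEEE 754 half-precision value


def to_bfloat16(x: float) -> float:
    """Approximate bfloat16 by keeping the top 16 bits of the float32 encoding.

    Simplification: this truncates; real conversions usually round to nearest.
    """
    bits = struct.unpack(">I", struct.pack(">f", x))[0]
    return struct.unpack(">f", struct.pack(">I", bits & 0xFFFF0000))[0]


def to_float16(x: float) -> float:
    """Round-trip x through IEEE half precision; out-of-range values become inf."""
    try:
        return struct.unpack(">e", struct.pack(">e", x))[0]
    except OverflowError:
        # struct refuses to pack values beyond the float16 range.
        return float("inf") if x > 0 else float("-inf")


if __name__ == "__main__":
    value = to_bfloat16(1e5)        # fits in bfloat16 (8 exponent bits)
    print(value, value > FLOAT16_MAX)
    print(to_float16(value))        # exceeds the float16 range (5 exponent bits)
```

bfloat16 keeps the 8-bit exponent of float32, while float16 has only 5 exponent bits, so a weight or activation that is finite in bfloat16 can overflow once cast to float16. Whether that actually happens for this model is something I have not checked.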
