Manli's picture
Overwrite the `eos_token_id` for generation, avoiding the endless generation issue that happens only with the HF converted models
2b8d5f0