FIM-Tokens not marked special

#4 · opened by ruediste

Hi,
I debugged the tokenizer stack for a few hours until I discovered that the FIM tokens (<|fim_prefix|>, <|fim_middle|>, <|fim_suffix|>, etc.) are not marked as special. Is there a reason for this? Below is an excerpt from tokenizer.json:

{
  "id": 151660,
  "content": "<|fim_middle|>",
  "single_word": false,
  "lstrip": false,
  "rstrip": false,
  "normalized": false,
  "special": false
},
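
For reference, this is easy to reproduce from Python (a minimal check; I am assuming the public Hub id Qwen/Qwen2.5-Coder-7B here). Because the flag is false, the tokens never appear in the special-token list, and decode(..., skip_special_tokens=True) will not strip them:

from transformers import AutoTokenizer

# Assumes the public Hub id; point this at a local path if preferred.
tok = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-Coder-7B")

# The FIM tokens are in the vocabulary, but with special=False they
# are not reported as special tokens.
print(tok.convert_tokens_to_ids("<|fim_middle|>"))  # 151660
print("<|fim_middle|>" in tok.all_special_tokens)   # False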

Download tokenizer.json, tokenizer_config.json, and vocab.json into a directory, e.g. path\to\your\Qwen\Qwen2.5-Coder-7B, and run the code below:

from transformers import AutoTokenizer

model_dir = r'path\to\your\Qwen\Qwen2.5-Coder-7B'
tokenizer = AutoTokenizer.from_pretrained(
    pretrained_model_name_or_path=model_dir,
    local_files_only=True,
)

# Re-add the FIM tokens with special_tokens=True; on recent transformers
# versions this flips the special flag even for tokens already in the vocab.
fim_tokens = ["<|fim_prefix|>", "<|fim_middle|>", "<|fim_suffix|>"]
tokenizer.add_tokens(fim_tokens, special_tokens=True)
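
To verify the fix took effect (a quick sanity check; the expected outputs assume the token ids from the excerpt above):

# With the special flag set, skip_special_tokens now strips the FIM
# markers during decoding.
ids = tokenizer.encode("<|fim_prefix|>hello", add_special_tokens=False)
print(tokenizer.decode(ids, skip_special_tokens=True))   # hello
print(tokenizer.decode(ids, skip_special_tokens=False))  # <|fim_prefix|>hello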
