codegen-350M-multi-onnx / tokenizer_config.json
SirWaffle
added vocab/tokenizer related files
979c6d5
raw
history blame
240 Bytes
{"unk_token": "<|endoftext|>", "bos_token": "<|endoftext|>", "eos_token": "<|endoftext|>", "add_prefix_space": false, "model_max_length": 1e+20, "special_tokens_map_file": null, "name_or_path": "gpt2", "tokenizer_class": "CodeGenTokenizer"}