yhavinga
/

dutch-llama-tokenizer

Model card Files Files and versions Community

yhavinga commited on Jan 3, 2024

Commit

e974e27

·

1 Parent(s): 3bda927

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -30,18 +30,18 @@ The tokenizer was trained using the `spm_train` command with the following setti
 ## Installation
 To use the Dutch-Llama Tokenizer, ensure you have Python 3.10.12 or later installed. Then, install the Transformers library from Hugging Face:
-```
 pip install transformers
 ```
 ## Usage
 First, import the `AutoTokenizer` from the Transformers library and load the Dutch-Llama Tokenizer:
-```
 from transformers import AutoTokenizer
 tokenizer = AutoTokenizer.from_pretrained("yhavinga/dutch-llama-tokenizer")
 ```
 To tokenize text, use the `tokenizer.tokenize` method. For converting tokens to IDs and decoding them back to text, use `tokenizer.convert_tokens_to_ids` and `tokenizer.decode` respectively:
-```
 # Example text
 text = "Steenvliegen of oevervliegen[2] (Plecoptera) 华为发布Mate60手机"

 ## Installation
 To use the Dutch-Llama Tokenizer, ensure you have Python 3.10.12 or later installed. Then, install the Transformers library from Hugging Face:
+```shell
 pip install transformers
 ```
 ## Usage
 First, import the `AutoTokenizer` from the Transformers library and load the Dutch-Llama Tokenizer:
+```python
 from transformers import AutoTokenizer
 tokenizer = AutoTokenizer.from_pretrained("yhavinga/dutch-llama-tokenizer")
 ```
 To tokenize text, use the `tokenizer.tokenize` method. For converting tokens to IDs and decoding them back to text, use `tokenizer.convert_tokens_to_ids` and `tokenizer.decode` respectively:
+```python
 # Example text
 text = "Steenvliegen of oevervliegen[2] (Plecoptera) 华为发布Mate60手机"