Update README.md
README.md
CHANGED
@@ -68,6 +68,42 @@ Our work can be cited as:
This model is a Chat model, that is, it is fine-tuned for chat interactions and works best with the provided chat template.
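To see exactly what prompt the template produces, it can be rendered to text with the tokenizer's `apply_chat_template`; a minimal sketch (the resulting format depends entirely on the template shipped with this tokenizer):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("croissantllm/CroissantLLMChat-v0.1")

messages = [{"role": "user", "content": "Qui est le président français ?"}]

# Render the chat template as plain text to inspect the prompt format;
# add_generation_prompt=True appends the header of the assistant turn.
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
print(prompt)
```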
#### With pipeline

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

model_name = "croissantllm/CroissantLLMChat-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16, device_map="auto")

messages = [
    {"role": "user", "content": "Qui est le président français ?"},
]

pipe = pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
)

# Greedy decoding: with do_sample=False the temperature value is not used.
generation_args = {
    "max_new_tokens": 500,
    "return_full_text": False,
    "temperature": 0.0,
    "do_sample": False,
}

output = pipe(messages, **generation_args)
print(output[0]['generated_text'])
```

#### With generate

This might require a stopping criterion on the <|im_end|> token.

```python
import torch
```
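
A minimal sketch of such a stopping criterion (the class name `StopOnToken` is illustrative, not from the model card, and it assumes `<|im_end|>` maps to a single token in this tokenizer's vocabulary):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, StoppingCriteria, StoppingCriteriaList

model_name = "croissantllm/CroissantLLMChat-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16, device_map="auto")

class StopOnToken(StoppingCriteria):
    """Stop generation as soon as the given token id is produced."""
    def __init__(self, stop_token_id: int):
        self.stop_token_id = stop_token_id

    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor, **kwargs) -> bool:
        return input_ids[0, -1].item() == self.stop_token_id

# Assumption: "<|im_end|>" is a single token; otherwise this returns the unk id.
stop_id = tokenizer.convert_tokens_to_ids("<|im_end|>")

messages = [{"role": "user", "content": "Qui est le président français ?"}]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt", add_generation_prompt=True).to(model.device)

outputs = model.generate(
    inputs,
    max_new_tokens=500,
    do_sample=False,
    stopping_criteria=StoppingCriteriaList([StopOnToken(stop_id)]),
)
print(tokenizer.decode(outputs[0], skip_special_tokens=False))
```

If the tokenizer already registers `<|im_end|>` as its end-of-sequence token, passing `eos_token_id=stop_id` to `generate` achieves the same effect without a custom criterion.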
|