theminji/TinyLlama-v2ray


Jeff man112 committed on Dec 26, 2023

Commit 8d27259 · 1 Parent(s): 0acdd9b

Update README.md

Files changed (1)

README.md +46 -20


@@ -8,28 +8,58 @@ tags:
 model-index:
 - name: TinyLlama-v2ray
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # TinyLlama-v2ray
-This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v0.6](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v0.6) on the None dataset.
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters
@@ -45,13 +75,9 @@ The following hyperparameters were used during training:
 - training_steps: 1000
 - mixed_precision_training: Native AMP
-### Training results
 ### Framework versions
 - Transformers 4.35.2
 - Pytorch 2.1.0+cu121
 - Datasets 2.16.0
-- Tokenizers 0.15.0

 model-index:
 - name: TinyLlama-v2ray
   results: []
+datasets:
+- TheBossLevel123/v2ray
+library_name: transformers
 ---
 # TinyLlama-v2ray
+This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v0.6](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v0.6) on the [TheBossLevel123/v2ray](https://huggingface.co/datasets/TheBossLevel123/v2ray) dataset.
 ## Model description
+Prompt format is as follows:
+```py
+<|im_start|>user
+{prompt}<|im_end|>
+<|im_start|>assistant
+```
+The model is intended to mimic the behavior of v2ray, so results will most likely be nonsensical or gibberish.
+## Example Usage
+```py
+import torch
+from transformers import pipeline, AutoTokenizer
+import re
+tokenizer = AutoTokenizer.from_pretrained("TheBossLevel123/TinyLlama-v2ray")
+pipe = pipeline("text-generation", model="TheBossLevel123/TinyLlama-v2ray", torch_dtype=torch.bfloat16, device_map="auto")
+def formatted_prompt(prompt) -> str:
+    return f"<|im_start|>user\n{prompt}<|im_end|>\n<|im_start|>assistant"
+def extract_text(text):
+    pattern = r'v2ray\n(.*?)(?=<\|im_end\|>)'
+    match = re.search(pattern, text, re.DOTALL)
+    if match:
+        return f"Output: {match.group(1)}"
+    else:
+        return "No match found"
+prompt = 'what are your thoughts on ccp'
+outputs = pipe(formatted_prompt(prompt), max_new_tokens=50, do_sample=True, temperature=0.4)
+if outputs and "generated_text" in outputs[0]:
+    text = extract_text(outputs[0]["generated_text"])
+    print(f"Prompt: {prompt}")
+    print("")
+    print(text)
+else:
+    print("No output or unexpected structure")
+#Prompt: what are your thoughts on ccp
+#
+#Output: <Re: insaneness> you are a ccp
+```
 ### Training hyperparameters
 - training_steps: 1000
 - mixed_precision_training: Native AMP
 ### Framework versions
 - Transformers 4.35.2
 - Pytorch 2.1.0+cu121
 - Datasets 2.16.0
+- Tokenizers 0.15.0
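
The prompt format added in this commit can be exercised without downloading the model. The sketch below is not part of the commit: `build_prompt` mirrors the README's `formatted_prompt`, while `extract_reply` and the sample generation string are hypothetical and only illustrate one way to cut a reply at `<|im_end|>`. It assumes the generated text starts with the prompt, which is the default behavior of the `text-generation` pipeline; the README's own `extract_text` instead anchors on a `v2ray\n` marker.

```python
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def build_prompt(user_message: str) -> str:
    """Wrap a user message in the prompt format from the model card."""
    return f"{IM_START}user\n{user_message}{IM_END}\n{IM_START}assistant"


def extract_reply(generated_text: str, prompt_text: str) -> str:
    """Hypothetical helper: drop the echoed prompt, keep text up to <|im_end|>."""
    if generated_text.startswith(prompt_text):
        continuation = generated_text[len(prompt_text):]
    else:
        continuation = generated_text
    # partition returns the whole string as the first item if IM_END is absent
    reply, _, _ = continuation.partition(IM_END)
    return reply.strip()


prompt_text = build_prompt("hello")
# Made-up stand-in for model output, used only to show the string handling.
fake_generation = prompt_text + "\nhypothetical reply" + IM_END + "\n" + IM_START + "user"
print(extract_reply(fake_generation, prompt_text))
```

With a real pipeline call, `fake_generation` would be replaced by `outputs[0]["generated_text"]` from the example in the README.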