Update README.md
README.md (changed):

@@ -45,7 +45,7 @@ tokenizer = AutoTokenizer.from_pretrained("urduhack/roberta-urdu-small")
 Use the following code snippet to generate paraphrases with the loaded model and tokenizer:
 ```
 # Example sentence
-input_sentence = "
+input_sentence = "تصوراتی طور پر کریم سکمنگ کی دو بنیادی جہتیں ہیں - مصنوعات اور جغرافیہ۔"

 # Tokenize the input sentence
 inputs = tokenizer(input_sentence, truncation=True, padding=True, return_tensors="pt")
@@ -59,3 +59,13 @@ with torch.no_grad():
 paraphrase = tokenizer.decode(outputs[0], skip_special_tokens=True)
 print("Paraphrase:", paraphrase)
 ```
+
+## Performance
+The model has been fine-tuned on a 30k-row dataset of Urdu paraphrases and generates high-quality paraphrases. Detailed performance metrics, such as accuracy and fluency, are being evaluated and will be added soon.
+
+## Contributing
+Contributions to the Urdu Paraphrase Generation Model are welcome! If you find any issues or have suggestions for improvements, please open an issue or submit a pull request.
+
+## License
+
+This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details.
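
The two hunks show only the edges of the README's snippet: the lines that load the paraphrase model and run generation (old lines 52-58) fall outside the diff context. For orientation, here is a minimal end-to-end sketch of how the visible pieces could fit together, assuming the fine-tuned checkpoint is a sequence-to-sequence model loadable with `AutoModelForSeq2SeqLM`. The checkpoint path and the `generate()` arguments are placeholders, not the README's actual code.

```python
# Minimal sketch only: the model-loading and generation lines are not visible in the
# hunks above, so the model class, checkpoint path, and generate() settings below are
# assumptions rather than the README's exact code.
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Tokenizer as shown in the hunk header; the paraphrase checkpoint path is a placeholder.
tokenizer = AutoTokenizer.from_pretrained("urduhack/roberta-urdu-small")
model = AutoModelForSeq2SeqLM.from_pretrained("path/to/urdu-paraphrase-checkpoint")
model.eval()

# Example sentence from the new line 48
# (roughly: "Conceptually, cream skimming has two basic dimensions - products and geography.")
input_sentence = "تصوراتی طور پر کریم سکمنگ کی دو بنیادی جہتیں ہیں - مصنوعات اور جغرافیہ۔"

# Tokenize the input sentence
inputs = tokenizer(input_sentence, truncation=True, padding=True, return_tensors="pt")

# Generate a paraphrase without tracking gradients
with torch.no_grad():
    outputs = model.generate(**inputs, max_length=128, num_beams=5)

# Decode the generated token ids back into text
paraphrase = tokenizer.decode(outputs[0], skip_special_tokens=True)
print("Paraphrase:", paraphrase)
```

The `with torch.no_grad():` block and the final decode/print lines mirror the context visible in the second hunk; the rest is filled in under the assumptions above.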