---
library_name: transformers
tags:
- cryptology
- cipher
datasets:
- agentlans/high-quality-english-sentences
language:
- en
base_model:
- google-t5/t5-base
license: apache-2.0
---

This project contains a text-to-text model designed to decrypt English text encoded using a substitution cipher.
In a substitution cipher, each letter in the plaintext is replaced by a corresponding, unique letter to form the ciphertext.
The model leverages statistical and linguistic properties of English to make educated guesses about the letter substitutions,
aiming to recover the original plaintext message.

The model targets monoalphabetic substitution ciphers over English text only: it takes the ciphertext as input and outputs the decoded plaintext.
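
To make the cipher definition concrete, here is a minimal sketch of a monoalphabetic substitution cipher in plain Python. It can be used to produce test inputs for the model. The names `make_key` and `substitute` are hypothetical helpers written for this illustration; they are not part of this project, and the choice to preserve case and leave punctuation untouched is an assumption based on the example in this card, not a documented description of how the training data was built.

```python
import random
import string


def make_key(seed=None):
    """Return a random one-to-one mapping over the 26 lowercase letters."""
    rng = random.Random(seed)
    shuffled = list(string.ascii_lowercase)
    rng.shuffle(shuffled)
    return dict(zip(string.ascii_lowercase, shuffled))


def substitute(text, key):
    """Apply a letter mapping; case is preserved, other characters pass through."""
    out = []
    for ch in text:
        mapped = key.get(ch.lower(), ch)
        out.append(mapped.upper() if ch.isupper() else mapped)
    return "".join(out)


key = make_key(seed=0)
inverse = {cipher: plain for plain, cipher in key.items()}

plaintext = "A family member or a support person may stay with a patient during recovery."
ciphertext = substitute(plaintext, key)

# Applying the inverse mapping recovers the original text.
assert substitute(ciphertext, inverse) == plaintext
```

The model's task is the hard direction: recovering the plaintext when `inverse` is not known.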

Example:

Encoded text: 
**Z hztwgx tstcsf qf z ulooqfe osfuqb tzx uezx awej z ozewsbe vlfwby fsmqisfx.**

Decoded text: 
**A family member or a support person may stay with a patient during recovery.**
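
As a sanity check on a decoding like the one above, the letter mapping can be read off by aligning the two strings character by character. The sketch below does this for the example pair; `derive_key` is a hypothetical helper for illustration only and is not part of the model or the `transformers` library.

```python
def derive_key(cipher, plain):
    """Build the cipher-to-plain letter mapping implied by an aligned pair.

    Raises ValueError if the pair is not consistent with a single
    one-to-one letter substitution.
    """
    if len(cipher) != len(plain):
        raise ValueError("texts must have the same length")
    key = {}
    for c, p in zip(cipher.lower(), plain.lower()):
        if not c.isalpha():
            if c != p:
                raise ValueError("non-letter characters must match")
            continue
        # setdefault stores p on first sight and returns the stored value after.
        if key.setdefault(c, p) != p:
            raise ValueError(f"{c!r} maps to more than one letter")
    if len(set(key.values())) != len(key):
        raise ValueError("two cipher letters map to the same plain letter")
    return key


encoded = "Z hztwgx tstcsf qf z ulooqfe osfuqb tzx uezx awej z ozewsbe vlfwby fsmqisfx."
decoded = "A family member or a support person may stay with a patient during recovery."

key = derive_key(encoded, decoded)
print(sorted(key.items()))
```

If the model's output is not a consistent substitution of the input, `derive_key` raises an error, which makes it a cheap way to flag decodings that need a second look.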

Suggested Usage:
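```py
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Load the model and tokenizer
model_id = "path/to/this-model"  # placeholder: replace with this model's Hub ID or a local path
device = "cuda" if torch.cuda.is_available() else "cpu"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(model_id).to(device)

cipher_text = ""  # Encoded text here!
inputs = tokenizer(cipher_text, return_tensors="pt", padding=True, truncation=True, max_length=256).to(device)
outputs = model.generate(**inputs, max_length=256)  # passes input_ids and attention_mask
decoded_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
```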