Propicto
/

t2p-t5-large-orfeo

text2text-generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

cecilemacaire commited on Jul 4, 2024

Commit

ff77171

·

verified ·

1 Parent(s): a63edff

Update README.md

Files changed (1) hide show

README.md +43 -1

README.md CHANGED Viewed

@@ -94,7 +94,49 @@ outputs = model.generate(inputs.to("cuda:0"), max_new_tokens=40, do_sample=True,
 pred = tokenizer.decode(outputs[0], skip_special_tokens=True)
 ```
-## Linking and viewing the predicted sequence of tokens to the corresponding ARASAAC pictograms
 ## Information

 pred = tokenizer.decode(outputs[0], skip_special_tokens=True)
 ```
+## Linking the predicted sequence of tokens to the corresponding ARASAAC pictograms
+```python
+import pandas as pd
+def process_output_trad(pred):
+    return pred.split()
+def read_lexicon(lexicon):
+    df = pd.read_csv(lexicon, sep='\t')
+    df['keyword_no_cat'] = df['lemma'].str.split(' #').str[0].str.strip().str.replace(' ', '_')
+    return df
+def get_id_picto_from_predicted_lemma(df_lexicon, lemma):
+    id_picto = df_lexicon.loc[df_lexicon['keyword_no_cat'] == lemma, 'id_picto'].tolist()
+    return (id_picto[0], lemma) if id_picto else (0, lemma)
+lexicon = read_lexicon("lexicon.csv")
+sentence_to_map = process_output_trad(pred)
+pictogram_ids = [get_id_picto_from_predicted_lemma(lexicon, lemma) for lemma in sentence_to_map]
+```
+## Viewing the predicted sequence of ARASAAC pictograms
+```python
+def generate_html(ids):
+    html_content = '<html><body>'
+    for picto_id, lemma in ids:
+        if picto_id != 0:  # ignore invalid IDs
+            img_url = f"https://static.arasaac.org/pictograms/{picto_id}/{picto_id}_500.png"
+            html_content += f'''
+            <figure style="display:inline-block; margin:1px;">
+                <img src="{img_url}" alt="{lemma}" width="200" height="200" />
+                <figcaption>{lemma}</figcaption>
+            </figure>
+            '''
+    html_content += '</body></html>'
+    return html_content
+html = generate_html(pictogram_ids)
+with open("pictograms.html", "w") as file:
+    file.write(html)
+```
 ## Information