Egida (LLM Safety)
Collection
Models and datasets used to improve the safety of LLMs, preventing harm in the presence of jailbreaking.
5 items
@misc{garciagasulla2025efficientsafetyretrofittingjailbreaking,
title={Efficient Safety Retrofitting Against Jailbreaking for LLMs},
author={Dario Garcia-Gasulla and Adrian Tormos and Anna Arias-Duart and Daniel Hinjos and Oscar Molina-Sedano and Ashwin Kumar Gururajan and Maria Eugenia Cardello},
year={2025},
eprint={2502.13603},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2502.13603},
}