MSTS: A Multimodal Safety Test Suite for Vision-Language Models Paper • 2501.10057 • Published 10 days ago • 8
AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages Paper • 2501.08284 • Published 12 days ago • 6
Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale Paper • 2211.03759 • Published Nov 7, 2022
Contrastive Language-Image Pre-training for the Italian Language Paper • 2108.08688 • Published Aug 19, 2021 • 2
XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models Paper • 2308.01263 • Published Aug 2, 2023
Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions Paper • 2309.07875 • Published Sep 14, 2023
When and why vision-language models behave like bags-of-words, and what to do about it? Paper • 2210.01936 • Published Oct 4, 2022
Introducing v0.5 of the AI Safety Benchmark from MLCommons Paper • 2404.12241 • Published Apr 18, 2024 • 11
A Benchmark for Learning to Translate a New Language from One Grammar Book Paper • 2309.16575 • Published Sep 28, 2023 • 1
Do Language Models Know When They're Hallucinating References? Paper • 2305.18248 • Published May 29, 2023
Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models Paper • 2401.01301 • Published Jan 2, 2024
Language Models are Multilingual Chain-of-Thought Reasoners Paper • 2210.03057 • Published Oct 6, 2022
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them Paper • 2210.09261 • Published Oct 17, 2022
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding Paper • 2211.07634 • Published Nov 14, 2022