SafePaca

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

g8a9 authored a paper 4 days ago

MSTS: A Multimodal Safety Test Suite for Vision-Language Models

Paul authored a paper 4 days ago

MSTS: A Multimodal Safety Test Suite for Vision-Language Models

Paul authored a paper 11 days ago

AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages

View all activity

safepaca's activity

g8a9

authored a paper 4 days ago

MSTS: A Multimodal Safety Test Suite for Vision-Language Models

Paper • 2501.10057 • Published 10 days ago • 8

Paul

authored a paper 4 days ago

MSTS: A Multimodal Safety Test Suite for Vision-Language Models

Paper • 2501.10057 • Published 10 days ago • 8

Paul

authored a paper 11 days ago

AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages

Paper • 2501.08284 • Published 12 days ago • 6

Paul

in safepaca/absolute-harmfulness-predictor-redteam-osst 3 months ago

Adding `safetensors` variant of this model

#2 opened 3 months ago by

SFconvertbot

vinid

authored 6 papers 8 months ago

Easily Accessible Text-to-Image Generation Amplifies Demographic Stereotypes at Large Scale

Paper • 2211.03759 • Published Nov 7, 2022

Contrastive Language-Image Pre-training for the Italian Language

Paper • 2108.08688 • Published Aug 19, 2021 • 2

XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models

Paper • 2308.01263 • Published Aug 2, 2023

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

Paper • 2309.07875 • Published Sep 14, 2023

When and why vision-language models behave like bags-of-words, and what to do about it?

Paper • 2210.01936 • Published Oct 4, 2022

TextGrad: Automatic "Differentiation" via Text

Paper • 2406.07496 • Published Jun 11, 2024 • 29

Paul

authored a paper 9 months ago

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Paper • 2404.12241 • Published Apr 18, 2024 • 11

turingmachine

authored 3 papers 10 months ago

turingmachine

authored 6 papers about 1 year ago

Scaling Instruction-Finetuned Language Models

Paper • 2210.11416 • Published Oct 20, 2022 • 7

Holistic Evaluation of Language Models

Paper • 2211.09110 • Published Nov 16, 2022 • 1

Language Models are Multilingual Chain-of-Thought Reasoners

Paper • 2210.03057 • Published Oct 6, 2022

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

Paper • 2210.09261 • Published Oct 17, 2022

Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding

Paper • 2211.07634 • Published Nov 14, 2022

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

Paper • 2309.07875 • Published Sep 14, 2023

AI & ML interests

Recent Activity

Team members 7

safepaca's activity

Adding `safetensors` variant of this model