๐Ÿ“š Datasets

Name
Type
#ย Prompts
Purpose

Q&A

~200k

Wikipedia-based question-answer pairs that require reasoning across multiple documents. Useful for evaluating false positives and over-triggering on natural Q&A.

Jailbreak

79

Collection of jailbreak related prompts for ChatGPT. Useful for evaluating the detection rate of publicly known jailbreaks.

Prompt Injection

~278k

Dataset of prompts submitted to Lakeraโ€™s Mosscap prompt injection capture the flag game that was created as part of the DEF CON 31 AI Village Generative Red Team Challenge. Useful for evaluating detection rate on a large sample of real-world prompts that are a mixture of adversarial techniques and benign prompts.

Content Moderation

1680

Dataset of inputs that cover a wide range of content moderation use cases. Useful for evaluating the efficacy of content moderation.

Last updated