๐ Datasets
Q&A
~200k
Wikipedia-based question-answer pairs that require reasoning across multiple documents. Useful for evaluating false positives and over-triggering on natural Q&A.
Jailbreak
79
Collection of jailbreak related prompts for ChatGPT. Useful for evaluating the detection rate of publicly known jailbreaks.
Prompt Injection
~278k
Dataset of prompts submitted to Lakeraโs Mosscap prompt injection capture the flag game that was created as part of the DEF CON 31 AI Village Generative Red Team Challenge. Useful for evaluating detection rate on a large sample of real-world prompts that are a mixture of adversarial techniques and benign prompts.
Content Moderation
1680
Dataset of inputs that cover a wide range of content moderation use cases. Useful for evaluating the efficacy of content moderation.
Last updated