๐ Datasets
Last updated
Last updated
Q&A
~200k
Wikipedia-based question-answer pairs that require reasoning across multiple documents. Useful for evaluating false positives and over-triggering on natural Q&A.
Jailbreak
79
Collection of jailbreak related prompts for ChatGPT. Useful for evaluating the detection rate of publicly known jailbreaks.
Prompt Injection
~278k
Dataset of prompts submitted to Lakeraโs prompt injection capture the flag game that was created as part of the . Useful for evaluating detection rate on a large sample of real-world prompts that are a mixture of adversarial techniques and benign prompts.
Content Moderation
1680
Dataset of inputs that cover a wide range of content moderation use cases. Useful for evaluating the efficacy of content moderation.