Tag: safety
All articles tagged "safety".
-
LLM Evals: A General Framework for Custom Evaluations
A general framework for building rule-based and model-graded evaluations for LLM-based applications.
-
Malicious LLM Prompt Detection in Python
Building a malicious prompt detector using traditional ML classifiers in sklearn, trained on Deepset's prompt-injection dataset.