
- Jailbreak: Flags prompt-injection and jailbreak attempts.
- Difficulty: Rates how hard a prompt is, for routing simple vs complex models.
- Domain: Classifies the topic or domain of a request.
- Ambiguity: Flags vague or underspecified prompts.
Getting Started
Use a default Reflex in three steps.Train a Custom Reflex
When the defaults don’t match your categories, train your own in one API call. Bring labeled examples, let Morph synthesize a dataset from a description, or hand it unlabeled text to sort. A small Reflex trains in about 30 seconds.Choose your labels
The categories you want to sort text into, like
spam vs not_spam, or frustrated vs neutral.Train a Reflex
Send labeled examples, a description to synthesize from, or unlabeled text to sort. Training returns a
fine_tuned_model you classify against.Train a Custom Reflex
Send labeled examples, get a trained classifier. Full API: create, poll, predict, manage jobs.