Input Filter
Input filter is a deterministic component that sanitizes incoming prompts before they reach the base model. It applies rule-based checks, such as regular expressions to remove or redact suspicious patterns (e.g., known injection strings like "ignore previous instructions" or base64-encoded payloads). For instance, the filter might block prompts exceeding a certain length or containing disallowed keywords.
Last updated