--- sidebar_label: False Positives description: Red team false positive detection by enhancing grader context and scoring precision to prevent inaccurate vulnerability assessments in AI security testing workflows --- # Preventing False Positives False positives occur when a test case is marked as passing when it should have been marked as failing or vice versa. These inaccuracies typically arise when the grader lacks sufficient context about your target application to make proper assessments. By providing comprehensive context to Promptfoo, you can ensure accurate results. ## Understanding How the Grader Makes Decisions When you run a red team scan against a target, Promptfoo evaluates the results and determines whether the output passes or fails. These results are determined by a model, `gpt-5` by default. A **pass** score means that the output did not violate your application's intended behavior and returned an output that conforms with your requirements. A **fail** score means that the output deviated from your application's intended behavior. Pass and fail scores are separate from **errors**, where the output could not be parsed. The grader is also separate from the vulnerabilities results, which determines the severity of findings and details about remediations. Scores range from 0 to 1 and help guide the judge in agentic cases to distinguish outputs that are more impactful or higher risk. A score of `0` means there is a complete jailbreak or violation, whereas a score of `1` indicates the output fully passed without any compromise. ## Your Role in Ensuring Accurate Results ### 1. Provide Detailed Target Purpose Information The accuracy of grading directly depends on how thoroughly you define your application's behavior. We strongly recommend that you provide detailed information in the `Purpose` property of your target setup. Without this crucial context, the grader cannot correctly interpret whether outputs comply with your intended application behavior. #### Example of Well-Defined Purpose (Leading to Accurate Results) ```text The user is an employee at the company. The target system is a chatbot that provides access to company wide information. The chatbot should maintain a cordial, professional tone and provide courteous, concise answers that are to the point. There are multiple types of users (basic, HR, executive) with different access levels. This user is a basic employee with access to: - HR policies like expenses, vacation days, benefits and the company handbook - Company history - General information about the company and its products The user should not have access to: - Any confidential documents - Information about other employees - Sensitive information about the company like upcoming acquisitions or strategic plans ``` #### Example of Insufficient Purpose (Likely to Cause False Positives) ```text This is a company chatbot for employees. ``` When you provide vague descriptions like the one above, the grader lacks the necessary context to properly evaluate whether an output should pass or fail, resulting in false positives. ### 2. Calibrate Graders with Custom Examples #### In Promptfoo Enterprise Plugin-specific examples can be used for advanced grading. Without these examples, the grader lacks your specific interpretation of what constitutes acceptable vs. unacceptable outputs. Within Promptfoo Enterprise, you should customize the grader at the plugin level by providing clear examples of what you consider passing and failing outputs, along with your reasoning. The more detailed your examples, the better the grader can align with your expectations.