Risk Intelligence goes beyond high-level risk buckets; every score breaks down from category to specific attacker goals, so you know exactly which guardrails to build. Evo scores each discovered model across five security categories using Attack Success Rate: the percentage of real adversarial attacks that succeed, on an independent 0–1000 index per category. Because context changes risk, models are also tested inside realistic agent archetypes: coding agents, internal data agents, personal assistants, customer-facing chatbots, so you see how risk shifts with deployment.