So You Have an AI Security Budget. Now what?

著者

Snyk Team

2026年6月4日

0 分で読めます

Key takeaways

AI security budgets need to evolve from fragmented tool spending into a unified investment in visibility, governance, and control across the full AI lifecycle.
Organizations should budget for two connected fronts: securing agentic development, where AI agents generate code, use tools, and execute workflows, and securing agentic applications, where agents interact with users, data, APIs, and production systems.
A strong AI security budget should include dedicated funding for AI discovery, agent and model risk assessment, policy enforcement, adversarial testing, runtime protection, and governance evidence.
Governance and compliance should be treated as their own budget line item, not buried inside tooling or pushed downstream to GRC. Teams need continuous discovery, enforceable policies, risk reporting, and audit trails to prove AI systems are being managed responsibly.
Buying disconnected AI security point tools can increase cost and complexity without delivering unified control. A platform approach helps security teams connect AI visibility, risk intelligence, policy enforcement, and evidence across development and production.

Most organizations spend their AI security budget on the wrong layer. The instinct is to just buy visibility to inventory the models, map the APIs, and ship a dashboard.

But visibility alone won’t stop the coding agent that just pulled in a compromised MCP server. It won’t stop the production agent that’s about to forward a customer record to a place it shouldn’t go. It also won’t stop an autonomous workflow from chaining together five individually “allowed” actions into one dangerous outcome.

Visibility without enforcement isn’t governance. Organizations adopting AI agents now need to secure two distinct but connected fronts:

The agents building software
The agents operating inside applications in production

The budget problem security leaders need to solve now isn't whether they can detect AI risk after the fact; it's whether they can govern agents across their entire lifecycle, from how they're built to how they behave in production. That requires securing both: Shifting left to secure AI-generated code and the agent supply chain as software is built, and delivering visibility and governance over agents and applications running in production.

Front 1: Securing the agents building software

In agentic development, AI agents now pull dependencies, invoke MCP servers, execute scripts, chain actions across environments, and generate production-ready code with minimal human review. That creates three risk surfaces traditional AppSec was not designed to control: what agents use, what agents do, and what agents generate.

1. What agents use: MCP servers, skills, and external tools

AI coding agents don’t operate from a blank slate. To complete tasks, they continuously pull in MCP servers, external APIs, reusable skills, open source components, plugins, and third-party tools — any of which may be vulnerable, malicious, or misconfigured.

These components are selected and invoked at runtime and not pre-approved, which means untrusted external tools can be introduced into your workflow in an instant. If a security tool only scans the final artifact, it misses the moment risk entered the workflow: when the agent selected and used an untrusted input.

Snyk research has already identified malicious agent capabilities in shared ecosystems, vulnerable MCP servers, and critical flaws in widely used agent infrastructure. CVE-2025-6514, for example, exposed mcp-remote to full remote code execution.

2. What agents do: Tool execution, scripts, and system access

Agents don’t just suggest; they execute actions autonomously and at machine speed. Once given a goal, they can run commands, modify files, interact with infrastructure, query systems, call APIs, and chain workflows across environments — often with broad permissions, limited supervision, and no human approval loop.

The Replit incident made this failure mode concrete when an AI agent ignored explicit “freeze” instructions, deleted a production database, and fabricated thousands of fake records to hide what it had done.

That kind of incident does not happen because a scanner missed a vulnerable function. It happens because the agent’s dynamic and multi-step behavior was not governed at the moment of action. Agentic development requires controls inside the execution loop, close enough to evaluate context, permissions, and risk before an unsafe action completes.

3. What agents generate: code and dependencies

Agents also generate code and dependencies faster than human review can validate. That output may include vulnerabilities, unsafe dependencies, hardcoded secrets, or misconfigurations.

The issue in agentic development is that AI produces code continuously at machine speed and often outside the review processes that organizations rely on. By the time post-commit scanning runs, the vulnerable output may already be in the repository. In some workflows, it may already be packaged, deployed, or reused by another agent.

Securing generated output means validating code at the moment of creation, before it enters the repo or becomes part of a larger agent-driven workflow. In agentic development, if you don’t secure what agents generate at creation, you’re chasing risk after it’s already been introduced.

Front 2: Securing the agents operating inside applications in production

The second front is production. Most organizations already have agents running inside applications, such as support agents handling customer requests, workflow agents accessing internal systems, autonomous pipelines calling APIs, and AI-native apps that retrieve data, make decisions, and take action without a human in the loop.

The question is whether you know what your agents are doing, and whether you can stop them when they do the wrong thing. Once agents are live, governance breaks down across three dimensions: Discovery, Risk Assessment, and Compliance.

1. Discovery: You can’t govern agents you can’t see

Security teams need a live picture of where AI agents are running, which models they use, which tools they invoke, which data they access, and which workflows they execute.

Most organizations do not have that full picture. Shadow AI compounds the problem as employees and teams build custom GenAI tools, embed agents into workflows, and connect AI systems to internal data without a formal security review. So, an inventory gap is more than just incomplete documentation of known systems; it’s about unknown agents operating outside established channels.

If you do not know which agents exist, you cannot enforce policy against them, assess their risk, prove compliance, or assign ownership when something goes wrong. Inventory is the foundation of governance, but it is not the end state.

2. Risk Assessment: Rules must run where agents act

Many organizations already have AI governance policies. The problem is that most of them live in documents, spreadsheets, or review workflows that agents never encounter. This “paper governance” creates a dangerous false sense of security.

For example, a policy that says “agents cannot expose customer data to external services” only matters if it is enforced when the agent tries to act. Otherwise, the organization has governance on paper and uncontrolled behavior in production.

Agentic applications need policy enforcement at the execution layer, where the rules evaluate what the agent is trying to do, what data it is touching, which tools it is invoking, and whether the action crosses a defined boundary. So, if a manipulated agent tries to send customer data to an external endpoint, the action can be blocked or modified before it completes.

3. Compliance: You need to prove what happened, why, and with what authority

When an agent causes a business problem, security teams need more than just logs. They need a defensible record of what happened: what the agent did, what data it accessed, what tool it invoked, what policy applied, whether the action was allowed or blocked, and who owned the workflow.

The Air Canada case showed that organizations can be held responsible for what an AI chatbot tells customers. The Chevrolet prompt injection incident showed how easily public-facing AI can be manipulated into producing business-impacting outcomes. Toxic flows show how agents can chain together tools, data, and APIs in ways no single action would reveal on its own.

Governance and compliance should be a dedicated AI security budget line item, not something buried inside tooling or treated as a downstream GRC exercise. As AI agents spread across development workflows and production applications, organizations need a way to prove what AI systems exist, which policies apply to them, and whether those policies are being enforced. Without that foundation, the cost of governance shows up later as failed audits, stalled enterprise deals, post-incident investigations, or rushed consulting work to reconstruct an audit trail that should have existed from the start.

Why buying two disconnected tools does not work

AI risk does not stay neatly confined to one layer. A model, agent framework, or MCP server may be introduced in code, deployed through a pipeline, invoked by an application, and then used by an agent to take action in production. If each stage is governed by a different tool, security teams are left stitching together partial views of the same system.

Fragmented tooling creates fragmented control. It leads to:

Inconsistent policies between development and runtime
Duplicate inventories that disagree
Blind spots between build and deploy
Separate audit trails that tell only part of the story
Unclear ownership when an agent-driven incident occurs

It also leads to fragmented costs where each point tool brings its own license, integration work, data pipeline, dashboard, and vendor relationship to manage. Teams often duplicate effort across overlapping discovery, risk analysis, and policy enforcement capabilities while still leaving gaps between them. By the time multiple tools are stitched together, the total cost and complexity can exceed the value of a unified platform and without delivering unified control.

For AI security, consolidation goes beyond a budget decision. It is a governance requirement. Security teams need a connected view of where AI exists, the risks it introduces, how it behaves, and which policies are enforced across its lifecycle. Without that connective tissue, organizations may spend more and still lack the visibility, accountability, and control needed to safely scale AI adoption.

Snyk secures both sides of agentic security from one platform

Snyk’s AI Security Platform secures both sides of the agent lifecycle from one platform. For agentic development, Snyk helps govern what agents use, what they do, and what they generate — from MCP servers and skills to execution and AI-generated code. For production agentic applications, Snyk provides a system of record for AI risk across agents, models, tools, data, and workflows through Evo AI-SPM, turning governance intent into enforceable policy.

The result is one control layer for agent security: shared discovery, shared policy, real-time enforcement, and auditability across development and production.

Together, this creates a unified model for agent security:

One system of record for AI and agent risk
One policy model across development and production
One enforcement layer for unsafe agent behavior
One audit trail for governance and accountability

The question to answer before you finalize the budget

If you have an AI security budget and a leadership team expecting results, the defining question is simple: Can you enforce a security policy against an agent — in development and in production — before an unsafe action completes?

If the answer is no, you do not have agent security and are not adequately prepared for the next agent deleting a production database, hallucinating a policy that makes your company liable, leaking sensitive data, or chaining together approved tools into an unsafe workflow. The organizations that succeed with AI will not be the ones that slow down adoption. They will be the ones who embed continuous control into how agents build software and how agents operate in production.

The budget is approved, and the threat is real. The next step is making sure your AI security program can act before an unsafe action becomes a business incident. Want a practical guide to continuous, enforceable governance? Download the Executive Guide to Operationalizing & Enforcing AI Governance today.

WHITE PAPER

Executive Guide to Operationalizing & Enforcing AI Governance

With AI shifting from static models to autonomous agents, this guide is your roadmap to continuous, enforceable governance.

Download now