KognitaKognita.

Blog

An AI Issue Triage Bot Became a Supply Chain Attack in February 2026. Here's the Anatomy.

9 min read

In February 2026, a documented supply chain attack turned an AI coding tool's issue triage bot into an attack path. The bot had read access to internal repositories — which was the feature that made it useful. Adversarial content in a GitHub issue triggered a prompt injection that caused the agent to read and exfiltrate internal codebase information. No malicious code was deployed. No credentials were directly stolen. But the agent's lack of context boundaries, egress controls, and audit logging meant the attack succeeded before anyone knew it was happening.

Why AI agents are a different security surface

Traditional software is vulnerable through code logic — SQL injection, buffer overflows, insecure API calls. AI agents are vulnerable through language understanding — they read natural language content and act on it. An attacker who can control content that an agent reads can instruct the agent to act on their behalf:

AI agent attack surface vs. traditional software
AI agent attack surface (vs. traditional software):
  Traditional bot:
    → Executes predefined commands
    → Fixed input/output format
    → No language understanding of content
    → Attack: exploit the code logic

  AI agent:
    → Understands natural language instructions
    → Reads arbitrary content and acts on it
    → Can be instructed by attacker-controlled input
    → Attack: prompt injection via user-controlled content

This is the prompt injection problem applied to production agents. A developer running a personal AI tool with their own API key who reads a malicious GitHub issue is unlikely to act on the instruction in it — because they can tell the difference between a bug report and an instruction to exfiltrate data. An agent reading the same issue may not make that distinction. The agent does what the content tells it, because language understanding is the whole point.

What governance would have prevented the attack

The February 2026 incident was preventable. Each of the attack's steps required something that proper scope controls would have blocked:

The incident anatomy and the governance controls that would have stopped it
February 2026 incident (reported by CIO.com):
  Tool:         AI coding tool issue triage bot
  Deployed as:  per-developer GitHub integration
  Attack path:  adversarial issue content → prompt injection
                → agent reads internal repo context
                → agent leaks codebase information externally

  What governance would have caught it:
  → Context boundary: agent should not read production secrets
  → Egress control: agent should not make outbound calls
  → Audit log: first external call would have triggered alert
  → Scope limit: read-only codebase access, no write or call permissions

None of these controls are exotic security requirements. They are basic access governance: read only what you are authorized to read, do not make outbound calls outside defined boundaries, log every action. They are standard for any production service — and absent from most per-developer AI agent deployments because they were never designed as production services.

The incident-then-restriction cycle

Security incidents with AI agents follow a predictable pattern. Before the incident: agents are running with broad permissions because restricting them reduces their usefulness. After the incident: all agents are shut down, a security review consumes months, and when redeployment happens the restrictions are so conservative that the tool is no longer useful for most of what it was originally deployed for:

The governance failure cycle: before vs. after an AI agent incident
AI agent governance: before vs. after an incident
  Before:
    → Agent works, developers are happy
    → No restrictions = maximum productivity
    → No audit = no overhead
    → Risk: unknown

  After a supply chain incident:
    → Emergency decommission of all AI agents
    → 3-month security review before redeployment
    → Restrictions that block most useful use cases
    → Trust damage that outlasts the incident

This pattern is avoidable. The organizations that avoid it are the ones that define scope controls before deployment — not to restrict capability, but to define the boundaries within which the capability operates safely. This is the same principle behind what makes an AI agent runtime secure — security is not an add-on, it is the boundary definition that enables safe deployment.

What scope controls in a managed runtime look like

Effective AI agent security is not about preventing agents from reading codebases — that is the feature. It is about defining precise boundaries for what they can read, what they can call, and what they can produce:

Managed AI runtime scope controls for security
Managed AI runtime scope controls:
  Read boundary:     only repos explicitly authorized
  Write boundary:    no direct writes without human approval
  Network boundary:  no outbound calls outside allowlist
  Secret boundary:   no access to .env, secrets, credentials
  Audit log:         every file read, every tool call, every output
  Anomaly detection: unusual access pattern triggers review

Kognita's managed runtime enforces these boundaries as infrastructure. The agent reads the repositories it is authorized to read. It cannot make outbound calls to arbitrary endpoints. Every file access and tool call is logged. The audit log means that if an incident does occur, the investigation takes hours, not weeks — because the activity is recorded, not reconstructed from memory.

The conversation your CISO needs to have before AI agents ship

Most organizations have the CISO conversation about AI agents after something breaks. The CISO who was not involved in the deployment decision becomes the person who shuts it down. The governance that should have been part of the initial deployment becomes an emergency retrofit.

The productive sequence is: CISO reviews the runtime before the agent deploys. What repos can it read? What can it call? Is there an audit log? What happens if an agent behaves unexpectedly? A managed runtime that provides clear answers to these questions can get through a security review in days rather than blocking deployment for months.

Final take

AI agents with read access to codebases and language understanding of their content have a novel security surface that per-developer tools were not designed to govern. The February 2026 supply chain incident is a documented example of what that surface looks like when exploited.

Scope controls are not a restriction on what AI agents can do — they are the definition of what they are authorized to do. That definition is the only thing standing between a productive agent and a supply chain incident.