Blog

30 Agents Are Running. Nobody Outside Engineering Knows What They're Building.

9 min read

The sprint board looked fine. Tickets were moving. The velocity number was high. Then the sprint demo started, and within fifteen minutes the product owner was asking about three features that had shipped but were never scoped, and two features that were in scope but had not been touched. The scrum master had no explanation. The developers shrugged — they had been letting their agents run.

This is not a hypothetical scenario in 2026. It is becoming a routine failure mode for teams that have adopted AI coding at scale without updating how non-technical roles monitor what is being built. At one developer using Cursor, the sprint board more or less tracks reality. At ten developers each running two or three parallel agents, the sprint board is a snapshot of what was planned, not what is happening. The actual work is running faster, broader, and further outside ticket scope than any standup or Jira view reflects.

The people who feel this most acutely are the ones who were never inside the codebase to begin with: product owners, scrum masters, and business stakeholders. They are operating on information that was already imperfect and is now structurally disconnected from what agents are actually producing.

Why Jira stops being a reliable source of truth at agent scale

Jira was designed for human-speed development. A developer picks up a ticket, works on it for a day or two, and moves it to In Review. The board reflects a rough approximation of where work stands. The latency between action and status update is measured in hours, which is short enough to be useful.

AI agents break this model. A developer opens a ticket, hands it to an agent, and the agent can have a PR ready in twenty minutes. The ticket closes automatically when the PR merges. Meanwhile, the agent has also touched three other files that were not in scope, made changes in a shared service two teams depend on, and opened a follow-up PR that nobody created a ticket for. None of that is visible in Jira because Jira only knows about tickets — not about what agents actually modified.

Sprint board vs. reality under agent-scale development

What the sprint board shows:
  - 14 tickets In Progress
  - 6 tickets In Review
  - 3 tickets Done

What is actually happening:
  - 30 AI agents running across 10 developers
  - Each agent working on 2–4 tasks simultaneously
  - PRs opening faster than reviewers can check
  - Jira tickets closing before demos are scheduled
  - Work happening against services no one planned to touch this sprint

The problem compounds across a team. Ten developers running agents in parallel means the volume of code changes in a two-week sprint can be an order of magnitude higher than what the sprint board shows. Jira becomes an artifact of planning intent, not a record of what happened in the codebase.

The math of a sprint at AI scale

It helps to be concrete about the volume. Before AI-assisted development, a developer might touch 20–40 files in a focused two-week sprint. With agents running in parallel, that number can easily reach 80–160 files per developer, spread across services they may not have specifically set out to modify. Multiply that across a ten-person team.

What actually gets produced in a 2-week sprint at AI scale

In a 2-week sprint with 10 developers each running 3 agents:

  Code changes:     ~400–800 file modifications
  PRs opened:       40–90 pull requests
  Services touched: often 8–15 across the codebase
  Jira tickets closed: 30–60 (many auto-closed on PR merge)
  Time from ticket open to PR: sometimes under 20 minutes

  What the PO sees in Jira: tickets moving to Done.
  What is actually in those PRs: unknown until the sprint demo.

Forty to ninety pull requests in two weeks is not edge-case activity. On teams actively using agent tooling, this is becoming normal. The PR queue becomes impossible to review comprehensively. Things get merged because the sprint has to close. The product owner and scrum master have no independent way to know what those PRs contain — they are dependent on developers to characterize the work, and developers are themselves operating at a higher level of abstraction than before, delegating the implementation details to agents.

Scrum masters are running blind

The scrum master's job is to keep the sprint on track: surface blockers early, make sure work is aligned with what the team committed to, and protect the team's focus from scope creep. Every one of those functions depends on visibility into what is actually being worked on.

At agent scale, that visibility is gone. The scrum master can see that tickets are moving to Done. They cannot see which services agents are touching. They cannot see whether the work being done corresponds to the work that was planned. Non-technical roles were already one layer removed from the codebase — now they are two layers removed, because even the developers are partially abstracted from the implementation their agents are producing.

Scrum master visibility under agent-scale development

What a scrum master can see:
  -> Jira: ticket status, sprint velocity, who is assigned what
  -> Standup: what developers say they are working on
  -> PR queue: if they check GitHub, the volume and titles

What a scrum master cannot see:
  -> Which services each agent actually modified
  -> Whether agent work is aligned with ticket scope
  -> What was built that had no ticket at all
  -> Which epics are accumulating unplanned changes
  -> What the codebase looks like now versus when the sprint started

The standup is not helping. When a developer says "I'm working on the search redesign," that sentence used to mean the developer was writing code for the search redesign. It now means the developer has an agent running on the search redesign — and possibly three other agents running on other things simultaneously. The scrum master cannot get an accurate picture of team focus from standups anymore. The standup reflects intent, not execution.

Product owners are approving things they cannot see

The product owner's role in a sprint is to verify that what gets built matches what was agreed. That verification has always depended on the sprint demo — a two-week lag between when work happens and when the PO sees it. With AI-speed development, that lag is now filled with an enormous volume of changes that the PO has no way to preview before the demo.

Sprint demos were already stressful when the team shipped six or eight things. Now the team ships twenty-five things, including things the PO did not scope and excluding things the PO thought were in scope. The demo becomes a surprise encounter with the output of two weeks of agent activity. The PO has ninety minutes to review what thirty agents spent two weeks building. That is not a reasonable verification process.

The result is rubber-stamping. Not because product owners do not care, but because the window for real verification is too small and the volume is too high. Features get accepted because the sprint needs to close, not because the acceptance criteria were genuinely met. When Jira context is the only signal non-technical teams have, and Jira no longer reflects what agents actually built, the verification loop breaks entirely.

The sprint demo is now a post-mortem, not a review

There is an important distinction between a sprint demo that surfaces surprises and a sprint demo that surfaces surprises no one can explain. In the first case, the team discovers something they missed and can address it. In the second case, the team discovers something agents built autonomously, nobody scoped it, and the developer who ran the agent is not sure exactly why it went that direction.

This is the situation non-technical stakeholders are increasingly walking into. The demo reveals scope drift — features built outside plan, services modified without tickets, work that cannot be traced back to any agreed requirement. The scrum master cannot explain it because the sprint board never showed it. The product owner cannot evaluate it because they have no independent way to verify what was built against what was planned. The retrospective becomes a conversation about a black box.

The problem is not that AI agents are building wrong things. Often they are building things that make sense in the local context of a developer's session. The problem is that there is no system-level view of what all those agents produced together — no way for non-technical roles to query what changed, compare it to what was planned, and understand the gap before the demo.

What non-technical teams actually need

Non-technical team members do not need a real-time agent dashboard. They do not need to watch agents run. What they need is a system-of-record view of what agents produced — queryable in plain language, connected to the Jira work that was planned, and available before the sprint demo rather than after.

"What services changed in the last two weeks?" is a question a product owner should be able to answer before walking into the demo. "Which Jira epics have active code work against them right now?" is a question a scrum master should be able to answer during the sprint, not retrospectively. "What does the codebase look like now versus when we started the sprint?" is a question a non-technical stakeholder should be able to ask without filing a request to engineering.

Kognita gives non-technical team members exactly this kind of query layer. It indexes the codebase continuously and connects it to Jira, so a product owner or scrum master can ask in plain language what changed, which services are involved, and whether the work aligns with the tickets that were planned. It is not a surveillance tool for agents — it is a system-of-record view that non-technical people can use without technical training.

What non-technical stakeholders can query after a sprint

What non-technical stakeholders can ask Kognita after a sprint:

  "What services changed in the last two weeks?"
  -> payment-service, search-api, user-settings, notification-worker,
     admin-dashboard (5 services; 3 were not in any sprint ticket)

  "Which Jira epics have active code changes against them right now?"
  -> Epic PLAT-14 (infrastructure): 12 files changed, no sprint ticket
  -> Epic PRD-88 (checkout redesign): 31 files, matches 4 sprint tickets
  -> Epic PRD-91 (onboarding flow): 7 files, 0 sprint tickets assigned

  "What did the codebase look like in the search service a week ago vs. now?"
  -> SearchController: 3 new endpoints added
  -> RankingService: scoring algorithm rewritten
  -> No tickets reference either change

Managed AI tooling for non-technical teams is increasingly not optional. The gap between what agents build and what non-technical roles can see is growing with every sprint. Closing that gap requires a layer that speaks plain language to people who live in Jira and board presentations, not GitHub and pull request queues.

Final take

Sprint boards were already a simplification of reality. At agent scale, they become fiction. Tickets close in minutes, work happens across services no one planned to touch, and the volume of output exceeds what any review process can meaningfully verify. Product owners, scrum masters, and business stakeholders are operating on planning artifacts while the actual codebase moves in directions nobody is tracking in real time.

The answer is not to slow down the agents. The answer is to give non-technical roles a way to see what agents are producing — in plain language, connected to Jira, and available before the sprint demo turns into a post-mortem. The sprint board will never catch up to agent-speed development. The system of record needs to.

When thirty agents are running, the sprint board is not the source of truth. The codebase is. Non-technical teams need a way to read it.