Blog

Leadership Is Tracking AI ROI on Engineering Output. The Other 60% of the Team Doesn't Count.

9 min read

Every board deck with an AI adoption slide shows the same metrics: developer story points up, time to PR down, deployment frequency up. These are the numbers engineering leadership tracks, they improve when developers have AI tools, and they make the investment look like a clear win. The problem is that these metrics measure one function in a delivery chain that involves five or six functions. The 60% of the team that doesn't write code doesn't appear in the measurement at all.

This is not a deliberate choice to ignore non-developers. It's the natural result of who owns the AI investment. Engineering bought the tools, engineering measures the impact, engineering reports to leadership on engineering metrics. The downstream effects — QA cycle stretch, support escalation increase, stakeholder alignment degradation — don't show up in the reporting cycle that justified the investment. They show up in sprint reviews and customer calls.

The measurement gap mirrors the access gap

The functions that didn't get AI access are also the functions that aren't being measured in the AI ROI calculation. QA got no tools and no measurement of QA outcomes in the AI report. Support got no tools and no measurement of support outcomes. Product got limited tools and no measurement of spec quality or alignment cost. The ROI calculation is complete for the team that has AI and silent for the teams that don't.

This creates a reporting artifact where AI looks like an unambiguous win. Developer velocity: up. Delivery timeline: unchanged or marginally better. The gap between those two measurements is the cost of one-sided AI deployment, distributed across teams that aren't in the report.

What AI ROI reports measure and what they miss

What the typical AI ROI report measures:
  ✓ Developer story points per sprint (+35%)
  ✓ Time from commit to PR review (−42%)
  ✓ Deployment frequency (+28%)
  ✓ Developer satisfaction scores (+high)

  What it doesn't measure:
  ✗ QA cycle time change (longer — more output to test)
  ✗ Support escalation rate change (higher — more bugs)
  ✗ PM time spent on spec review with engineers (unchanged or higher)
  ✗ Stakeholder alignment meeting count (unchanged)
  ✗ Customer-reported issues post-release (often higher)
  ✗ New hire ramp time for non-developer roles (unchanged)

Where time actually goes in a delivery cycle

Coding is not the majority of time in a software delivery cycle. It's a significant portion, but specification, alignment, testing, review, and post-release support collectively take more time than the code-writing step itself. AI for developers improves the code-writing step. It doesn't improve any of the surrounding steps unless those teams also have AI.

Accelerating the wrong bottleneck is the predictable outcome of tool deployment that measures only code output. When developers write code 40% faster and QA testing stays the same, the constraint moves to QA. When QA moves faster and stakeholder sign-off stays the same, the constraint moves to alignment. The delivery timeline doesn't improve until every constraint in the chain is addressed.

Where time actually goes in a delivery cycle — and where AI helps

Where time actually goes in a delivery cycle:
  Coding: 20–30% of total delivery time
  Spec and alignment: 15–25%
  QA and testing: 20–30%
  Review, staging, sign-off: 15–20%
  Support and iteration: 10–15%

  AI for developers improves: coding (20–30%)
  AI for whole team improves: all of the above

What a complete ROI frame looks like

A complete AI ROI calculation for a software company measures the impact across every function in the delivery chain. Engineering metrics are part of it — they're the most mature and the easiest to track. But a complete picture includes QA cycle time, support escalation rate, PM alignment overhead, and the metric that actually matters to the business: feature-to-customer cycle time.

When those metrics are included, the ROI picture changes. A company where developers are 40% faster but QA takes 60% longer hasn't improved delivery — it has shifted the bottleneck. A company where every role has AI tools and every constraint in the delivery chain has been addressed can show genuine feature-to-customer improvement.

What a complete AI ROI calculation includes

What a complete AI ROI calculation includes:
  Engineering metrics: story points, deployment frequency (already tracked)
  QA metrics: cycle time per sprint, test coverage change, regression rate
  Support metrics: escalation rate, resolution time, repeat incident rate
  PM metrics: spec revision cycles, engineering alignment time
  Onboarding metrics: time-to-productivity for new hires by role
  Total delivery: feature-to-customer cycle time (the actual output metric)

The measurement case for whole-team AI

Leadership that wants to make accurate AI investment decisions needs to expand the measurement frame before expanding the investment. The question isn't just "what does this AI tool cost and what does it do to developer velocity?" It's "what does this AI deployment cost, and what does it do to feature-to-customer cycle time?"

That framing leads naturally to a different procurement question: which roles need AI, and what kind? Developer AI tools are well-understood and well-priced. What's less understood is the ROI of codebase AI for QA, for product, for support, and for operations. The answer is the same as the developer AI ROI — these roles have high-leverage system questions they currently answer slowly or not at all. When they get tools that answer those questions, their part of the delivery chain improves.

Kognita as the missing measurement substrate

Kognita gives every role access to the same codebase the developers query with their AI tools. Product managers, QA engineers, support teams, operations staff, and leadership can all ask codebase questions in plain language and get current-state answers. This doesn't replace developer AI tools — those remain valuable for code writing and developer-specific workflows. It extends system knowledge access to the roles that are currently excluded.

When all roles have codebase access, the metrics that were excluded from the ROI calculation become improvable. QA cycle time shortens because QA engineers know what changed before testing starts. Support escalation rates drop because support can check current system behavior before routing. PM alignment overhead drops because product managers can answer basic system questions without developer time.

Final take

AI ROI tracked on engineering output is not wrong — it's incomplete. The engineering metrics improve. They improve because developers have better tools. The question leadership should be asking is whether the company is delivering faster, not whether developers are writing code faster. Those are different questions that often have different answers when tool access is one-sided.

The companies that get genuine delivery improvement from AI are the ones that extend access beyond engineering. Every role in the delivery chain — product, QA, support, ops — has high-leverage questions about the system that AI can answer faster than the current manual process. That's a measurable improvement that doesn't show up in engineering velocity metrics. It shows up in cycle time, escalation rates, and customer satisfaction.

Tracking AI ROI on engineering metrics only is like measuring restaurant performance on kitchen speed and ignoring wait times, serving accuracy, and customer satisfaction. The kitchen got faster. The restaurant is not obviously better. Measure the whole chain.