Blog

ChatGPT, Copilot, Cursor — What None of Them Do for Your Product Manager

9 min read

A product owner at a 40-person SaaS company spent three hours last Tuesday trying to find out what the engineering team shipped in the previous sprint. She checked Jira — tickets were in "Done" but with no meaningful description of what changed. She looked at Confluence — the sprint notes were two weeks old. She asked ChatGPT to summarize what her team built. ChatGPT gave her a thoughtful, fluent, entirely fictional response with no connection to any code her team had written. She ended up scheduling a 30-minute call with the engineering lead to ask what was actually in production. This is not an AI literacy problem. She uses AI constantly. The problem is that none of the AI tools she has access to were built for the questions she actually has.

Product managers and product owners have been told, repeatedly, that AI will change how they work. And it has — in certain ways. ChatGPT helps write PRDs. Notion AI helps structure meeting notes. Some PMs use Claude to draft user stories. But the questions that consume the most time and create the most friction are not writing questions. They are verification questions, status questions, and system questions. What shipped? Does it match what was scoped? What changed in the codebase that I need to know about? Not a single mainstream AI tool answers any of these — and the reason is structural, not cosmetic.

ChatGPT: great at writing, useless at your sprint

ChatGPT is genuinely useful for a product manager. You can use it to draft acceptance criteria, rewrite a PRD section, summarize a competitor's feature set, or brainstorm edge cases. It is an excellent thinking partner for work that lives in documents and language. The moment you ask it something grounded in your actual product — "what did my engineering team ship this sprint?" — the wheels come off. ChatGPT has no access to your codebase. It has no access to your Jira. It doesn't know what PROD-441 says, what the checkout service does, or whether the payment flow that launched on Thursday matches what the ticket described. It will generate a confident-sounding answer if you let it. That answer will be made up.

The "write me a PRD" use case is real and valuable. The problem is that writing the PRD is 20% of a product owner's verification burden. The other 80% is confirming that engineering built what the PRD described — and ChatGPT has zero ability to help with that half. It can help you articulate the requirement. It cannot tell you whether the requirement was met.

GitHub Copilot: not even in the right category

GitHub Copilot is a code autocomplete and in-editor assistant for software engineers. It helps developers write functions faster, suggests test cases, and explains code snippets in context. It is an excellent developer tool. A product owner cannot use it for anything product-management-related. There is no interface for asking "what changed in the search service this sprint." There is no way to query it against Jira tickets. It has no concept of acceptance criteria, sprint scope, or roadmap alignment.

This sounds obvious stated plainly, but the confusion is real: organizations roll out Copilot as their AI strategy and product managers wonder why it doesn't help them. Copilot's target user is an engineer writing code. It has no ambitions beyond that lane. Asking a product owner to use Copilot for sprint visibility is like asking them to use a compiler. The category is wrong.

Cursor: the tool your engineers love that you can't open

Cursor is an AI-first code editor. Engineering teams that have adopted it tend to love it — it has good codebase indexing, strong in-editor AI assistance, and increasingly capable agentic features. It is also completely inaccessible to a non-technical product owner. Even if you opened Cursor and pointed it at your repository, what you would see is raw code. Cursor would invite you to write code or modify files. The interface assumes you know what a function is, what a module does, and how to read a diff. Most product owners do not, and they should not need to. That is not their job.

There is a version of Cursor's question-answering that sounds promising — you can ask it things like "what does this function do?" and get a plain-language explanation. But the questions a product owner actually has are not "what does this function do?" They are "what did we ship?" and "does it match PROD-441?" and "what services were unexpectedly modified?" Cursor cannot answer those questions in any form a non-technical user can act on.

PM question vs. what each AI tool can actually answer

PM question vs. what each AI tool can actually answer:

  "What did the team ship this sprint?"
  -> ChatGPT:   No access to your codebase or Jira. Can't answer.
  -> Copilot:   Code autocomplete tool. Has no concept of your sprint.
  -> Cursor:    Developer editor. Would show you raw diffs. Unhelpful.

  "Does this implementation match PROD-441?"
  -> ChatGPT:   Has no idea what PROD-441 says or what was built.
  -> Copilot:   Not designed for ticket-to-code verification. At all.
  -> Cursor:    Could open the file, but won't map it to a ticket.

  "What services did the search redesign touch?"
  -> ChatGPT:   Requires codebase knowledge it doesn't have.
  -> Copilot:   Could suggest code in those files. Can't enumerate them.
  -> Cursor:    Developer would know. You'd need to ask one.

  "What's in production that wasn't on the roadmap?"
  -> ChatGPT:   Would need both your codebase and Jira to answer this.
  -> Copilot:   Entirely out of scope. This is not what it does.
  -> Cursor:    Out of scope. You'd need an engineer to run the query.

The questions that none of them answer

The PM's actual question stack is consistent across every product organization. It comes up every sprint, in every standup, in every sprint review. It shows up when a stakeholder asks why something isn't in production yet. It shows up when a customer files a bug against a feature that wasn't supposed to be live. The questions are: what changed in checkout this sprint? Does this implementation match the ticket? What services did the search redesign touch? What's in production that wasn't on the roadmap?

Answering any one of these requires two things: codebase state and Jira state, reconciled in plain language. Product owners are constantly stuck in translation — between what engineering built and what the ticket said, between what Jira shows and what's actually deployed. ChatGPT, Copilot, and Cursor each have one half of one side of this problem. None of them have both halves. None of them reconcile codebase and Jira. None of them return a plain-language answer a non-technical product owner can act on without a follow-up call to an engineer.

The PM's question stack — what they need each sprint and where each answer lives today

The PM's question stack — what they need each sprint and where each answer lives today:

  Sprint completion questions
  -> What got shipped?               Source: Sprint demo (biweekly, 90 min)
  -> What's still in progress?       Source: Jira (if updated)
  -> What moved scope?               Source: Nobody knows until retro

  Acceptance verification questions
  -> Does the build match the ticket? Source: Ask an engineer (days delayed)
  -> Is the feature live in prod?     Source: Ask an engineer or QA
  -> What exactly did it change?      Source: Read the PR (requires GitHub)

  Architecture and impact questions
  -> What services were touched?      Source: Read the PR or ask an engineer
  -> What broke that wasn't expected? Source: Find out when a bug is filed
  -> What shipped outside scope?      Source: Sprint demo, too late to act

  Result: almost every answer requires either an engineer's time or a biweekly wait.

The "write me a PRD" fallacy

There is a class of AI use case that looks like it solves the product owner's problem but doesn't. ChatGPT can help you write a PRD. It can help you structure acceptance criteria, generate user story variants, and fill in edge cases you might have missed. This is genuinely valuable. The fallacy is treating it as the full solution to the PM's AI need.

Writing the requirement is upstream of the real problem. The real problem is verifying that the requirement was met after engineering built something. A well-written PRD is useless if you have no way to check whether engineering implemented what it described. AI tools that help PMs write better requirements are useful. AI tools that help PMs verify those requirements against what shipped are the missing half — and that half doesn't exist in ChatGPT, Copilot, or Cursor.

The Engineering Manager and VP of Engineering both feel this gap too, from a different angle. They want product to be able to self-serve status questions rather than routing them through engineering every time. When AI makes shipping faster, verification can't stay slow — but the tool that solves verification for the PM doesn't exist in any mainstream AI suite.

Why this matters more as engineering moves faster

AI coding tools — Claude Code, Cursor in agent mode, Cline, GitHub Copilot Workspace — are not just making individual developers faster. They are changing the pace at which engineering teams ship. Features that took two weeks now take three days. Sprints that used to produce eight tickets now produce twenty-four. The engineering output rate has jumped. The product owner's visibility has not.

When engineering ships once a sprint, a biweekly demo is a reasonable way to stay informed. When engineering ships continuously, a biweekly demo means the PM is always discovering what happened two weeks ago. That gap — between continuous engineering output and periodic product visibility — is exactly where scope drift, unverified acceptance criteria, and production surprises live. The answer is not a better PRD template. The answer is a tool built for the PM's actual questions.

What a tool built for the PM's questions actually looks like

Kognita was built for the questions that ChatGPT, Copilot, and Cursor cannot answer. A product owner can ask "what changed in checkout this sprint?" and get a structured answer: which services were modified, which Jira tickets those changes correspond to, and whether what was built matches what the tickets described — in plain language, without a GitHub login, without reading a single line of code. A CTO can ask "what's in production that wasn't on the roadmap?" and get an audit of deployed features cross-referenced against accepted Jira epics. A VP of Engineering can point a product owner at Kognita instead of routing every status question through an engineer interrupt.

The difference is not that Kognita is smarter than ChatGPT or more capable than Cursor. The difference is that it was built with the PM's question as the unit of value — not the engineer's code, not the document, not the chat interface. Product teams need answers, not access — not a code editor, not a raw Jira query, not a chat bot that makes things up when it doesn't have the information. They need a system that understands both what was scoped and what was shipped, and can tell them the difference.

What Kognita returns for the questions ChatGPT, Copilot, and Cursor cannot answer

What Kognita returns for the questions ChatGPT, Copilot, and Cursor cannot answer:

  Query: "What changed in checkout this sprint?"
  -> Lists modified services, files changed, linked Jira tickets, merged PRs
  -> No GitHub login required. No engineer interrupt.

  Query: "Does the PROD-441 implementation match the acceptance criteria?"
  -> Reads the ticket's acceptance criteria from Jira
  -> Reads the actual implementation from the codebase
  -> Returns a plain-language gap analysis

  Query: "What services did the search redesign touch?"
  -> Traces service dependencies modified across the sprint's commits
  -> Returns affected services, owners, and whether changes were in scope

  Query: "What's in production that wasn't on the roadmap?"
  -> Cross-references deployed features against accepted Jira epics
  -> Surfaces gaps the sprint demo never would have caught

Final take

Product managers are not failing to get value from AI. They are using ChatGPT for writing, Copilot doesn't apply to them, and Cursor is a developer tool they cannot use. The tools that exist are good at the things they were designed for — and none of them were designed for the product owner's sprint questions.

The gap is structural. You cannot fix it by prompting ChatGPT more cleverly or convincing your PM to learn how to use Cursor. The PM's question stack — what shipped, does it match the ticket, what changed, what's unexpectedly in production — requires a tool that has both codebase context and Jira context and can return a plain-language answer to a non-technical user. That tool is not in the current mainstream AI suite.

The failure is not that product managers are bad at using AI. It is that every mainstream AI tool was built for the engineer at the keyboard — and the product owner's question, the one that costs the most time and creates the most friction every sprint, has been left completely unanswered.