See also (wiki): workflow-redesign · assistive-to-agentic-shift · agentic-ai-governance
Executive Summary
- McKinsey QuantumBlack (“Seizing the Agentic AI Advantage,” March 2025, lead authors Sukharevsky, Kerr, Hjartar, Hämäläinen, Bout, Di Leo) finds ~78% of companies report using generative AI, yet roughly the same share report no meaningful bottom-line impact. Only ~1% describe their AI strategy as mature.
- The pattern separating the two groups is not model choice or vendor. It is workflow redesign. Horizontal copilots (M365, Gemini for Workspace, generic ChatGPT seats) yield 5–10% time savings on existing tasks. Vertical, reengineered agent workflows reach 30–50% time savings — and 60–90% in call-center resolution when the process itself is rebuilt around agent capabilities.
- Roughly 90% of vertical agent use cases remain stuck in pilot. Fewer than 10% reach production. The failure mode is not technical; it is organizational — scattered efforts, AI/IT silos, weak data foundations, and leaders who insert agents into workflows designed for humans.
- McKinsey anchors the playbook on four pillars — people, governance, technology, data — and an “Agentic AI Mesh” architecture that keeps agents, models, tools, and data interchangeable to avoid vendor lock-in.
- For a 200–5,000-employee company: one reengineered vertical workflow tied to the P&L will outperform ten horizontal copilot rollouts. The question is which workflow, not which tool.
The Paradox Stated Plainly
Near-universal adoption. Near-absent earnings impact. That is the gap McKinsey documents, and it aligns with every independent data point on the desk: BCG’s AI Radar 2026 showing 6% of companies capturing most of the value, the NBER productivity RCTs showing gains concentrate in redesigned workflows, and the Atlan 200-deployment analysis showing median +159.8% ROI only where workflow redesign preceded tooling.
The McKinsey diagnosis: companies scaled the easy layer (horizontal copilots) and stalled on the hard layer (vertical, function-specific agent deployments that touch the P&L). Copilot licenses get signed because they are cheap per seat and safe to approve. Vertical agent work gets abandoned because it requires owning the workflow — not just buying software.
Source credibility: HIGH for McKinsey’s survey of enterprise adoption posture; MEDIUM for McKinsey-reported case study numbers (no independent control group; client anonymized). Treat the pillar framework as a planning lens, not as an RCT finding.
Horizontal vs. Vertical: The Numbers
| Pattern | Time savings cited | What it requires |
|---|---|---|
| Horizontal copilot (assist human in existing workflow) | 5–10% | License, training, acceptable-use policy |
| Agent automates discrete steps in a workflow | 30–50% | Process mapping, data access, tool integration |
| Workflow redesigned around agent capabilities | 60–90% (call-center resolution example) | Leadership mandate, org design change, change management |
The delta between “5–10%” and “60–90%” is not a better model. It is whether leadership was willing to redesign the work.
McKinsey’s cited case examples:
- Global bank: 50%+ reduction in IT modernization timelines using engineering-team agents.
- Multiagent market-data cleanup: ~$3M projected annual savings.
- Credit-memo restructuring at a financial institution: 60% productivity gain for analysts.
- Retail banking loan processing with agents: 20–60% staff productivity gain.
These case studies are vendor-published (McKinsey has a commercial interest in the agentic thesis) and represent selected wins with no control group and no independent verification. Cross-reference against: METR RCT (experienced developers 19% slower with AI on open-ended tasks), CMU study (40.7% code complexity increase under AI-assisted development), and Atlan’s 200-deployment analysis where median ROI turned positive only after workflow redesign. The direction of the McKinsey signal is consistent with independent evidence; the magnitudes are best treated as upper bounds from cherry-picked wins.
The Four Pillars
McKinsey’s framework:
- People. New roles (agent owners, human-in-the-loop reviewers, workflow engineers) and reskilling for the roles that remain. Not “AI training for everyone” — role-specific redesign.
- Governance. Autonomy has to be bounded by business goals and risk appetite. What is the agent allowed to decide alone, what requires approval, and who owns the outcome when it goes wrong.
- Technology. Scalable infrastructure: orchestration, memory, evaluation, tool integration. The infrastructure most mid-market companies have is a Copilot license and a Slack channel.
- Data. Quality, lineage, and “data productization” — treating data as a maintained asset with an owner, not a file on a share drive.
The pillar that fails first in most mid-market rollouts is data. Not because the data is dirty — it always is — but because no one owns making it AI-ready before the agent is pointed at it.
The Agentic AI Mesh
McKinsey’s architectural recommendation is a modular “mesh” in which agents, data, models, and tools stay interchangeable. The underlying concern: early agent deployments are being built on proprietary stacks that create vendor lock-in at the orchestration layer. If the next generation of models comes from a different provider, or the next orchestration framework outperforms the current one, a mesh architecture lets you swap. A closed stack does not.
For a 500-person company: this is not urgent today. But when the pilot moves to production and the second and third agent ship, lock-in starts to matter. Do not sign a multi-year orchestration contract before the second agent is in production.
Six Barriers McKinsey Names
- Scattered efforts with no top-level direction.
- Immature tooling and talent gaps.
- Technical limitations of first-generation LLMs.
- Silos between AI teams and IT.
- Poor data quality and organization.
- Change resistance from employees.
Four of the six are organizational, not technical. That is the whole thesis.
Key Data Points
| Data Point | Value | Source | Date |
|---|---|---|---|
| Companies reporting gen AI use | ~78% | McKinsey QuantumBlack | Mar 2025 |
| Companies reporting no significant P&L impact | ~80% | McKinsey QuantumBlack | Mar 2025 |
| “Mature” AI strategies | ~1% | McKinsey QuantumBlack | Mar 2025 |
| Vertical use cases stuck in pilot | ~90% | McKinsey QuantumBlack | Mar 2025 |
| Horizontal copilot time savings | 5–10% | McKinsey QuantumBlack | Mar 2025 |
| Agent-automated step time savings | 30–50% | McKinsey QuantumBlack | Mar 2025 |
| Redesigned call-center resolution time savings | 60–90% | McKinsey QuantumBlack | Mar 2025 |
| Retail banking loan-processing productivity gain | 20–60% | McKinsey QuantumBlack | Mar 2025 |
| IT modernization timeline reduction (global bank case) | 50%+ | McKinsey QuantumBlack | Mar 2025 |
Temporal note: the source is Q1 2025 (Tier 2). Results may differ with current models; the qualitative pattern — horizontal deployments underperform vertical reengineered ones — is corroborated by later 2025–2026 evidence (BCG AI Radar 2026, Atlan 200-deployment analysis, NBER coordination RCT).
What This Means for Your Organization
The default path — buy Copilot seats, announce the rollout, wait for the productivity number to appear in the next board deck — is the 5–10% path. It is not wrong. It is just not the path that moves the P&L. The companies moving the P&L picked one vertical workflow, redesigned it around agent capabilities, and accepted that the hard work was inside the organization, not inside the model.
For a 200–5,000-employee company, the sequencing is inverted from how most vendors pitch it. Do not start with the technology choice. Start with the workflow: which function has a defined process, clean-enough data, a willing leader, and a measurable P&L line. That is the pilot. The model, the orchestration framework, and the mesh architecture come second — and all three should stay swappable for at least the first 18 months.
If you want an outside read on which workflow at your organization has the right combination of data, leadership, and P&L leverage to be the first vertical agent deployment — and which pillars will fail first if you do not address them — I’d welcome the conversation. brandon@brandonsneider.com
Sources
- McKinsey & Company, QuantumBlack, “Seizing the Agentic AI Advantage,” March 2025. Authors: Alexander Sukharevsky, Dave Kerr, Klemens Hjartar, Lari Hämäläinen, Stéphane Bout, Vito Di Leo. https://www.mckinsey.com/capabilities/quantumblack/our-insights/seizing-the-agentic-ai-advantage — Credibility: HIGH for framework and enterprise posture survey; MEDIUM for case-study magnitudes (vendor-published, selected wins, no control group).
- Digital Commerce 360, “McKinsey: AI agents, not chatbots, drive future enterprise value,” July 28, 2025. https://www.digitalcommerce360.com/2025/07/28/mckinsey-ai-agents-enterprise-value/ — Secondary coverage; used for quote verification.
- SearchYour.ai, “Seizing the Agentic AI Advantage — McKinsey’s (QuantumBlack),” 2025. https://www.searchyour.ai/en/seizing-the-agentic-ai-advantage-mckinseys-quantumblack — Secondary synthesis; used for pillar and mesh detail.
- Cross-reference base rates: BCG AI Radar 2026 (6% high-performer cohort); NBER AI team coordination RCT (n=7,137); Atlan 200-deployment analysis (median +159.8% ROI after workflow redesign); METR RCT (experienced developers 19% slower, July 2025); CMU code complexity study (+40.7%).
Brandon Sneider | brandon@brandonsneider.com April 2026