See also (wiki): wiki/ai-adoption-scaling.md, wiki/ai-change-management.md, wiki/ai-delivery-pods.md, wiki/ai-pilot-to-production.md, wiki/workflow-redesign.md
Executive Summary
- Surface-level adoption is the norm, not the exception. Deloitte (n=3,235, 2025) finds 37% of enterprise AI deployments achieve surface-level adoption — access without embedded workflow use. The gap between tool access and business-outcome-producing usage is the largest single source of unrealized AI ROI in the Fortune 500.
- Adoption is an engineered outcome, not a cultural one. Rewired (Lamarre, Smaje, Levin, 2nd ed., 2024, Ch. 30, p.457) states the principle directly: organizations that treat adoption as a consequence of good technology consistently underperform those that design adoption explicitly — with dedicated roles, visible incentives, and ongoing measurement.
- The reuse model is the economic mechanism behind compound AI returns. Rewired (Ch. 31, p.469) and BCG (2025) converge on the finding that top-quartile AI performers build each domain on the infrastructure of the last. The median gap between first and fifth deployment cost is 62% for organizations with reuse infrastructure versus 95% for those without (BCG, 2025).
- Change capacity is measurable and finite. Prosci (n=1,107, 2025) finds 73% of organizations are at or near change saturation. Fortune 500 programs with 8–12 concurrent AI deployments across departments are statistically guaranteed to exceed their organizational change capacity, producing quiet non-compliance rather than visible resistance.
- The KPI hierarchy matters. MIT CISR’s finding that Stage 1 organizations (tool deployment without workflow redesign) underperform their industries financially despite high tool usage rates is the empirical case for measuring Tier 3 business outcomes — P&L line items — as the authority, not Tier 1 system performance metrics.
Part 1 — The Adoption Engineering Framework
Why adoption fails: the four root causes
The post-mortem pattern from enterprise AI adoption failures is remarkably consistent. Across the BCG, Deloitte, Prosci, and Writer research bases, four root causes account for the majority of underperforming deployments:
1. No adoption owner. The deployment launched with an engineering team (who built it), a change manager (who sent the training invite), and nobody accountable for whether the workflow actually changed. The most common pod composition failure: the “change lead / adoption owner” role documented in wiki/ai-delivery-pods.md is the role cut first when pod budgets are under pressure.
2. No benefit visibility at the user level. Enterprise-level ROI projections do not motivate frontline adoption. BCG AI at Work 2025 (n=10,600) finds that individual-level benefit visibility — a specific workflow task that the AI makes measurably easier — is the strongest predictor of frontline adoption. Employees who can point to “the AI handles the first draft of every case summary, which saves me 45 minutes per claim” adopt at 3x the rate of employees who have received a message about enterprise productivity benefits.
3. No visible personal upside. Writer / Workplace Intelligence (n=1,600 US executives and knowledge workers) finds 31% of workers admit active sabotage of AI rollouts — rising to 41% among Gen Z/Millennials. The root cause in most cases is the absence of a credible personal upside for the employee. If the AI reduces task time but that time saving accrues to headcount reduction rather than role enrichment, skill development, or reduced administrative burden, the rational employee response is to minimize the AI’s apparent effectiveness.
4. No midstream adjustment rhythm. Adoption programs that launch and move on — no feedback loop, no weekly metrics review, no mechanism to detect and correct problems within the first 90 days — consistently end up measuring the same failing metrics at the 12-month mark that were visible at the 90-day mark. The adjustment that would have cost $50K at 60 days costs $500K at 12 months because the failure mode has embedded itself in workflows and user behavior.
The adoption engineering model
The five components of designed adoption, mapped to the timeline after go-live:
| Component | Timeline | Responsible role | Measurement |
|---|---|---|---|
| Pre-go-live benefit briefing | Week −4 to −1 | Adoption owner + direct managers | Manager confirmation that briefing delivered |
| Incentive structure activation | Day 1 | HR + domain owner | Formal policy documented before launch |
| Leading indicator baseline | Week 1–2 | Adoption owner | Workflow-change rate; feedback submission rate |
| First adjustment cycle | Week 3–4 | Pod (workflow designer + ML engineer) | Regression test post-adjustment; user satisfaction delta |
| 60-day adoption review | Day 60 | Domain owner | Tier 2 workflow metrics; leading indicator trends |
The 60-day adoption review is the go/no-go decision point for the second pod cycle. If Tier 2 workflow metrics are not directionally positive by Day 60, the pod does not advance to the next domain — it spends the next 30 days diagnosing and remediating the adoption failure.
Part 2 — The Reuse Infrastructure
Why each domain should be cheaper than the last
The “scattered pilots” pattern that Rewired (Ch. 3) identifies as the primary anti-pattern produces not just zero P&L impact but also zero organizational capital: each new deployment starts from scratch because no reuse infrastructure was built in the previous one.
The reuse model flips this: Domain 1 investment is deliberate about what gets built as shared infrastructure, not just as deployment-specific tooling. The four shared assets:
Data infrastructure reuse. The data pipeline built for Domain 1 — ingestion connectors, normalization logic, quality monitoring, access governance — is the starting point for Domain 2 if Domain 2 draws on overlapping data sources. A supply planning AI and a demand forecasting AI share logistics, inventory, and sales data; sequencing them adjacent to each other extracts this reuse benefit. BCG documents the outcome: top-quartile AI performers redeploy the outputs of high-performing workflows as inputs to adjacent ones.
Evaluation framework reuse. The ground-truth test sets, scoring criteria, and regression benchmarks built for Domain 1 become the template for all subsequent workflows. Domain 2’s eval engineer starts from the Domain 1 framework and adapts it — the template design is done, the methodology is proven, the automated scoring pipeline exists. Organizations that invest in evaluation infrastructure for Domain 1 deploy Domain 2 at 30–50% of the Domain 1 evaluation setup cost.
Change management template reuse. The resistance patterns, WITFM communication, manager briefing kit, and escalation scripts developed for Domain 1’s workforce are structurally reusable. The specific workflow impacts differ; the psychological and organizational dynamics are consistent. A 50-person claims department and a 50-person supply chain team experience AI adoption through the same emotional sequence (awareness → anxiety → skepticism → trial → embedding). The communication templates adapt; the architecture reuses.
Governance artifact reuse. The AI acceptable use policy, risk classification framework, escalation paths, and review cadences developed for Domain 1 become the governance baseline for Domain 2. The AI center of excellence maintains this library; pods adapt the artifacts rather than generating them from scratch.
BCG reuse economics (2025): The median gap between first and fifth deployment cost (as a percentage of Domain 1 build cost) is 62% for organizations with reuse infrastructure versus 95% for those without. Put differently: an organization that builds reuse infrastructure cuts its Domain 5 build cost to 38% of Domain 1. An organization that does not still pays 95% of Domain 1 cost for Domain 5. The total program cost difference over five domains is roughly 2.2x.
Building for reuse from Day 1
The reuse architecture decisions that must be made at Domain 1 design time — not retrofitted later:
-
Use the central data platform, not a deployment-specific data store. It is faster to build a Domain 1-specific database. It does not compound. Every subsequent domain will rebuild the integration work. The enterprise data platform (Snowflake, Databricks, Azure Synapse, or equivalent) is the reuse mechanism.
-
Design the evaluation framework as a template, not a one-off. The ground-truth test sets and scoring scripts built for Domain 1 should be stored in a shared evaluation library with documented methodology. The eval engineer who builds Domain 1’s framework should document it as if they expect someone else to run Domain 2 — because they probably will be.
-
Write the governance artifacts in reusable form. The AI acceptable use policy should reference the domain-neutral principles and include a domain-specific appendix — so Domain 2 appends a new appendix rather than writing a new policy.
-
Build the change management assets as a library, not a project. The manager briefing template, the WITFM communication framework, and the first-30-days adoption monitoring checklist should live in the CoE’s shared drive, not in the Domain 1 project folder.
Part 3 — KPI Hierarchy and Measurement
The three-tier KPI structure
The measurement architecture that connects engineering performance to business outcomes — the gap that most Fortune 500 AI dashboards fail to close:
Tier 1 — System health (engineering accountability)
Updated daily, automated where possible. Audience: pod engineering team.
- Task success rate: % of queries/requests where the AI output meets the defined quality threshold
- Error rate: % of outputs flagged for human review, by error category
- Latency: median and 95th percentile response time
- Retrieval accuracy (for RAG systems): % of retrievals that are relevant to the query
- Hallucination flag rate: % of outputs where the AI assertion is unverifiable or contradicted by source documents
Rewired (Ch. 5, p.94) specifies these metrics explicitly as the evaluation discipline required for agentic deployments. The eval engineer owns this tier.
Tier 2 — Workflow performance (domain pod accountability)
Updated weekly. Audience: domain owner + pod.
- Cycle time: end-to-end process time for the AI-enabled workflow vs. baseline
- Throughput: output volume per unit of time or headcount
- Quality defect rate: downstream errors, rework rate, exception escalations
- Escalation rate: % of AI decisions escalated to human review (trend matters: a rising escalation rate is an early warning signal of model drift or distribution shift)
- User adoption depth: % of workflow steps where users are actively engaging with AI output vs. bypassing it
Tier 3 — Business impact (domain owner and CFO accountability)
Updated monthly. Audience: CEO, CFO, board.
- P&L line item change: cost per unit, revenue per workflow, gross margin contribution
- Headcount-to-output ratio: if the AI is reducing the denominator, this shows up here
- NPS/CSAT delta for customer-facing workflows
- Working capital impact for financial workflow improvements (cycle time reduction releasing trapped capital)
The governance discipline: Tier 1 and Tier 2 metrics are inputs to the domain owner’s monthly Tier 3 report. An AI system with excellent Tier 1 metrics and flat Tier 3 metrics is not a success from the CFO’s perspective — it is an unresolved workflow design problem.
What the KPI hierarchy looks like in practice
E.ON Next’s customer service AI deployment (Rewired Ch. 5, pp.79–80) is the most fully documented case in the Rewired corpus. The metrics:
| Tier | Metric | Result |
|---|---|---|
| Tier 1 | Task success rate | +14% transaction success |
| Tier 2 | Average handle time | −8% |
| Tier 2 | Workflow quality | Sustained over 12 months (continuous eval loop) |
| Tier 3 | CSAT | +6 percentage points |
| Tier 3 | Cost-per-call | ~50% reduction |
The E.ON Next case is notable for two reasons beyond the outcome metrics: the team maintained a weekly evaluation loop (not a one-time launch review), and the Tier 3 improvements held 12 months post-launch because the evaluation loop caught and corrected performance degradation before it reached the user experience.
Part 4 — Domain Sequencing for Scale
The three sequencing variables
Change capacity budget. Prosci (n=1,107, 2025) finds 73% of organizations are at or near change saturation — the point where additional simultaneous change initiatives produce net-negative absorption. Before opening a new domain, assess: what other transformation initiatives are running simultaneously in the target department? A supply chain department mid-way through an ERP upgrade has consumed most of its change capacity before the AI pod arrives.
The diagnostic: how many significant change initiatives are active in the target department in the past 18 months? More than two is a yellow flag; more than three is a red flag that the department cannot absorb another major change without one failing.
Data infrastructure dependencies. Domains that share underlying data sources should be sequenced adjacent to each other — not because of workflow similarity, but because the data infrastructure built for Domain 1 accelerates Domain 2 when they draw from the same underlying sources.
Practical test: map the data sources required for each candidate domain. Domains sharing 50%+ of data sources are infrastructure-adjacent; sequencing them adjacent extracts 30–50% of the data setup cost for Domain 2.
Organizational readiness gradient. BCG’s adoption research identifies that departments with high manager AI champion density and existing digital workflow fluency absorb AI adoption 3–4x faster than those without. Sequencing to the ready departments first builds visible wins that fund and politically support subsequent deployments — even if the ready departments are not the highest-value domains.
The sequencing principle: do not pick the highest-value domain for Domain 1. Pick the domain that maximizes the probability of a visible win within 90 days of launch. The Domain 1 win buys the political capital for the high-value Domain 2 and 3 investments.
The domain sequencing decision matrix
Score each candidate domain on the three variables before committing the next pod:
| Variable | High readiness (3) | Medium readiness (2) | Low readiness (1) |
|---|---|---|---|
| Change capacity | No other major change initiatives in past 12 months; active AI champion in place | One other significant change initiative; AI champion nominated | Multiple concurrent changes; no AI champion |
| Data adjacency to previous domain | 50%+ data source overlap | 25–50% overlap | <25% overlap |
| Manager readiness | >70% of managers in department have completed AI facilitator training | 40–70% trained | <40% trained |
- Score 7–9: Priority next domain
- Score 5–6: 90-day prep window before launch (champion activation, manager training, data source mapping)
- Score 3–4: Defer to Year 2 roadmap; substantial prerequisites required
The Year 1 to Year 2 transition
The inflection point in the enterprise AI scaling program is the Year 1-to-Year 2 transition: from “building the capability” to “operating the capability at scale.”
The MIT CISR Stage 2-to-3 transition characteristics that mark the Year 2 target state:
- At least one domain pod running at production grade with documented Tier 3 impact
- Data infrastructure serving at least two domains from shared platform
- Evaluation framework library with at least two completed domain templates
- AI center of excellence with governance cadence (not just approval authority)
- Internal AI champion network with at least one active champion per major business unit
Organizations that reach the Year 2 transition with all five components in place are positioned for the 3–5x increase in domain coverage that characterizes Stage 3 organizations. Those that reach Year 2 with only technology deployment (and without the governance, champion network, and shared infrastructure) are in Stage 2 with increasing overhead costs and declining marginal returns from each new deployment.
See research/09-ai-adoption-cycle/year-2-ai-roadmap-scaling-beyond-pilot.md for the Year 2 planning template.
Part 5 — The Midstream Adjustment Discipline
Rewired’s Chapter 33 principle in practice
Rewired (Ch. 33, p.501) acknowledges what most AI program documentation avoids: the first plan will be wrong. The discipline is not plan perfection — it is a structured re-planning rhythm that detects and corrects before errors compound.
The four-week pod cycle (see wiki/ai-delivery-pods.md) is the unit of the re-planning rhythm. The adjustment types that a well-functioning pod addresses within the four-week cycle:
Prompt/instruction adjustment (Days 1–7 turnaround): The AI is generating outputs that meet technical quality standards but fail domain-specific quality requirements. Root cause: the task specification was wrong or incomplete. The workflow designer and ML engineer revise the system prompt and instruction set; the eval engineer verifies against the ground-truth test set before re-deployment.
Data quality adjustment (Days 7–14 turnaround): The AI is generating outputs with high retrieval recall but incorrect factual content. Root cause: the underlying data source has quality issues the initial data quality assessment did not surface. The data engineer and domain expert investigate and remediate; the evaluation team verifies before re-deployment.
Workflow design adjustment (Days 14–30 turnaround): The AI is technically performing correctly but user adoption is flat. Root cause: the workflow design does not fit how users actually work. The workflow designer conducts 5–10 user observation sessions, identifies the mismatch, and redesigns the handoff points. Domain owner approves the new design before implementation.
Value capture adjustment (30–60 days turnaround): Tier 1 and Tier 2 metrics are positive but Tier 3 impact is not materializing. Root cause: the workflow was improved but the freed time or capacity is not being redirected to the metric that matters. The domain owner redesigns how the productivity gain is captured — a management and process design problem, not an AI engineering problem.
Failure mode: the midstream pivot that becomes a restart
The midstream adjustment discipline fails when the pod treats an adjustment as a reason to restart the deployment from scratch — going back to design, re-doing stakeholder alignment, requesting a new budget cycle. The correct response to most midstream problems is a bounded adjustment within the existing framework, not a restart.
The test for a bounded adjustment vs. a restart: “If this adjustment solves the identified problem, does the deployment continue to its original target state?” If yes, it is an adjustment. If no — if the problem reveals a fundamental flaw in the domain selection or workflow design — it is a restart, and the governance process for sunsetting a pilot should be initiated with transparency rather than continued investment in a deployment that has failed structurally.
Part 6 — Practitioner Evidence
“My fundamental message to our teams is we’re going to rewrite the org charts. We are going to rewrite how work gets done.” — Steve Chase, Vice Chair AI & Digital Innovation, KPMG US · April 2026
The KPMG approach to scaling: Chase describes designing each workflow change as an organizational redesign, not a technology rollout. The adoption and scaling implication: the workflow change is the product, not the AI tool. Source: research/13-multimodal-sources/enterprise-ai-innovators/2026-04-14-bold-fast-responsible-workflows-with-kpmg-us-vice-chair-ai-d.md
“One of the blockers of adoption is sabotage. AI is scarier than other humans doing it because your livelihood is on the line. So they look for errors as a way to do a ‘gotcha’ moment and have the pilot fail.” — Arya Bolurfrushan, Founder and CEO, Applied AI · April 2026
The adoption implication: the most dangerous adoption failure mode is not passive non-compliance but active sabotage by workers who experience the AI deployment as an economic threat without a visible personal upside. Designing the incentive structure before launch — and communicating it before resistance forms — is the preventive intervention. Source: research/13-multimodal-sources/ai-for-the-c-suite/2026-04-14-ayra-bolurfrushan-most-companies-are-thinking-about-ai-compl.md
“Nobody’s coming. The leaders need to lead their own AI transformation.” — From AI for the C-Suite · April 2026
The scaling implication: organizations that wait for employee enthusiasm to organically generate adoption momentum wait indefinitely. The adoption owner role, manager training sequencing, and champion network are the organizational structures that create the momentum enterprises mistake for organic adoption. Source: research/13-multimodal-sources/ai-for-the-c-suite/2026-04-14-nobodys-coming-why-leaders-must-lead-their-own-ai-transforma.md
Part 7 — Actionable Next Steps
For the program sponsor at the 90-day post-launch review:
- Pull the Tier 2 workflow metrics for the first deployed domain. If cycle time or throughput has not moved directionally by Day 90, identify which of the four root causes (no adoption owner, no benefit visibility, no personal upside, no adjustment rhythm) applies. Each has a specific 30-day remediation.
- Confirm that Domain 1’s shared infrastructure — data pipeline, eval framework, change management templates, governance artifacts — is documented and in the CoE’s shared library before approving Domain 2. If it is still “in the pod,” it will not reuse.
For the domain owner preparing the Year 1 CFO report:
- Report Tier 3 metrics only. Lead with the P&L line item that moved, the baseline, the current number, and the 12-month trajectory. Do not lead with activity metrics or technology metrics. The CFO’s question is “did this produce business impact?” — answer that question directly.
- If Tier 3 has not moved, present the specific hypothesis for why (workflow design gap, data quality issue, change saturation, incentive misalignment) and the specific 90-day remediation plan. Do not present the absence of Tier 3 impact without a diagnosis and a time-bound fix.
For the CIO preparing the Year 2 scaling roadmap:
- Audit which of the five Year 2 target state components are in place: production pod, shared data infrastructure, eval framework library, CoE governance cadence, champion network. Each gap is a 90-day remediation item before the Year 2 scaling program launches.
- Run the domain sequencing matrix against the Year 2 candidate list. Sequence to the high-readiness domain, not the highest-value domain. The high-value domain gets the Domain 3 slot after the Year 2 win has been banked.
Related Wiki Articles
- wiki/ai-adoption-scaling.md — concept page; adoption engineering model; reuse scaling model; KPI hierarchy; midstream adjustment rhythm
- wiki/ai-change-management.md — structural discipline behind adoption engineering; ADKAR for AI; resistance patterns; manager-champion 8.7x multiplier
- wiki/ai-delivery-pods.md — the operating unit that runs the four-week adoption rhythm; pod composition; adoption owner role
- wiki/ai-pilot-to-production.md — transition mechanics from pilot to production pod; handoff failure modes; GoLive criteria
- wiki/data-products-reuse.md — the reuse infrastructure that makes each domain cheaper than the last; data product library
Related Research
- research/04-consulting-firms/mckinsey-rewired-2nd-edition-synthesis.md — Rewired Ch. 30 (make adoption stick), Ch. 31 (design for scale and reuse), Ch. 32 (KPI discipline), Ch. 33 (midstream adjustments)
- research/09-ai-adoption-cycle/ai-steady-state-operating-model.md — what scaling complete looks like; governance-to-cadence transition; sustained ops cost
- research/09-ai-adoption-cycle/year-2-ai-roadmap-scaling-beyond-pilot.md — Year 2 roadmap structure; domain sequencing template
- research/07-adoption-challenges/ai-change-management-best-practices.md — Prosci, Kotter, ADKAR adapted for AI; saturation-point diagnostics
- research/07-adoption-challenges/forced-adoption-outcomes.md — outcomes data on forced vs. voluntary adoption (Writer, WalkMe, IgniteTech, Shopify)