Real ROI by Business Function: Where Enterprises Are Actually Seeing Returns from AI

Executive Summary

  • Only 5-6% of enterprises qualify as “AI high performers” with measurable EBIT impact of 5%+ (McKinsey 2025)
  • 88% of companies use AI in at least one function, but meaningful bottom-line impact remains rare
  • The highest proven ROI comes from the most boring applications: customer service routing, invoice processing, fraud detection
  • Academic studies with control groups show mixed results: 14-40% productivity gains in some tasks, 19% slower in others
  • Most enterprise AI ROI comes from configuring SaaS tools, not custom development – the “build your own” era of 2024 largely failed
  • The functions where AI delivers the most value are NOT the ones getting the most press

The Evidence Hierarchy: What We Actually Know

Before ranking functions, it matters enormously where the data comes from. Here is the honest landscape of AI ROI evidence, ordered by reliability.

Tier 1: Randomized Controlled Trials (Highest Reliability)

These are rare but they exist, and they paint a more nuanced picture than vendor marketing.

Brynjolfsson, Li, & Raymond (Stanford/MIT, 2023-2024) – Customer Support

  • 5,179 customer support agents, staggered rollout, real workplace setting
  • AI assistance increased productivity by 14% on average (issues resolved per hour)
  • Novice/low-skilled workers: 34% improvement; experienced workers: minimal gains
  • Requests to speak to a manager declined by 25%
  • Published in the Quarterly Journal of Economics (2025)
  • Verdict: Real, measured, peer-reviewed. The strongest evidence we have for any business function.

Dell’Acqua et al. (Harvard/BCG, 2023) – “The Jagged Frontier”

  • 758 BCG consultants (7% of individual contributor workforce), pre-registered experiment
  • For tasks within AI capabilities: 25% faster, 40% higher quality, 12% more tasks completed
  • For tasks outside AI capabilities: 19 percentage points worse performance
  • Introduced the critical concept of AI’s “jagged technological frontier”
  • Verdict: Peer-reviewed, rigorous, but reveals AI is a double-edged sword. Context determines everything.

Noy & Zhang (MIT, 2023) – Professional Writing Tasks

  • 453 college-educated professionals, randomized ChatGPT access
  • Time to complete writing tasks decreased by 40%, output quality rose by 18%
  • Benefits concentrated among lower-ability workers (inequality compression)
  • Published in Science
  • Verdict: Strong methodology, but tasks were 20-30 minute writing exercises, not full workflows.

METR (2025) – Experienced Software Developers

  • 16 experienced open-source developers, 246 tasks, randomized AI access
  • Developers using AI were 19% SLOWER, not faster
  • Developers believed they were 20% faster (massive perception gap)
  • Developers averaged 5 years of experience on their specific repositories
  • Tools used: Cursor Pro with Claude 3.5/3.7 Sonnet (frontier models at the time)
  • Verdict: Small sample but rigorous methodology. Critically, shows experienced developers on familiar codebases may not benefit. Contradicts industry narratives.

Tier 2: Large Consulting Surveys (Useful but Self-Reported)

McKinsey Global Survey – “The State of AI” (2025)

  • 88% of companies use AI in at least one function
  • Only 39% report any EBIT impact at enterprise level; most say it is less than 5%
  • Only ~6% qualify as “high performers” (5%+ EBIT impact from AI)
  • Revenue increases most commonly reported in: marketing & sales, strategy & corporate finance, product development
  • Cost reductions of 10-20% reported in: software engineering, manufacturing, IT
  • High performers invest 3x more in process redesign than in software itself
  • High performers are 3x more likely to have fundamentally redesigned workflows
  • Caveat: Self-reported survey data from executives. Selection bias is real.

BCG – “From Potential to Profit” (January 2026)

  • 1,400+ C-suite executives surveyed
  • Big gaps emerging between “winners” and “observers”
  • Focus on strategic priorities, transforming processes, and preparing workforces
  • Caveat: C-suite self-reporting; details behind paywall.

Deloitte – “State of AI in the Enterprise” (2026)

  • 3,235 leaders surveyed (Aug-Sep 2025)
  • 74% say projects meet or exceed ROI expectations
  • But: satisfactory ROI typically takes 2-4 years (3-4x longer than conventional tech)
  • 44% of cybersecurity respondents report ROI surpassing expectations (highest of any function)
  • Only 20% of organizations achieving revenue growth through AI (vs. 74% who hope to)
  • Caveat: “Meeting expectations” is not the same as “delivering strong ROI.” Expectations may have been lowered.

Tier 3: Vendor-Funded Studies (Use with Extreme Caution)

Forrester TEI (Total Economic Impact) Studies

  • All TEI studies are commissioned and paid for by the vendor being studied
  • Recent AI-related TEIs include: Microsoft 365 Copilot, Microsoft Foundry, Five9, boost.ai, PolyAI, Writer
  • Methodology: Forrester interviews select customers, builds financial model with benefits/costs/risks
  • Forrester claims editorial independence: “Clients cannot purchase favorable opinions or results”
  • However: vendors select which customers Forrester interviews, positive results are the norm, and these studies cannot be used in Forrester’s syndicated (independent) research
  • Microsoft Copilot TEI claims: up to 353% ROI for SMBs
  • Verdict: Treat as marketing collateral with a research veneer. The methodology is sound in theory but the selection bias in customer interviews is structural. These are NOT independent research.

Tier 4: Vendor Claims (Marketing, Not Evidence)

  • GitHub Copilot: “55% faster task completion” (internal study, not independently verified)
  • Google Cloud: “74% of executives achieve ROI in first year with AI agents”
  • Salesforce: Various AI-driven revenue claims
  • Verdict: Useful for understanding product capabilities, not for ROI planning.

Business Functions Ranked by Proven ROI

Based on the evidence hierarchy above, here is an honest ranking from strongest proven ROI to weakest.

Tier A: Strong Evidence of Positive ROI

1. Customer Service AI (Chatbots, Routing, Agent Assistance)

Evidence strength: HIGH – multiple RCTs plus extensive enterprise deployment data

Metric Result Source
Productivity increase 14% (issues resolved/hour) Brynjolfsson et al. (Stanford/MIT)
Novice worker improvement 34% Brynjolfsson et al. (Stanford/MIT)
Cost per chat reduction 70% Vodafone case study
Routine inquiry handling 80% of inquiries automated IBM
Operational cost reduction 23-30% Multiple enterprise reports
Projected labor cost savings $80B by 2026 Gartner
ROI ratio $3.50 returned per $1 spent Industry composite
Time to positive ROI 8-14 months Multiple reports

Why it works so well:

  • High volume, repetitive, well-structured interactions
  • Easy to measure (resolution time, cost per contact, customer satisfaction)
  • Primarily SaaS configuration, not custom development
  • AI handles Tier 1 queries, routes complex ones to humans
  • Works best for: FAQ deflection, initial triage, agent assistance, sentiment routing

The honest caveat: The 34% gains for novices vs. minimal gains for experienced agents (Brynjolfsson) suggests AI acts more as a “knowledge equalizer” than a universal productivity booster. Organizations staffed entirely by senior agents may see smaller gains.

Configuration vs. custom: ~80-90% of value comes from configuring off-the-shelf platforms (Zendesk AI, Intercom Fin, Freshworks Freddy, Five9). Custom development adds value only for industry-specific knowledge bases or complex multi-system integrations.


2. Finance AI (Fraud Detection, Invoice Processing, Forecasting)

Evidence strength: HIGH for fraud detection and invoice processing; MODERATE for forecasting

Metric Result Source
Fraud detection accuracy >90% Industry composite
JP Morgan fraud/ops savings $50M/year JP Morgan case study
Visa fraud prevention $40B in prevented fraud (2023) Visa
Invoice processing cost reduction 80%+ (from $15-40 to under $5/invoice) Industry benchmarks 2025
Forecast accuracy improvement 25-40% Multiple enterprise reports
AP automation ROI 420% annual (breakeven in 2 months) Case study composite
Finance team productivity boost 44-54% McKinsey
Manual data work reduction 20-30% (top-tier firms) McKinsey

Why it works so well:

  • Fraud detection has immediate, measurable dollar impact
  • Invoice processing is high-volume, rules-based, perfect for AI
  • Financial data is structured, clean, and abundant
  • ROI is directly quantifiable in dollar terms

The honest caveat: The $50M and $40B numbers from JP Morgan and Visa reflect massive scale enterprises. Mid-market companies will see proportionally smaller but still meaningful returns. The “44-54% productivity boost” from McKinsey is self-reported survey data, not experimentally verified.

Configuration vs. custom: Fraud detection is overwhelmingly SaaS (Featurespace, Feedzai, built-in bank platform features). Invoice processing is now largely AI-OCR SaaS (90%+ value from configuration). Financial forecasting is where custom development starts to matter – integrating company-specific data sources, custom models for unusual business patterns.


3. IT Operations AI (AIOps, Incident Response, Monitoring)

Evidence strength: MODERATE – strong operational metrics but limited independent studies

Metric Result Source
Event noise reduction ~99.2% Enterprise benchmarks
MTTR reduction 50-66% Multiple case studies
Outage reduction ~70% Mature enterprise deployments
Hours saved per incident 4.87 hours SolarWinds 2025
Monthly hours reclaimed Up to 9,500 (global deployments) Enterprise benchmarks
Cybersecurity ROI exceeding expectations 44% of respondents Deloitte 2026
ROI payback period 3-6 months Industry estimates

Why it works so well:

  • Downtime costs are enormous ($10K+/hour), so even small improvements have massive ROI
  • Alert fatigue is a real, measurable problem that AI directly addresses
  • Pattern recognition across millions of log entries is genuinely superhuman
  • Clear before/after metrics (MTTD, MTTR, uptime SLA compliance)

Configuration vs. custom: ~70% SaaS configuration (Datadog, PagerDuty, ServiceNow AIOps, Splunk). Custom development matters for correlating proprietary system data and building organization-specific runbooks.


Tier B: Moderate Evidence of Positive ROI

4. Supply Chain AI (Demand Planning, Inventory Optimization)

Evidence strength: MODERATE – strong theoretical basis, real results at scale, but high failure rate

Metric Result Source
Forecast accuracy improvement 20-50% Multiple enterprise reports
Forecast error reduction From 28.76% to 16.43% Academic/ML benchmarks
Cost savings 26-31% McKinsey
Executives reporting 12-month ROI 77% Industry survey
GenAI initiatives struggling with sustained ROI Up to 95% Industry analysis

Why it works so well (when it does):

  • Demand forecasting is a mathematical problem with clear right/wrong answers
  • Integrating external signals (weather, events, economic data) is genuinely valuable
  • Small improvements in forecast accuracy translate to large inventory savings
  • Established ML techniques (not just GenAI) with decades of refinement

The honest caveat: The 95% failure rate for sustained ROI is the critical number here. Success requires clean data, clear governance, and deliberate risk management. Most companies have none of these. The 77% reporting 12-month ROI may reflect survivorship bias – companies that failed abandoned the effort.

Configuration vs. custom: This is one area where custom development genuinely matters (~40-50% of value). Off-the-shelf tools (Blue Yonder, Kinaxis, o9 Solutions) provide the platform, but integrating company-specific data sources, supplier relationships, and demand signals requires significant custom work.


5. Sales AI (Forecasting, Lead Scoring, Pipeline Management)

Evidence strength: MODERATE – strong adoption data, ROI numbers mostly self-reported

Metric Result Source
Revenue increase 13-15% Industry surveys
Sales cycle reduction Up to 68% shorter Industry reports
Forecast accuracy improvement From 60-75% to 90-98% Platform vendor data
Quota attainment improvement Up to 30% Vendor reports
ROI over 3 years 299% average Unified platform vendor data
Teams with AI seeing revenue growth 83% vs 66% without Industry survey
Time to positive ROI 12-18 months Industry consensus

Why it works so well (when it does):

  • CRM data is abundant and structured
  • Lead scoring is a classic ML classification problem with clear feedback loops
  • Forecast accuracy directly impacts planning, hiring, and cash management
  • Sales teams are highly measurable by nature

The honest caveat: The “90-98% accuracy” claims come from platform vendors. Only 7% of sales organizations actually achieve 90%+ forecast accuracy. The gap between vendor claims and enterprise reality is enormous. Also, 69% of sales ops leaders say forecasting is getting harder, suggesting AI has not yet solved the core problem for most.

Configuration vs. custom: ~85% SaaS configuration (Salesforce Einstein, Gong, Clari, HubSpot). Custom development mainly for: proprietary data integration, industry-specific scoring models, and multi-system orchestration.


6. Marketing AI (Content, Personalization, Campaign Optimization)

Evidence strength: MODERATE – strong adoption, ROI data mostly from ecommerce/digital

Metric Result Source
Average ROI ~300% (marketing teams using AI in 2025) Industry surveys
Campaign launch speed 75% faster Multiple reports
Ad spend ROI improvement 30% higher Industry benchmarks
Personalized email transaction rates 6x higher Ecommerce data
Revenue increase from personalization Up to 41% Ecommerce studies
Time savings per employee ~11.4 hours/week McKinsey
Executives who can measure ROI confidently Only 29% Industry survey

Why it works so well (in specific contexts):

  • Content generation is a natural fit for LLMs
  • A/B testing provides clear feedback loops
  • Personalization at scale was impossible without AI
  • Digital marketing has excellent attribution data

The honest caveat: The 300% average ROI figure is self-reported and likely inflated by selection bias. The most telling statistic: only 29% of executives say they can confidently measure AI ROI in marketing. If you cannot measure it, you cannot claim it. Much of marketing AI ROI may be phantom gains from attribution modeling changes rather than actual incremental value.

Configuration vs. custom: ~90% SaaS configuration (HubSpot, Marketo, Jasper, Copy.ai, Persado). Custom development rarely justified unless dealing with highly regulated industries or proprietary data advantages.


Tier C: Limited or Mixed Evidence of ROI

7. HR AI (Recruiting, Screening, Workforce Planning)

Evidence strength: LOW-MODERATE – strong efficiency metrics, significant bias/legal risks

Metric Result Source
AI adoption in recruiting 43% (up from 26% YoY) Industry survey
Cost-per-hire reduction 20-40% Multiple reports
Time-to-hire reduction 30-50% Multiple reports
Enterprise annual savings Average $2.3M Industry estimate
ROI timeline 8-18 months Industry consensus
Bias reduction (when properly implemented) 56-61% Research studies
Organizations with ongoing bias challenges 67% Industry survey
Enterprises planning AI-run hiring by 2026 33% HR Dive

Why gains are real but risky:

  • Resume screening is high-volume and repetitive (good for AI)
  • Scheduling automation eliminates enormous coordination overhead
  • Candidate sourcing across platforms benefits from AI aggregation

The honest caveat: HR AI carries the highest regulatory and reputational risk of any function. New York City already requires annual bias audits for automated hiring tools. California finalized AI hiring regulations in October 2025. Colorado AI Act takes effect June 2026. People mirror AI hiring biases ~90% of the time even when aware of them (University of Washington, 2025). The cost savings are real but the legal liability is growing faster than the savings.

Configuration vs. custom: ~85% SaaS (HireVue, Eightfold, Phenom, Greenhouse AI features). Custom development needed mainly for bias testing/compliance frameworks specific to organizational demographics.


8. Software Engineering AI (Coding Assistants, Code Review)

Evidence strength: CONTRADICTORY – strong vendor claims vs. sobering independent studies

Metric Result Source
Task speed-up (vendor claim) Up to 55% GitHub internal study
Average time saved per week 3.6 hours Industry analytics (135K developers)
Code suggestion acceptance rate ~30% Industry data
AI-generated code quality issues 1.7x more issues than human code CodeRabbit analysis
Security weaknesses in Python output 29.1% contain vulnerabilities Independent analysis
Developer trust in AI output Only 33% trust it Industry survey
METR RCT: experienced dev speed change 19% SLOWER METR 2025 (peer-reviewed)
Developer perception vs. reality gap Believed 20% faster; actually 19% slower METR 2025
Enterprises achieving profitability from AI tools Less than 47% Industry survey

The contradiction explained:

  • AI coding tools help with unfamiliar codebases and boilerplate (aligned with BCG “jagged frontier” – tasks within AI capabilities see big gains)
  • AI coding tools hurt experienced developers on familiar codebases (METR study)
  • The 55% speed-up is real for certain task types but does not translate to 55% overall productivity gain
  • 70% of AI suggestions are rejected, suggesting the “productivity” framing overstates actual impact
  • Quality concerns (1.7x more issues, 29% security weaknesses) create downstream costs not captured in speed metrics

Configuration vs. custom: Nearly 100% SaaS/configuration (GitHub Copilot, Cursor, Windsurf, Tabnine). Custom fine-tuning on proprietary codebases is emerging but adds cost with uncertain returns.


Evidence strength: LOW-MODERATE – growing adoption, limited rigorous ROI data

Metric Result Source
Time savings 6-20% per week for 62% of professionals Wolters Kluwer
Revenue gains attributed to AI 6-20% for ~50% of professionals Industry survey
Contract review speed 60-80% faster Vendor platforms
Time saved per contract draft 1-2 hours Top US law firm case study
Documents processed increase 4x per week Top US law firm case study
Organizations with better CLM integration ROI 40-60% better ROI Industry analysis
Executives reporting EBITDA lift from AI Only 15% Forrester 2026

Why it is slow to prove ROI:

  • Legal work requires high accuracy – errors have serious consequences
  • Attorney time is expensive, so even modest time savings are meaningful in dollar terms
  • But: the adversarial nature of legal work means AI errors are actively exploited by opposing counsel
  • Regulatory caution limits aggressive deployment

The honest caveat: The 60-80% faster contract review numbers come from vendor platforms measuring specific tasks, not end-to-end legal processes. Total time savings of 6-20% per week is more realistic. Legal AI remains largely in “augmentation” mode – speeding up research and first drafts, not replacing judgment calls.

Configuration vs. custom: ~70% SaaS (Harvey, Casetext/CoCounsel, Spellbook, Luminance). Custom development matters for: firm-specific clause libraries, jurisdiction-specific training, integration with proprietary document management systems.


Configuration vs. Custom Development: The Honest Breakdown

The Big Picture

The “build your own AI” wave of 2024 has largely failed. As Gartner notes, enterprises tried to build their own AI, ran proofs of concept, hired ML engineers, experimented with custom models – and most of it failed. In 2025-2026, CIOs are shifting decisively to commercial off-the-shelf solutions.

Where Does the ROI Actually Come From?

Approach Est. Share of Enterprise AI Spending Est. Share of Enterprise AI ROI
Configuring existing SaaS with AI features ~45-55% ~60-70%
Integrating/connecting AI-enabled tools ~25-30% ~15-20%
Custom AI model development ~15-25% ~10-15%

Note: These are estimated ranges synthesized from Gartner, Deloitte, and McKinsey data. No single source provides this exact breakdown. The directional message is clear: most ROI comes from configuration, not custom development.

Where Custom Development Actually Matters

Custom AI development delivers outsized ROI in a narrow set of conditions:

  1. Proprietary data advantage: When your organization has data no one else has (e.g., JP Morgan’s transaction graph for fraud detection)
  2. Regulatory moat: When industry-specific compliance requires models trained on domain-specific data
  3. Supply chain specificity: When demand signals are unique to your business (e.g., a retailer combining weather, local events, and store-level inventory data)
  4. Integration orchestration: When the value comes from connecting 5+ systems in ways no single vendor supports
  5. Competitive differentiation: When AI IS the product, not a tool supporting the product

Where Custom Development Is a Waste of Money

  • Building a chatbot from scratch when Zendesk/Intercom/Freshworks can be configured in weeks
  • Training custom LLMs when prompt engineering with commercial APIs achieves 90% of the result
  • Building proprietary document processing when AI-OCR SaaS has reached 98-99% accuracy
  • Custom lead scoring models when CRM-native AI features exist
  • Any project where the primary goal is “we want to own our AI” rather than solving a specific business problem

The 39% Tax

Gartner data shows that 39% of developer time is spent designing, building, and testing custom integrations. For AI projects specifically, 95% of IT leaders cite integration as a challenge to seamless AI implementation. This “integration tax” is the hidden cost that destroys ROI projections for custom AI development.


The AI That Nobody Talks About

The highest-ROI AI applications are almost universally the most boring ones. Here is what actually saves money.

Invoice and Accounts Payable Processing

  • Per-invoice cost reduced from $15-40 to under $5 (80%+ reduction)
  • AI-OCR accuracy: 98-99%
  • Processing speed: 90% faster
  • Invoice processing per FTE: up to 400% increase
  • Market growing from $2.8B (2024) to projected $47.1B (2034)
  • Breakeven: 2 months. Annual ROI: 420%.
  • This is possibly the single highest-ROI AI application in enterprise, and nobody writes breathless blog posts about it.

Email Routing and Ticket Classification

  • Automatically categorizes, prioritizes, and routes incoming communications
  • Reduces misrouting by 60-80%
  • Saves 5-15 minutes per ticket in manual triage time
  • At scale (10,000+ tickets/month), this saves 800-2,500 hours/month
  • Entirely SaaS configuration – zero custom development needed

Data Entry and Form Processing

  • AI reads forms, extracts structured data, populates systems
  • 99.5% field-level accuracy (better than human data entry)
  • Eliminates double-entry across systems
  • Typical savings: 10-15 minutes per form x thousands of forms/month

RPA (Robotic Process Automation) + AI

  • 100-200% ROI in first 12 months
  • Operational cost reduction: 30-80%
  • Global work hours saved: estimated 2.2 billion annually
  • Cargill: automated 70% of order processing, saving $15M/year
  • NHS Newcastle: 7,000 hours/year saved automating HR workflows
  • ROI payback: 6-9 months
  • Market growing from $3.79B (2024) to projected $30.85B (2030)

Alert Deduplication and Noise Reduction

  • IT operations teams drown in alerts – AIOps reduces noise by 99.2%
  • This is not glamorous. It is a goldmine.
  • Every false alert that wakes up an on-call engineer at 3am costs morale, productivity, and retention

Document Classification and Compliance Tagging

  • Automatically tags documents for retention policies
  • Routes compliance-sensitive materials to appropriate reviewers
  • Reduces compliance audit prep time by 40-60%
  • Zero controversy because it replaces work nobody wanted to do

The Common Thread

These applications share characteristics that explain their success:

  • High volume, low complexity: Thousands of identical decisions per day
  • Clear right/wrong answers: Easy to measure accuracy
  • Structured or semi-structured data: AI does not need to “reason,” just classify
  • Zero emotional attachment: Nobody’s identity is tied to routing emails
  • Immediate, measurable savings: Dollar-for-dollar comparison with human labor cost
  • Low risk of catastrophic failure: A misrouted email is annoying, not lawsuit-inducing

Key Academic Studies Reference Table

Study Year N Design Domain Key Finding Published In
Brynjolfsson, Li, Raymond 2023-24 5,179 agents Staggered RCT Customer support +14% productivity; +34% for novices QJE (2025)
Dell’Acqua et al. (BCG/Harvard) 2023 758 consultants Pre-registered RCT Consulting +25% speed, +40% quality inside frontier; -19pp outside Organization Science (2025)
Noy & Zhang (MIT) 2023 453 professionals Randomized experiment Writing tasks -40% time, +18% quality Science (2023)
METR 2025 16 developers, 246 tasks Randomized crossover Software development 19% slower with AI (contradicts beliefs) arXiv preprint
Microsoft New Future of Work 2025 Meta-analysis Literature review Multiple Mixed results across domains Microsoft Research

What This Means for Enterprise Decision-Makers

The Three Rules of AI ROI

Rule 1: Start with the boring stuff. Invoice processing, email routing, ticket classification, and data entry will deliver faster ROI with lower risk than any “transformative AI strategy.” These are the foundation.

Rule 2: Configure before you build. For every dollar spent on custom AI development, ask: “Can we get 80% of this value by configuring an existing SaaS tool?” The answer is usually yes. The Gartner data is clear: most custom AI projects from 2024 failed. Buy, configure, integrate.

Rule 3: Measure function-by-function, not enterprise-wide. “What is our AI ROI?” is the wrong question. “What is our AI ROI for invoice processing?” is the right question. The McKinsey data shows that even high performers see wildly different returns across functions. Aggregate ROI metrics are meaningless.

The Uncomfortable Truths

  1. Most AI ROI claims are marketing. Vendor-funded Forrester TEI studies, internal GitHub studies, and platform vendor ROI calculators are not independent evidence. Plan accordingly.

  2. The METR study should worry everyone selling AI coding tools. Experienced developers on familiar codebases got 19% slower – and believed they were 20% faster. If perception and reality can diverge that much, how many other “productivity gains” are phantoms?

  3. High performers invest 3x more in process redesign than software. The McKinsey data is unambiguous: buying AI tools without redesigning workflows produces minimal returns. The tool is 25% of the value; the workflow redesign is 75%.

  4. HR AI is a legal time bomb. The efficiency gains are real but the regulatory landscape is moving faster than the technology. New York, California, Colorado, and the EU are all tightening rules. A bias lawsuit can erase years of hiring cost savings in a single settlement.

  5. 2-4 years to ROI is the honest timeline. Deloitte reports that most organizations achieve satisfactory AI ROI in 2-4 years, 3-4x longer than conventional technology deployments. Anyone promising 90-day transformation is selling something.


Sources

Academic / Peer-Reviewed

  • Brynjolfsson, Li, Raymond. “Generative AI at Work.” Quarterly Journal of Economics (2025). Stanford SIEPR
  • Dell’Acqua et al. “Navigating the Jagged Technological Frontier.” Organization Science (2025). Harvard Business School
  • Noy & Zhang. “Experimental evidence on the productivity effects of generative artificial intelligence.” Science (2023). Science
  • METR. “Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity.” (2025). METR

Consulting Firm Reports

  • McKinsey. “The State of AI in 2025.” McKinsey
  • BCG. “From Potential to Profit: Closing the AI Impact Gap.” (January 2026). BCG
  • Deloitte. “State of AI in the Enterprise 2026.” Deloitte

Analyst Firms

  • Gartner. “Worldwide AI Spending Will Total $2.5 Trillion in 2026.” Gartner
  • Gartner. “40% of Enterprise Apps Will Feature AI Agents by 2026.” Gartner
  • Forrester. TEI Methodology. Forrester
  • Forrester. TEI of Microsoft 365 Copilot. Forrester/Microsoft

Industry Reports and Data

  • Menlo Ventures. “2025: The State of Generative AI in the Enterprise.” Menlo Ventures
  • Wolters Kluwer. “Legal AI Adoption: Time Savings, Contract Review, Revenue Growth.” Wolters Kluwer
  • Master of Code. “AI ROI: Why Only 5% of Enterprises See Real Returns in 2026.” Master of Code
  • PwC. “2026 AI Business Predictions.” PwC

Created by Brandon Sneider | brandon@brandonsneider.com March 2026