MIT NANDA GenAI Divide report showing 95% of enterprise AI pilots fail to deliver measurable ROI in 2025-2026MIT's Project NANDA found that 95% of GenAI pilots produced zero measurable P&L impact, a finding corroborated by McKinsey, BCG, and IBM in 2025.
95% of Enterprise AI Pilots Fail to Deliver ROI | NeuralWired
Enterprise AI • Research Analysis

95% of Enterprise AI Pilots Fail to Deliver ROI. Four Research Teams Just Confirmed It.

Your company just spent six months and a million dollars on a generative AI pilot. The vendor demos looked flawless. The internal presentations sparked genuine excitement. Then the results came in, and nothing moved. Not revenue. Not costs. Not customer retention.

If that sounds familiar, you are in the majority. A large, expensive, embarrassingly well-funded majority.

Four major research institutions, using four separate methodologies, all landed on the same uncomfortable finding about enterprise AI investment in 2025: somewhere between 60% and 95% of organizations are spending real money on AI and producing nothing measurable in return. The AI ROI crisis is not a pessimist’s talking point anymore. It is the consensus position of the best-sourced data in the field.

Here is what the research actually shows, why the failures keep happening, and what the small group of winners is doing that everyone else isn’t.


The Real Numbers: What Four Independent Studies Found

The most important thing to know before citing any AI failure statistic is that several of the most-shared numbers online are not traceable to real research. A figure that circulated widely in early 2026, claiming “$684 billion invested with 80.3% producing nothing,” appears across dozens of content sites but traces back to no primary dataset, no named methodology, and no actual report. It is not from RAND. It is not from Gartner. It does not exist in the primary literature.

What does exist is more interesting, and more damning.

95%
of GenAI pilots show zero measurable P&L impact
MIT Project NANDA, July 2025
39%
of organizations report any enterprise-wide EBIT impact from AI
McKinsey State of AI 2025
42%
of companies abandoned most of their AI initiatives in 2025, up from 17% in 2024
S&P Global Market Intelligence, March 2025
25%
of AI initiatives delivered expected ROI, per CEO self-report
IBM Institute for Business Value, May 2025

MIT’s Project NANDA published the sharpest number. Their GenAI Divide: State of AI in Business 2025 report reviewed over 300 publicly disclosed AI initiatives and conducted structured interviews with representatives from 52 organizations, plus survey responses from 153 senior leaders. The finding: 95% of GenAI pilots delivered no measurable profit-and-loss impact. Only 5% of integrated systems created significant value.

McKinsey’s State of AI 2025 is the largest survey, with nearly 1,993 respondents across 105 countries. It found that just 39% of organizations report any enterprise-wide EBIT impact from AI. Only about 5.5% to 6% of respondents qualify as true high performers, meaning their organizations attribute more than 5% of EBIT to AI use.

S&P Global Market Intelligence surveyed more than 1,000 respondents across North America and Europe in March 2025 and found that the share of companies abandoning most of their AI initiatives jumped to 42% in one year, up sharply from 17% the prior year. The average organization scrapped 46% of AI proof-of-concepts before they ever reached production.

The IBM Institute for Business Value CEO Study surveyed 2,000 CEOs across 33 countries and found that only 25% of AI initiatives had delivered expected ROI over the preceding few years, and only 16% had scaled enterprise-wide.

Key Context These four studies use different definitions of “failure” and different methodologies. MIT tracks GenAI pilots specifically on P&L impact. McKinsey tracks EBIT at enterprise scale. S&P tracks initiative abandonment rates. IBM tracks CEO self-reported ROI. The fact that all four land in the same territory (a small single-digit percentage of organizations capturing most of the value) is more persuasive than any single number would be on its own.

For spending context: Stanford HAI’s AI Index 2025 tracked $252.3 billion in corporate AI investment in 2024, with private investment climbing 44.5% year-over-year. Gartner forecasts worldwide AI spending will reach $2.52 trillion in 2026. The money is real. The returns, for most organizations, are not.


Why Enterprise AI Projects Actually Fail

The RAND Corporation’s 2024 qualitative study, based on interviews with 65 experienced AI and machine learning practitioners, is explicit: “By some estimates, more than 80% of AI projects fail. That’s twice the failure rate of non-AI IT projects.” RAND framed this as an estimate rather than a hard statistic, which is the intellectually honest position. But the directional claim is consistent with every quantitative study that followed.

What RAND and the subsequent research agree on is that the failure causes are almost never technical. Models work. APIs work. The infrastructure, mostly, works. The failures are organizational.

The Integration Gap

McKinsey’s data reveals something counterintuitive: function-level AI wins (software engineering and IT teams seeing 10% to 20% cost reductions, for example) coexist with near-zero enterprise-wide EBIT impact. Teams are building AI tools. Those tools are producing local efficiencies. But the enterprise-level needle doesn’t move because the tools exist inside departmental silos, disconnected from the workflows that drive revenue and cost at scale.

The organizations in McKinsey’s high-performer cohort are not running more pilots. They are forcing workflow redesign. There is a meaningful difference between adding AI to an existing process and redesigning the process around AI’s actual capabilities.

The Data Readiness Problem

Gartner predicted in July 2024 that at least 30% of generative AI projects would be abandoned after proof of concept by end of 2025, citing poor data quality, inadequate risk controls, escalating costs, and unclear business value. Gartner’s June 2025 follow-up extended that prediction to agentic AI: over 40% of agentic AI projects are expected to be canceled by end of 2027, for essentially the same reasons.

The pattern Gartner keeps documenting is that most enterprise AI failures trace upstream to data infrastructure, not model selection. Organizations with fragmented, inconsistent, or poorly governed data are running state-of-the-art models on inputs that guarantee mediocre outputs. The models perform exactly as well as the data allows. The data, in most enterprises, does not allow much.

The FOMO-Driven Pilot Problem

IBM’s CEO study names a failure mode that does not appear in the other research but is instantly recognizable to anyone who has sat through an enterprise AI strategy meeting: “FOMO-driven” pilots. Organizations launch AI initiatives because competitors are launching AI initiatives, not because they have identified a specific problem that AI is the right tool to solve. The result is a portfolio of proofs-of-concept that demonstrate capability without establishing business value, and that get quietly shelved when the next technology cycle begins.

“Some large companies’ pilots and younger startups are really excelling with generative AI. It’s because they pick one pain point, execute well, and partner smartly with companies who use their tools.”

Aditya Challapally, Lead Author, MIT NANDA GenAI Divide Report (Fortune, August 2025)

The “one pain point” framing is more radical than it sounds. Most enterprise AI strategies involve multiple simultaneous pilots across multiple functions. The MIT data suggests that approach produces 95% failure rates. The alternative is a level of focus that most organizations, politically and structurally, find difficult to achieve.


What the 5% of Winners Do Differently

BCG’s Widening AI Value Gap report (published September 2025, surveying 1,250-plus global firms) found that only 5% of companies are achieving AI value at scale. BCG calls this cohort “future-built” firms. They share specific structural traits, not just better models or more budget.

What Average Firms Do What Future-Built Firms Do
Multiple simultaneous pilots across functions Single focused use case with defined P&L ownership
Measure success by pilot completion Measure success by business metric change within 90 days
Deploy AI into existing workflows Redesign workflows before and during AI deployment
Data readiness addressed post-launch Data infrastructure audited and fixed before launch
AI team isolated in IT or innovation lab AI ownership embedded in business unit P&L

“Agentic AI isn’t a future concept. It’s already reshaping workflows and redefining roles. Companies should view it as the next step in scaling AI, not as the starting point.”

Amanda Luther, Managing Director and Senior Partner, Boston Consulting Group, co-author of The Widening AI Value Gap

BCG’s data also shows that 60% of companies are not achieving material value at all, reporting minimal revenue and cost gains despite substantial investment. The distribution is not a bell curve. It is a winner-take-most dynamic where a small cohort is pulling away from the field. The companies in that cohort are not smarter. They moved earlier on data infrastructure, defined success in business terms before launch, and treated AI deployment as a change-management problem rather than a technology rollout.

If you’re building an enterprise AI program and you haven’t done a formal audit of your data readiness before approving new spend, Gartner’s prediction applies directly to you.


The Skeptic’s Case: Is the AI Failure Narrative Overblown?

The most credentialed critic of the 95% figure is Paul Roetzer, founder and CEO of the Marketing AI Institute. Speaking on The Artificial Intelligence Show in August 2025, Roetzer was direct about the MIT NANDA methodology: “Please don’t put any weight into this study. This is not a viable, statistically valid thing.”

His critique is specific and worth taking seriously. MIT’s 95% figure tracks GenAI pilots on a narrow, 6-month P&L-only definition of success. That definition excludes efficiency gains, cost reductions, customer churn improvements, and sales pipeline velocity. Roetzer’s argument is that an organization that deploys a GenAI tool and reduces its customer support ticket resolution time by 40% would count as a “failure” under NANDA’s methodology, because that improvement did not show up as a measurable P&L impact within six months.

“Anytime you see a headline like that, you have to immediately step back and say, okay, that seems unrealistic.”

Paul Roetzer, Founder and CEO, Marketing AI Institute, speaking on The Artificial Intelligence Show, Episode 164 (August 2025)

He also notes a potential framing consideration: NANDA’s research mission is building an “Internet of AI Agents,” meaning the report’s implicit argument is that today’s static GenAI tools fail while adaptive agentic systems succeed. That is not a reason to dismiss the report, but it is a reason to hold the 95% figure as directional rather than precise.

Our read: Roetzer’s methodological critique is valid. The 95% figure almost certainly overstates the failure rate under a broader definition of value. But it probably understates the failure rate under a strict enterprise-ROI definition, because organizations are generally terrible at measuring AI value even when it exists. The honest answer is that somewhere between 60% and 95% of enterprise AI initiatives are producing less value than their sponsors expected, which is damning enough without needing to settle on a single number.


What Happens Next: The 18-Month Outlook

Gartner’s “Trough of Disillusionment” framing for GenAI in 2026 fits the historical Hype Cycle pattern and is a reasonable, falsifiable prediction. After peak hype comes a period where the gap between expectation and delivered value becomes impossible to ignore, investment gets more selective, and the organizations that built real infrastructure during the hype phase begin pulling away from those that didn’t.

Three things are worth watching over the next 12 to 18 months.

Agentic AI cancellation rates will become the new headline metric. Gartner predicts more than 40% of agentic AI projects will be canceled by end of 2027. Given that agentic AI is currently in an earlier hype phase than GenAI was in 2024, the cancellation rate could be higher. Watch for enterprise announcements of agentic AI programs in Q3 2026, and note whether they include defined success metrics and timelines.

The winners will start becoming identifiable by name. The BCG “future-built” 5% is currently an anonymous cohort. As the field matures, the firms that built the right infrastructure and redesigned workflows rather than layering AI on top of broken processes will start producing public case studies with real numbers. Those case studies, when they arrive, will be more valuable than any survey data.

CFO scrutiny will reshape how pilots get approved. IBM found that only 25% of AI initiatives delivered expected ROI. That number is entering boardroom conversations. Finance leaders who previously approved AI spend on the basis of competitive parity (“our competitors are doing this”) are beginning to demand pre-defined success metrics and ROI timelines before sign-off. That shift, if it continues, will produce fewer pilots and better ones.

Three Actions for Technology Leaders Right Now

1. Audit your data infrastructure before approving any new AI spend. Gartner’s data consistently shows that data readiness, not model selection, is the primary predictor of AI success.

2. Define success in business terms, with a timeline and an owner, before a pilot launches. “Measurable reduction in customer support costs by Q3” is a success metric. “Explore AI capabilities” is not.

3. Consider stopping two current pilots before starting one new one. The evidence suggests that focus produces better outcomes than portfolio diversification when it comes to enterprise AI.


FAQ: Enterprise AI Failure Rates

Why do most enterprise AI projects fail?

Independent research from MIT, RAND, McKinsey, and S&P Global converges on organizational causes rather than technical ones: poor data readiness, unclear success metrics, weak workflow integration, and treating AI deployment as a technology rollout instead of a change-management initiative. The models mostly work. The organizations often don’t.

What percentage of AI projects fail in 2026?

Estimates vary by study and definition. MIT found 95% of GenAI pilots show no measurable P&L impact. McKinsey found only 39% of organizations report any enterprise-wide EBIT impact. S&P Global found 42% of companies abandoned most AI initiatives in 2025. No single authoritative percentage exists across all AI project types, but the consistent finding is that fewer than 10% of organizations capture most of the value.

Is the MIT 95% AI failure statistic accurate?

The MIT NANDA report is real, published in July 2025, based on 300-plus initiative reviews and 52 organizational interviews. The 95% figure reflects a strict 6-month P&L-only definition of success. Marketing AI Institute’s Paul Roetzer has publicly challenged the methodology for excluding efficiency and productivity gains. The figure is directionally useful but should not be treated as a precise universal failure rate.

How much are companies investing in AI in 2026?

Stanford HAI’s AI Index 2025 tracked $252.3 billion in corporate AI investment in 2024, with private investment rising 44.5% year-over-year. Gartner forecasts worldwide AI spending will reach $2.52 trillion in 2026. GenAI-specific pilot investment was estimated at $30 to $40 billion in 2025 per MIT NANDA’s own baseline.

What do successful enterprise AI programs have in common?

BCG’s analysis of 1,250-plus global firms found that the 5% achieving AI value at scale share three traits: they identify one specific business pain point rather than launching broad pilot portfolios, they redesign workflows around AI rather than layering AI onto existing processes, and they fix data infrastructure before deployment rather than after. The differentiator is organizational discipline, not model selection.


The enterprise AI ROI crisis is not a story about artificial intelligence failing. It is a story about organizations failing to create the conditions under which AI can succeed. The technology works. The problem is that most enterprises are deploying it into environments it cannot fix: fragmented data, undefined success metrics, siloed workflows, and approval processes driven by competitive anxiety rather than business logic.

The 5% that are winning are not using better models. They built better foundations first.

Over the next 18 months, as Gartner’s predicted “Trough of Disillusionment” plays out and CFO scrutiny tightens, the gap between that leading cohort and the field will widen. The organizations that survive the trough will be the ones that treated their first round of AI failures as diagnostic information rather than sunk costs.

For more on how enterprises are navigating this gap, read our analysis of why CTOs are falling behind on AI skills and our breakdown of the shadow AI crisis hitting enterprise governance.

Stay Ahead of the AI Signal

The Neural Loop delivers the enterprise AI research that actually matters, without the vendor spin. One email, twice a week.

Subscribe to The Neural Loop

Leave a Reply

Your email address will not be published. Required fields are marked *