In a report by Fivetran, titled The Enterprise Data Infrastructure Benchmark 2026, it is highlighted that despite substantial financial commitments, the fragility of data pipelines remains an obstacle to analytics and AI progress in large enterprises.
The research draws on input from 500 senior data and technology leaders from organisations with over 5,000 employees. Nearly 97% of these leaders report that pipeline failures have delayed analytics or AI initiatives, indicating reliability as a growing factor in enterprise AI delivery.
Rather than underinvestment, the report identifies supporting architecture as the primary challenge. It notes that enterprises are allocating an average of $29.3 million annually to data initiatives, yet reliability issues continue to impact business value.
With 14% of these budgets — approximately $4.2 million annually — allocated to integration, many organisations operate a mix of legacy ETL systems and DIY pipelines. As data volumes increase, these systems become more difficult to maintain. The benchmark highlights an estimated $3 million in monthly business exposure due to pipeline downtime and operational disruption, reflecting a gap between investment and measurable returns.
Reliability challenges increase as data environments scale, with enterprises managing an average of over 300 pipelines. The study finds that 53% of engineering capacity is focused on maintaining existing pipelines, limiting resources for innovation and AI initiatives.
This often results in operational disruption, with an estimated 4.7 pipeline failures per month, each taking nearly 13 hours to resolve. The resulting downtime exceeds 60 hours per month, delaying analytics delivery and AI deployment timelines.
As AI adoption increases, a shift towards open data infrastructure architectures is anticipated. These approaches emphasise automated data movement and interoperability, supporting more resilient and scalable environments while reducing engineering overhead.
It is increasingly suggested that as enterprises aim to improve competitiveness, adoption of open data infrastructure strategies may play a role in improving flexibility and operational resilience.