AI Workloads Are Forcing DevOps Teams to Rethink Infrastructure

By Terry Storrar, managing director, Leaseweb UK.

  • Thursday, 28th May 2026 Posted 7 hours ago in by Phil Alsop

There is no doubt that AI has changed the pace of enterprise IT. What once took months of planning is now expected to be completed in days. Development teams are being asked to test generative AI applications, deploy internal copilots, fine-tune large language models, and support increasingly data-intensive workloads. Oh, and all as fast as humanly (or as it turns out, not so humanly) possible.

This trend is creating a challenge for DevOps teams that goes far beyond software delivery. Organisations are discovering that traditional approaches to cloud infrastructure are struggling to keep up with modern workload demands. GPU availability, escalating cloud costs, compliance pressures, and concerns around operational control are all pushing infrastructure strategy back into the spotlight.

As a result, Infrastructure-as-a-Service (IaaS) is turning from a background utility into a strategic requirement for organisations building AI-driven services.

Exposing infrastructure gaps

Many organisations entered the AI race assuming their existing cloud strategies would suffice. In reality, AI workloads behave quite differently from conventional enterprise applications. Training models, running inference workloads, or processing large datasets places significant pressure on compute, storage, and networking resources. GPU acceleration is becoming essential, while throughput, latency, and scalability requirements are growing rapidly.

For DevOps teams already balancing uptime, deployments, security, and developer productivity, this creates a difficult operational environment. They now need to also ensure they’re capable of accessing infrastructure quickly enough to support experimentation without introducing unnecessary operational complexity.

This is where IaaS has become increasingly important. The ability to provision infrastructure on demand, scale environments dynamically, and deploy temporary workloads without long procurement cycles gives organisations the agility AI workloads require. In many cases, it is the difference between accelerating innovation and slowing projects down before they begin.

Public cloud alone is not solving every problem

Let’s be very clear, hyperscale cloud providers remain central to enterprise IT. Their platforms have enabled organisations to scale globally, launch applications rapidly, and access advanced AI tooling that would once have been inaccessible to most businesses. However, organisations are also becoming more aware of the operational trade-offs that come with relying entirely on hyperscalers.

One of the biggest concerns is cost. AI workloads can become extremely expensive at scale, particularly when GPU-heavy infrastructure is involved. Organisations experimenting with large language models are often finding that cloud spend increases much faster than expected.

Visibility is another challenge. As infrastructure environments become more complex, understanding where workloads are running, how resources are allocated, and how costs are accumulating can become difficult.

Additionally, compliance and sovereignty concerns are becoming harder to ignore. For organisations operating in regulated sectors, questions around data residency, jurisdiction and infrastructure access are now critical considerations; particularly as AI governance requirements continue to evolve. At the same time, many DevOps teams are under pressure to avoid becoming overly dependent on proprietary ecosystems that limit portability or increase long-term operational risk.

None of this reduces the importance of hyperscalers. They remain essential for scalability, elasticity, and global reach. But organisations are increasingly recognising that public cloud alone may not be the ideal environment for every AI workload.

Spotlight on specialist IaaS providers 

This shift is creating new opportunities for regional and specialist IaaS providers. Rather than attempting to compete directly with hyperscalers on size, many providers are differentiating through flexibility, operational transparency, and workload-specific infrastructure. For AI-focused environments, that can be particularly valuable. Specialist providers are often able to deliver tailored GPU infrastructure, customised networking, and high-performance storage environments designed specifically for compute-intensive workloads. For DevOps teams, this flexibility matters. AI projects frequently move quickly from proof-of-concept to production, creating unpredictable demand patterns that require infrastructure teams to adapt rapidly.

Support models are also becoming increasingly important. When infrastructure issues affect critical workloads, organisations want direct access to engineers who understand the environment rather than relying solely on standardised support processes. Local providers can often deliver a more responsive operational experience; particularly for organisations with complex or specialised infrastructure requirements.

There is also growing interest in sovereignty-focused infrastructure. Governments and regulators place greater emphasis on data governance, so organisations are seeking providers that can offer greater visibility into where data resides and how infrastructure environments are managed. For businesses operating in sectors such as finance, healthcare, or the public sector, this level of control is becoming crucial.

The hybrid approach

As a result, infrastructure strategies are becoming more distributed. Rather than relying entirely on one provider model, many organisations are now combining hyperscale cloud environments with regional or specialist IaaS platforms depending on workload requirements.

A hyperscaler may still support customer-facing applications or global services, while specialist infrastructure is used for GPU-intensive AI training, regulated datasets, or latency-sensitive workloads. This approach gives DevOps teams greater flexibility to optimise infrastructure around performance, cost, compliance, and operational priorities. It also reduces reliance on a single ecosystem. With AI adoption continuing, this hybrid infrastructure model is likely to become more common across enterprise environments.

Ultimately, the organisations that will succeed with AI are the ones acknowledging the importance of building infrastructure environments capable of supporting rapid experimentation, scalable deployment, and long-term operational resilience.

For DevOps teams, that means infrastructure can no longer be treated as an afterthought. With AI workloads continuing to grow, organisations will need infrastructure strategies that balance scalability, flexibility, governance, and cost control. In the long run, that will involve combining the strengths of hyperscale cloud providers with the agility and operational focus of specialist IaaS platforms. Because in the arms race to operationalise AI, infrastructure is rapidly becoming one of the biggest differentiators.

With the uncertainty and volatility surrounding the current AI boom, what can data infrastructure...
Girl Tech programme helps over 2,000 young women pursue digital careers, championing gender parity...
By Ben Allcock, Vice President – B2B, TP-Link UK&I.
In a Q&A with Zendesk’s VP Marketing EMEA, Emma Acton reflects on how enterprise marketing is...
Zendesk EMEA CTO Matthias Goehler discusses how AI is reshaping customer service, from automation...
By George Ashwin, UK Channel Director at AddOn Networks
Connectivity, Not Compute, Will Decide the AI Race. By Mahesh Krishnaswamy, Founder and CEO of...