
Cloud Ops: The New IT for the Cloud Era

Over the past few months of interviewing and researching dozens of companies—particularly small to mid-sized SaaS businesses—one pattern keeps emerging: the desire to stand up a Cloud Operations (Cloud Ops) organization.

It makes sense on the surface. Cloud is now the infrastructure of choice, so naturally, someone needs to “own” it. But what’s unfolding in practice often misses the mark.

Many companies are attempting to solve growing cloud complexity by taking all their DevOps, SRE, and platform engineering talent and consolidating them into a Cloud Ops team. The idea? Share them across product teams so no one gets overwhelmed.

If that sounds familiar, it should. It’s the same centralization tactic used by traditional IT for decades. And it's creating the same problems.


When Cloud Ops Becomes Old IT in Disguise

Here’s the playbook we’re seeing:

  • Move DevOps, SRE, and Ops into a central Cloud Ops team.

  • Let them handle infrastructure, CI/CD, monitoring, and cloud security across all teams.

  • Expect engineering teams to "consume" cloud through tickets or requests.

While the intention is to reduce duplication and increase efficiency, the result is often the opposite: bottlenecks, lack of ownership, slow velocity, and frustrated developers.

Sound familiar?


What Good Cloud Ops Looks Like

A healthy Cloud Ops organization shouldn't be a service desk or a ticket resolver. It should be a strategic enabler.

Here’s what good looks like:

🔍 Acts as a Governing Body, Not a Bottleneck

Cloud Ops should define the guardrails—not build the road every time. It sets standards and policies, not pipelines for individual teams.
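To make "guardrails, not roads" concrete, here is a minimal policy-as-code sketch. Everything in it is an assumption for illustration: the resource fields (`type`, `public`, `tags`), the required tag names, and the `Guardrail` helper are hypothetical, not any particular tool's API. The point is that Cloud Ops publishes the rules once, and each team runs them against its own infrastructure definitions.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical resource description; field names are illustrative only.
Resource = dict


@dataclass(frozen=True)
class Guardrail:
    name: str
    check: Callable[[Resource], bool]  # returns True when the resource complies


REQUIRED_TAGS = {"owner", "cost-center"}  # assumed tagging standard

GUARDRAILS = [
    # Every resource must carry all required tag keys.
    Guardrail("required-tags",
              lambda r: REQUIRED_TAGS <= set(r.get("tags", {}))),
    # Storage buckets must not be publicly readable.
    Guardrail("no-public-storage",
              lambda r: not (r.get("type") == "bucket" and r.get("public", False))),
]


def violations(resource: Resource) -> list[str]:
    """Return the names of every guardrail the resource fails."""
    return [g.name for g in GUARDRAILS if not g.check(resource)]


if __name__ == "__main__":
    bucket = {"type": "bucket", "public": True, "tags": {"owner": "team-a"}}
    print(violations(bucket))
```

A product team could run a check like this in its own pipeline. Cloud Ops owns the rule list, not the pipeline.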

🧭 Leads the Cloud Center of Excellence (CCoE)

A strong Cloud Ops team leads the organization's CCoE, guiding cloud strategy, governance, and enablement.

💰 Owns FinOps Practices

Cloud Ops should drive cost optimization, cloud budgeting, and cost visibility across teams. Cloud costs are everyone's responsibility, but Cloud Ops makes that responsibility actionable by showing each team what it spends.
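As a rough sketch of what cost visibility can mean in practice, the snippet below rolls billing line items up by a `team` tag and flags teams that exceed a budget. The line-item shape (`cost_usd`, `tags`) and the budget mapping are assumptions made for this example. Real billing exports differ by provider.

```python
from collections import defaultdict


def cost_by_team(line_items):
    """Sum cost per team tag; untagged spend is grouped under 'untagged'."""
    totals = defaultdict(float)
    for item in line_items:
        team = item.get("tags", {}).get("team", "untagged")
        totals[team] += item["cost_usd"]
    return dict(totals)


def over_budget(totals, budgets):
    """Return teams whose spend exceeds their budget, sorted by name.

    Teams without a budget entry are never flagged.
    """
    return sorted(
        team for team, spent in totals.items()
        if spent > budgets.get(team, float("inf"))
    )


if __name__ == "__main__":
    items = [
        {"cost_usd": 120.0, "tags": {"team": "payments"}},
        {"cost_usd": 30.0, "tags": {"team": "payments"}},
        {"cost_usd": 75.0, "tags": {}},
    ]
    totals = cost_by_team(items)
    print(totals)
    print(over_budget(totals, {"payments": 100.0}))
```

The "untagged" bucket matters as much as the per-team totals: it shows how much spend nobody currently owns.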

🏗️ Defines Cloud Architecture

From choosing the right building blocks to ensuring best practices for resilience and scalability, Cloud Ops leads architectural decisions without owning all the implementation.

🔐 Drives Security Best Practices

Cloud Ops builds and enforces cloud security policies, IAM standards, and compliance guidelines in collaboration with security teams.

🛠️ Maintains Cloud Policies & Service Catalogs

They manage reusable infrastructure components and approved services, empowering teams to move fast within well-defined boundaries.
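One way to picture a service catalog with boundaries is as a small lookup that teams validate their own requests against, with no ticket involved. The catalog entries, size names, and replica limits below are invented for illustration. A real catalog would live in whatever tooling the organization already uses.

```python
# Hypothetical catalog: service name -> limits that Cloud Ops has approved.
CATALOG = {
    "postgres-db": {"allowed_sizes": {"small", "medium"}, "max_replicas": 3},
    "object-store": {"allowed_sizes": {"standard"}, "max_replicas": 1},
}


def validate_request(service, size, replicas=1):
    """Return a list of problems; an empty list means the request is within bounds."""
    entry = CATALOG.get(service)
    if entry is None:
        return [f"{service!r} is not in the approved catalog"]
    problems = []
    if size not in entry["allowed_sizes"]:
        problems.append(f"size {size!r} is not offered for {service!r}")
    if not 1 <= replicas <= entry["max_replicas"]:
        problems.append(f"replicas must be between 1 and {entry['max_replicas']}")
    return problems


if __name__ == "__main__":
    print(validate_request("postgres-db", "medium", replicas=2))
    print(validate_request("postgres-db", "xlarge", replicas=5))
```

Requests inside the limits need no approval step. Only requests outside them start a conversation with Cloud Ops.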

🚀 Enables Platform Engineering

Cloud Ops can operate as a platform team, building developer portals, paved paths, and internal tooling that make the cloud easier and safer to consume.

📚 Educates & Consults

Most importantly, Cloud Ops should be a force multiplier—consulting with engineering teams, running enablement sessions, and building documentation—not inserting itself into every delivery pipeline.


What Cloud Ops Should Not Be Doing

Let’s be clear on what Cloud Ops is not:

  • ❌ Building product-specific CI/CD pipelines

  • ❌ Owning application monitoring dashboards

  • ❌ Managing infrastructure per product team

  • ❌ Sitting in the critical path of every engineering workflow

That’s not scalable. That’s old-school IT.


Final Thoughts

Cloud Ops isn’t about rebranding your IT department or centralizing DevOps into a one-size-fits-all model. Done right, it’s a strategic capability that empowers engineering teams to innovate faster—with guardrails, not gates.

If you’re building a Cloud Ops function, make sure it's designed for enablement—not control.

Because in the cloud era, velocity is the new uptime.

