Blog
How to keep cloud costs under control (and what to do if they get away from you)
When workloads shift to the public cloud, it’s suddenly easy to try new things. The result can be bill shock for finance. Here's how to keep costs under control.
Recently I participated in TelecomTV’s Telcos and Public Cloud Summit where an audience poll asked about the main barriers between telcos and the public cloud:
As usual, the risk of vendor lock-in was regarded as the biggest barrier with 52% of the vote (eyeroll); but coming in a strong second was ‘concerns over public cloud pricing and associated costs,’ which received votes from 46% of the audience. Ray LeMaistre writes, “It is becoming an increasing worry, and there have long been suggestions that using the public cloud is not always as cost effective and flexible as it might seem.”
Oh, brother! I can already hear the telco execs mumbling from here. Before you go running back to your on-premise data center, hold the phone. You’ll spend more money in the public cloud only if you’re doing it really, really wrong. So, instead of running back to the headache of managing on-premise stacks, how about you learn how to become a cost-optimization ninja instead?
To start, let’s look at what is likely the largest category on your cloud bill: your spend on cloud computing resources. To determine the price of compute resources, Amazon Web Services (AWS), Google Cloud, and Microsoft Azure use these three pricing models:
AWS claims that customers can save as much as 90% by using AWS Spot Instances over On-Demand Instances, and 50-70% over its Reserved Instances. Google Cloud says its preemptible VM instances typically cost 70-80% less than on-demand rates, and 50% less than three-year reserved VMs. Telcos should totally optimize their compute usage alongside these pricing strategies, and when they do, they will save a ton of money.
Reserve and spot instances are different in three big ways:
All three hyperscalers offer the idea of a reserved instance; AWS calls it simply Reserved Instances. Azure has Reserved Virtual Machine (VM) Instances and Google Cloud has reserve VM capacity. It’s all the same basic idea—if you make a long-term commitment (one to three years) to use a specific instance type from a hyperscaler, you’ll secure the resources and get a discounted hourly rate. The longer the commitment and bigger the upfront payment you’re willing to make, the more money you’ll save. Is it a good strategy? Sure, for a number of reasons, including:
The downside is that you’re locked into the spend. If your needs change, you may end up paying for capacity you don’t need.
A lot of people think using reserved instances is the safest approach to saving on compute. But is it really the BEST way to save? Not necessarily.
AWS’s Spot Instances, Google Cloud’s Spot VMs or Preemptible VMs, and Azure’s Spot VMs offer unused compute capacity at significantly lower prices—like 90% lower. Why the steep discount? You can lose them to a higher bidder at any moment, with only a couple minutes’ notice. If you’re willing and able to move your workloads to accommodate this dynamism, you can save big bucks.
Here are some examples that are a good fit for spot instances:
Spot instances come with risks due to price volatility and potential terminations. They may not be suitable for workloads with strict uptime requirements or real-time processing needs. You should consider these trade-offs and assess the suitability of spot instances based on your workload’s characteristics and requirements.
Most telco execs feel spot instances can’t be used for their workloads. But I know of a UK MVNO that kept an open mind on spot instances, figured out how to make work, and is now reveling in the savings.
Steve McDonald is COO and CTO at giffgaff, a UK mobile virtual network operator (MVNO). In Episode 68 of my Telco in 20 podcast, Steve shared how giffgaff uses AWS Spot Instances to save big money. He points out (at timestamp 12:25) that once giffgaff moved its operation to AWS, it moved its entire production estate to EC2 Spot Instances. To make it work for giffgaff he says, “You have to be in a position where you can build a new server and insert it into the cluster. We do that now about 60-80 times a day.”
How cool is that? 😎
giffgaff tossed conventional wisdom out the window, and refactored its entire production estate so it can run on AWS Spot Instances. You see, most people tend to overestimate the likelihood of preemption of a spot instance and underestimate their tolerance for it. In reality, instance preemption is not as common as you may think in most instance families. By being flexible with different instance types, significant cost savings can be achieved without significant negative impact. With a well-designed spot usage strategy, giffgaff has likely been able to achieve the 50-90% savings Amazon claims. Essentially, by utilizing AWS Spot Instances, giffgaff can take advantage of a market inefficiency with respect to compute pricing while others overspend on elastic compute (EC2) spend. That’s a smart move. Want to learn way more than you ever thought possible about AWS pricing inefficiencies with regard to spot instance? Read this great blog from Eric Pauley for more information on this cost optimization strategy.
The telcos and vendors that win big with the public cloud will be those that get really skilled at optimizing the cost of their workloads. Watch out if your teammate or vendor is recommending an exclusively reserved instance to run their workload. Discuss spot instances with them to see if you can squeeze out more savings and make your money go further. By saving on compute, you can stretch your budget and fund all those cool public cloud projects you need to work on, like genAI and, of course, Totogi!
Want more clever cost-saving tips like this? We do, too! In fact, TelcoDR is exploring an exclusive deal with software startup CloudFix to offer a cloud optimization tool for telcos that will save big money, too. We’ll be announcing it in a few weeks. Stay tuned, and keep on optimizing your cost in the cloud!
WATCH:
AWS Made Easy: Cut costs in half by right-sizing AWS EC2 instances
CloudFix Founder and CEO Rahul Subramaniam offers practical tips on cost optimization and cutting costs by right-sizing AWS EC2 instances.
Recent Posts