Scaling Smart: Introducing Choreo’s Scale-to-Zero for Optimal Resource Utilization

Products
Technology for your digital platform that you can run yourself, in a private cloud, or as SaaS.
API Management
Integration
Identity & Access Management
Your Digital Platform, as a Service
Platform
- Your internal developer platform as a service to quickly deliver applications.
Solutions
By Technology
- API Management
  Solutions to deliver your APIs in any environment.
- Integration
  Break down data silos, streamline workflows, and unlock insights.
- Identity & Access Management
  Create secure, unique, and compelling digital experiences with ease.
  
  Customer IAM
  
  For Consumers (B2C)
  
  For B2B Applications (B2B CIAM)
  
  For Citizens (G2C)
  
  Workforce IAM
  
  API Access Management
By Industry
- Healthcare
  Provide awesome digital experiences for payers, providers, and patients.
- Finance
  Deliver secure financial experiences at speed.
- Government
  Deliver secure and simplified digital services for governments and citizens.
Featured Blog
- Implementing an Event-Driven GraphQL BFF with Real-Time Notifications
  Read Blog
Resources
- Training and Certification
- Events
Featured Whitepaper
- Transactions in a Microservice World
  Read Whitepaper
Support
- Subscription
  Learn About the Benefits of a WSO2 Subscription for Commercial Use.
- Updates
  Maintain the health and security of your solution.
- Consulting Services
  A full range of services to help you get the most out of your WSO2 project.
Featured Blog
- Self Manage Your WSO2 Support Portal Users
  Read Blog
Company
- About WSO2
  Learn about who we are and our journey.
- Team
  The people behind our innovative technologies.
- Customers
  See how we’ve helped leading enterprises from around the world.
- Careers
  Discover how you can make an impact.
- Partners
  Learn about the benefits of partnering with us.
- News
  The latest updates and insights.
Customer Spotlight
- YOMA x WSO2: Building Better Banking Relationships
  Read Case Study
Community
Contact Us
EN
- French
- German
- Portuguese
- Spanish
Profile

Lakmal Warusawithana
Senior Director - Cloud Architecture - WSO2

While the cloud facilitates quicker and easier completion of tasks, it’s important to use resources efficiently. Careless usage can lead to high costs over time. Choreo’s new scale-to-zero feature allows you to minimize costs by scaling down application resource usage to almost zero when not in active use. This capability is a significant step forward towards apps being more resource-conscious and cost-effective.

Choreo’s serverless service model

Scale-to-zero is primarily aimed at service types within Choreo. Traditionally, services are designed to run continuously, ready to handle requests at any moment. However, in reality, many services don’t receive continuous requests, yet they remain running as there’s no mechanism to scale down when idle and scale up on demand. This is the gap that Choreo's latest feature aims to bridge, offering a significant improvement in efficiency.

Now services in Choreo can scale down when they are not in use so that idle services don’t unnecessarily consume resources. But what happens when a new request comes in? Here's the clever part: the first incoming request is temporarily held back while Choreo instructs its internal APIs to scale up the service workload. Once the service is adequately scaled and ready, the request is then processed.

Scale-to-zero will be enabled by default

With the scale-to-zero feature, Choreo is transforming how HTTP-based services operate within its ecosystem. This change affects all types of services, including public APIs, internal APIs used within organizations, and all webapps. By default, these services will now adopt scale-to-zero configurations.

But what does this mean in practice? The minimum replica count for services is now set to zero, enabling full scalability. Users have the flexibility to set a maximum replica count based on their anticipated service load. When the need arises, the service scales up by adding more replicas to the cluster in response to incoming requests.

The scaling of services is based on the number of requests waiting in the load balancer, which manages the traffic to the services. This method is particularly suitable for HTTP-based services because it focuses on the real-time demand instead of just monitoring the CPU and memory usage within containers. Users can adjust the queue size in the load balancer, giving them control over how quickly and efficiently their services scale up to meet demand.

Is HPA being phased out?

The Horizontal Pod Autoscaler (HPA) will remain unchanged. If users prefer to scale their application services based on CPU or memory consumption, they can still use HPA. However, unlike scale-to-zero, HPA doesn't allow scaling down to zero replicas since CPU and memory usage never reaches zero.

Summary

Read our documentation to discover more about how to use scale-to-zero in your application. If you haven't already, sign up and begin your journey with Choreo today for free.

Language English

Implementing an Event-Driven GraphQL BFF with Real-Time Notifications

Transactions in a Microservice World

Self Manage Your WSO2 Support Portal Users

YOMA x WSO2: Building Better Banking Relationships

Scaling Smart: Introducing Choreo’s Scale-to-Zero for Optimal Resource Utilization

Choreo’s serverless service model

Scale-to-zero will be enabled by default

Is HPA being phased out?

Summary

API Management

Integration

IAM

Internal Developer Platform

Solutions

Resources

Support

Company

Follow us