GCP GKE SLA Credits & Refunds Guide
How the GCP GKE SLA works: uptime tiers, exclusions, claim windows, and how to recover the credits you're owed when GKE goes down.
GCP GKE SLA Credits & Refunds
Google Cloud publishes a service-specific SLA for GKE that describes exactly when a compute workload qualifies for credits — and the thresholds are stricter than most teams realize. This guide breaks down the GKE commitment, what Google considers a downtime period, and how to file a financial credit request through the Cloud Console.
What this guide covers
- The official GCP GKE uptime commitment and credit tiers
- Which incidents qualify (and which exclusions silently disqualify claims)
- How to file a GKE credit request inside the GCP claim window
- Why manual claim recovery typically leaves money on the table
Frequently asked questions about GCP GKE SLAs
What is the typical SLA uptime guarantee for GCP GKE?
Google publishes tiered uptime commitments for GKE: 99.95% monthly uptime for the Kubernetes control plane in regional clusters and 99.5% for the control plane in zonal clusters. Autopilot clusters add a pod-level SLA of 99.9% (regional) for workloads scheduled across multiple zones. The node pool (your worker VMs) inherits the standard Compute Engine SLA.
How do I claim GCP GKE SLA credits after an outage?
File a Financial Credit Request through Google Cloud Support within 30 days of the end of the affected billing month — the deadline is shorter than AWS or Azure, which catches a lot of teams out. Include your Project ID, the affected GKE resources, downtime intervals (with timezone), supporting evidence from Cloud Monitoring or your own observability stack, and a calculation showing where Monthly Uptime Percentage fell below the SLA threshold. Google issues approved credits against your billing account, not as cash refunds.
What exclusions apply to the GCP GKE SLA?
Specifically for GKE, downtime caused by your workload configuration (failing readiness probes, OOMKills, missing PodDisruptionBudgets during upgrades, or single-replica deployments that lose their only pod) is excluded — Google's SLA covers the control plane and platform, not your scheduling decisions.
Why is it difficult to get refunds for GKE outages manually?
Compute outages rarely show up cleanly. A GKE workload might be partially degraded — some instances fail, others stay healthy — and the SLA only counts the portion of capacity that was actually unavailable. To prove a claim you need per-instance (or per-task) error-rate data, not just an aggregate dashboard. Most teams discover this only when their first credit request comes back denied for "insufficient evidence."
Related GCP SLA guides
Other Google Cloud services with their own published SLA and 30-day claim window:
- GCP Compute Engine SLA credits — Compute
- GCP Cloud Functions SLA credits — Compute
- GCP Cloud Run SLA credits — Compute
- GCP Cloud Storage SLA credits — Storage
Don't miss GCP's 30-day claim window
GCP's claim deadline for GKE is the shortest of the three major clouds, and most teams miss it for the same reason: nobody owns "file SLA credit requests" as a recurring task. By the time finance closes out the month, the window is already gone.
Next Signal monitors GKE availability, files the Financial Credit Request inside Google's deadline, and tracks the claim through resolution. See how it works or start a free trial.
Related SLA guides
Other GCP services with their own SLA credit recovery process.