Skip to main content

Azure Azure OpenAI SLA Credits & Refunds Guide

How the Azure Azure OpenAI SLA works: uptime tiers, exclusions, claim windows, and how to recover the credits you're owed when Azure OpenAI goes down.

Azure Azure OpenAI SLA Credits & Refunds

Azure OpenAI downtime that's billed against your Azure subscription is usually creditable, but the SLA fine print determines how much. This guide walks through the Azure OpenAI availability commitment Microsoft publishes, the exclusions that quietly disqualify many claims, and what FinOps teams do to systematically recover credits across an Azure tenant.

What this guide covers

  • The official Azure Azure OpenAI uptime commitment and credit tiers
  • Which incidents qualify (and which exclusions silently disqualify claims)
  • How to file an Azure OpenAI credit request inside the Azure claim window
  • Why manual claim recovery typically leaves money on the table

Frequently asked questions about Azure Azure OpenAI SLAs

What is the typical SLA uptime guarantee for Azure Azure OpenAI?

Azure guarantees 99.9% uptime for Azure OpenAI Service on Standard (pay-as-you-go) and Provisioned Throughput Units (PTUs) deployments. Preview models, fine-tuning operations, and Batch API jobs are not covered by the availability SLA. If Azure fails to meet this commitment during a billing cycle, you are eligible to receive a portion of your Azure OpenAI spend back as a service credit.

How do I claim Azure Azure OpenAI SLA credits after an outage?

Submit a billing support request through the Azure portal: Help + Support → New support request → Issue type: Billing → Problem type: Service credit request. Within two months of the billing period in question, provide the affected Subscription ID and Resource ID, the start and end timestamps of the impacted period, your evidence (Azure Monitor logs, Resource Health alerts, or independent monitoring), and your calculated Monthly Uptime Percentage for Azure OpenAI. Microsoft validates against its internal incident records before issuing the credit to your billing account.

What exclusions apply to the Azure Azure OpenAI SLA?

Crucially, HTTP 429 throttling for exceeding your deployment's tokens-per-minute (TPM) quota is not considered downtime, and models in preview or in their deprecation window are explicitly excluded from SLA coverage.

Why is it difficult to get refunds for Azure OpenAI outages manually?

AI/ML SLAs are still maturing, and Azure OpenAI carries some of the most nuanced terms in the cloud catalog. Rate limits, queue depths, and model availability all get measured differently, and the SLA often excludes throttling that the provider deems "expected." Teams that successfully claim Azure OpenAI credits do so by capturing per-request latency and error-code data and matching it precisely against the published terms.

Related Azure SLA guides

Other Azure services creditable through the same portal-based billing request process:

Recover Azure credits without a portal grind

Azure billing support requests for Azure OpenAI aren't difficult to file — they're tedious. Each one takes the same kind of subscription-ID, resource-ID, timestamp, and uptime-calculation packaging, repeated for every incident across every subscription you own.

Next Signal detects Azure OpenAI SLA breaches across your Azure tenants, packages the credit request in the format Microsoft expects, and submits it. See how it works or start a free trial.