> azure-reliability

Expert knowledge for Azure Reliability development, including best practices, decision making, architecture & design patterns, limits & quotas, and deployment. Use when designing zonal or zone-redundant availability zone setups, resilient Functions, AKS, MySQL HA migrations, or Queue Storage limits, and for other Azure Reliability-related development tasks. Not for Azure Resiliency (use azure-resiliency), Azure Monitor (use azure-monitor), Azure Service Health (use azure-service-health), Azure SRE Agent (use azure-sre-

fetch
$ curl "https://skillshub.wtf/MicrosoftDocs/Agent-Skills/azure-reliability?format=md"
SKILL.md: azure-reliability

Azure Reliability Skill

This skill provides expert guidance for Azure Reliability. It covers best practices, decision making, architecture & design patterns, limits & quotas, and deployment, and it combines local quick-reference content with remote documentation fetching.

How to Use This Skill

IMPORTANT for Agent: Use the Category Index below to locate relevant sections. For categories with line ranges (e.g., L35-L120), use read_file with the specified lines. For categories with file links (e.g., [security.md](security.md)), use read_file on the linked reference file.
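
As a rough illustration of the line-range convention above, the sketch below parses a range such as L35-L120 and selects those lines from a text. The helper names `parse_line_range` and `select_lines` are hypothetical and are not part of the skill or of the read_file tool; the sketch assumes ranges are 1-indexed and inclusive.

```python
import re


def parse_line_range(spec: str) -> tuple[int, int]:
    """Parse a range like 'L35-L120' into (start, end), 1-indexed and inclusive."""
    match = re.fullmatch(r"L(\d+)-L(\d+)", spec.strip())
    if match is None:
        raise ValueError(f"not a line range: {spec!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if start < 1 or end < start:
        raise ValueError(f"invalid line range: {spec!r}")
    return start, end


def select_lines(text: str, spec: str) -> str:
    """Return only the lines of `text` covered by the range (assumed inclusive)."""
    start, end = parse_line_range(spec)
    return "\n".join(text.splitlines()[start - 1:end])
```

An agent with a real read_file tool would pass the parsed start and end values to that tool instead of slicing the text itself.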

IMPORTANT for Agent: If metadata.generated_at is more than 3 months old, suggest the user pull the latest version from the repository. If mcp_microsoftdocs tools are not available, suggest the user install it: Installation Guide
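
The freshness rule above could be checked along these lines. This is a minimal sketch under two assumptions that the skill does not state: that metadata.generated_at is an ISO 8601 date string (YYYY-MM-DD), and that "3 months" can be approximated as 90 days. The function name `is_stale` is hypothetical.

```python
from datetime import date, timedelta


def is_stale(generated_at: str, today: date, max_age_days: int = 90) -> bool:
    """Return True if generated_at is older than max_age_days.

    Assumes generated_at is an ISO 8601 date (YYYY-MM-DD); 90 days is an
    approximation of the 3-month threshold, not a value taken from the skill.
    """
    generated = date.fromisoformat(generated_at)
    return today - generated > timedelta(days=max_age_days)
```

For instance, with a hypothetical generated_at of 2026-01-15, a check run on 2026-06-01 would report the skill as stale, while a check run on 2026-02-01 would not.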

This skill requires network access to fetch documentation content:

  • Preferred: Use mcp_microsoftdocs:microsoft_docs_fetch with query string from=learn-agent-skill. Returns Markdown.
  • Fallback: Use fetch_webpage with query string from=learn-agent-skill&accept=text/markdown. Returns Markdown.
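
For the fallback path, the query string above has to be appended to each documentation URL before fetching. The sketch below shows one way to build such a URL with the Python standard library; the function name `build_fallback_url` is hypothetical, and keeping the slash in text/markdown unescaped is an assumption about what the endpoint expects. The fetch itself is left to whichever tool is available.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def build_fallback_url(doc_url: str) -> str:
    """Append from=learn-agent-skill&accept=text/markdown to a docs URL.

    Existing query parameters are preserved; the two skill parameters
    overwrite any earlier values with the same names.
    """
    parts = urlsplit(doc_url)
    query = dict(parse_qsl(parts.query))
    query.update({"from": "learn-agent-skill", "accept": "text/markdown"})
    # safe="/" keeps "text/markdown" readable (assumption: the server accepts it)
    return urlunsplit(parts._replace(query=urlencode(query, safe="/")))
```

Applied to the AKS reliability link from the Best Practices list, this yields that URL followed by ?from=learn-agent-skill&accept=text/markdown.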

Category Index

| Category | Lines | Description |
|---|---|---|
| Best Practices | L33-L62 | Patterns and configurations to make Azure services (AKS, Functions, DBs, messaging, IoT, networking, DR) highly available, resilient to failures, and recover quickly from outages. |
| Decision Making | L63-L68 | Guidance on using availability zones, nonregional services, and resilient Azure Functions architectures to design highly available, fault-tolerant Azure solutions. |
| Architecture & Design Patterns | L69-L74 | Designing Azure apps for availability zones, choosing zonal vs zone-redundant resources, and hardening deployments for zone failures and high availability. |
| Limits & Quotas | L75-L79 | Details on Azure Queue Storage message size limits, including max message size, behavior when limits are exceeded, and best practices for handling large payloads. |
| Deployment | L80-L83 | Guidance for migrating Azure Database for MySQL Flexible Server to zone-redundant high availability, including architecture, prerequisites, and step-by-step migration process. |

Best Practices

| Topic | URL |
|---|---|
| Design resilient clusters in Azure Kubernetes Service | https://learn.microsoft.com/en-us/azure/reliability/reliability-aks |
| Configure reliability for Azure API Center | https://learn.microsoft.com/en-us/azure/reliability/reliability-api-center |
| Harden Azure App Service Environment reliability | https://learn.microsoft.com/en-us/azure/reliability/reliability-app-service-environment |
| Architect highly available Azure Application Gateway v2 | https://learn.microsoft.com/en-us/azure/reliability/reliability-application-gateway-v2 |
| Configure high availability in Azure Database for PostgreSQL | https://learn.microsoft.com/en-us/azure/reliability/reliability-azure-database-postgresql |
| Plan reliability for Azure Bot Service | https://learn.microsoft.com/en-us/azure/reliability/reliability-bot |
| Configure reliability for Azure Chaos Studio | https://learn.microsoft.com/en-us/azure/reliability/reliability-chaos-studio |
| Achieve high availability in Azure Cosmos DB NoSQL | https://learn.microsoft.com/en-us/azure/reliability/reliability-cosmos-db-nosql |
| Harden Azure Data Factory for outages | https://learn.microsoft.com/en-us/azure/reliability/reliability-data-factory |
| Implement resilient architectures in Azure Databricks | https://learn.microsoft.com/en-us/azure/reliability/reliability-databricks |
| Ensure reliability for Azure Device Registry metadata | https://learn.microsoft.com/en-us/azure/reliability/reliability-device-registry |
| Design high availability for Azure DocumentDB | https://learn.microsoft.com/en-us/azure/reliability/reliability-documentdb |
| Design resilient architectures with Azure Event Grid | https://learn.microsoft.com/en-us/azure/reliability/reliability-event-grid |
| Increase reliability of Azure Event Hubs streaming | https://learn.microsoft.com/en-us/azure/reliability/reliability-event-hubs |
| Design reliable analytics with Microsoft Fabric | https://learn.microsoft.com/en-us/azure/reliability/reliability-fabric |
| Implement resilient architectures with Azure Functions | https://learn.microsoft.com/en-us/azure/reliability/reliability-functions |
| Implement disaster recovery for Azure Image Builder | https://learn.microsoft.com/en-us/azure/reliability/reliability-image-builder |
| Design resilient device connectivity with Azure IoT Hub | https://learn.microsoft.com/en-us/azure/reliability/reliability-iot-hub |
| Design resilient workflows with Azure Logic Apps | https://learn.microsoft.com/en-us/azure/reliability/reliability-logic-apps |
| Increase reliability of Azure Managed Redis caches | https://learn.microsoft.com/en-us/azure/reliability/reliability-managed-redis |
| Improve reliability of Azure Notification Hubs | https://learn.microsoft.com/en-us/azure/reliability/reliability-notification-hubs |
| Design resilient disaster recovery with Azure Site Recovery | https://learn.microsoft.com/en-us/azure/reliability/reliability-site-recovery |
| Implement resilient architectures in Azure SQL Database | https://learn.microsoft.com/en-us/azure/reliability/reliability-sql-database |
| Increase reliability of Azure Stream Analytics jobs | https://learn.microsoft.com/en-us/azure/reliability/reliability-stream-analytics |
| Plan high availability in Azure VMware Solution | https://learn.microsoft.com/en-us/azure/reliability/reliability-vmware-solution |

Decision Making

| Topic | URL |
|---|---|
| Identify Azure services with availability zone support | https://learn.microsoft.com/en-us/azure/reliability/availability-zones-service-support |
| Select and understand Azure nonregional services | https://learn.microsoft.com/en-us/azure/reliability/regions-nonregional-services |

Architecture & Design Patterns

| Topic | URL |
|---|---|
| Enable and plan zone-resilient Azure workloads | https://learn.microsoft.com/en-us/azure/reliability/availability-zones-enable-zone-resiliency |
| Design and harden zonal Azure resource deployments | https://learn.microsoft.com/en-us/azure/reliability/availability-zones-zonal-resource-resiliency |

Limits & Quotas

| Topic | URL |
|---|---|
| Understand Azure Queue Storage message size limits | https://learn.microsoft.com/en-us/azure/reliability/reliability-storage-queue |

Deployment

| Topic | URL |
|---|---|
| Migrate MySQL Flexible Server to zone-redundant HA | https://learn.microsoft.com/en-us/azure/reliability/migrate-database-mysql-flex |

> related_skills --same-repo

> microsoft-foundry

Expert knowledge for Microsoft Foundry (aka Azure AI Foundry) development including troubleshooting, best practices, decision making, architecture & design patterns, limits & quotas, security, configuration, integrations & coding patterns, and deployment. Use when building Foundry agents with Azure OpenAI, vector search/RAG, Sora video, realtime audio, or MCP/LangChain APIs, and other Microsoft Foundry related development tasks. Not for Microsoft Foundry Classic (use microsoft-foundry-classic),

> microsoft-foundry-tools

Expert knowledge for Microsoft Foundry Tools (aka Azure AI services, Azure Cognitive Services) development including best practices, decision making, architecture & design patterns, limits & quotas, security, configuration, integrations & coding patterns, and deployment. Use when using Content Understanding analyzers, Content Moderator APIs, Foundry containers, VNet/Key Vault security, or Entra auth, and other Microsoft Foundry Tools related development tasks. Not for Microsoft Foundry (use micr

> microsoft-foundry-local

Expert knowledge for Microsoft Foundry Local (aka Azure AI Foundry Local) development including troubleshooting, best practices, decision making, configuration, and integrations & coding patterns. Use when using Foundry Local CLI, chat/transcription APIs, tools, OpenAI/LangChain clients, or upgrading legacy SDKs, and other Microsoft Foundry Local related development tasks. Not for Microsoft Foundry (use microsoft-foundry), Microsoft Foundry Classic (use microsoft-foundry-classic), Microsoft Foun

> microsoft-foundry-classic

Expert knowledge for Microsoft Foundry Classic (aka Azure AI Foundry classic) development including troubleshooting, best practices, decision making, architecture & design patterns, limits & quotas, security, configuration, integrations & coding patterns, and deployment. Use when building Foundry agents with RAG, tools, evaluators, Azure OpenAI, VNet/Private Link, or CI/CD deployments, and other Microsoft Foundry Classic related development tasks. Not for Microsoft Foundry (use microsoft-foundry

┌ stats

installs/wk: 0
github stars: 525
first seen: Mar 17, 2026
└────────────

┌ repo

MicrosoftDocs/Agent-Skills
by MicrosoftDocs
└────────────
