azure-diagnostics
Debug Azure Production Issues
This skill helps developers diagnose and troubleshoot production issues in Azure by providing structured debugging workflows, diagnostic commands, and KQL queries for Container Apps.
Download the skill ZIP
Upload in Claude
Go to Settings → Capabilities → Skills → Upload skill
Toggle on and start using
Test it
Using "azure-diagnostics". My Container App shows status 'Running' but returns 502 errors
Expected outcome:
- Check the ingress targetPort configuration matches your app's listening port
- Verify the health endpoint returns HTTP 200
- Review container startup logs for errors
- Ensure Dockerfile EXPOSE matches ingress configuration
Using "azure-diagnostics". Write a KQL query to find failed requests
Expected outcome:
- AppRequests
- | where Success == false
- | where TimeGenerated > ago(1h)
- | summarize count() by Name, ResultCode
- | order by count_ desc
Security Audit
SafeThis skill contains only documentation with example Azure CLI commands and KQL queries. Static scanner flagged 70 instances of 'external_commands' but these are markdown code blocks showing example commands, not executable code. Network findings are documentation URLs. Env access findings reference standard Azure Container Apps environment variables. All high/critical findings are false positives - the skill is safe for publication.
Critical Issues (1)
High Risk Issues (1)
Medium Risk Issues (3)
Quality Score
What You Can Build
Debug Container App image pull failures
Diagnose why a container image fails to pull from Azure Container Registry and get step-by-step fix commands
Analyze application logs with KQL
Query Application Insights to find recent errors, failed requests, and slow performance issues
Fix cold start timeout issues
Identify and resolve slow first-request issues in Azure Container Apps by adjusting replica settings
Try These Prompts
My Azure Container App is failing to start. Help me diagnose the issue using the azure-diagnostics skill.
I keep getting image pull errors in my Container App. How do I fix ACR authentication issues?
Help me write KQL queries to find errors in my Application Insights logs from the last hour.
My container keeps restarting due to health probe failures. What should I check?
Best Practices
- Start with resource health checks before diving into detailed logs
- Use systematic diagnosis flow: symptoms -> health -> logs -> metrics -> changes
- Leverage AppLens MCP for AI-powered root cause analysis when available
- Document all attempted remediation steps for future reference
Avoid
- Do not skip health checks when investigating container restarts
- Do not assume Azure service health without checking HealthResources first
- Do not ignore provisioning state failures - these indicate deployment issues
Frequently Asked Questions
What does this skill actually do?
Can this skill fix my Azure issues?
What Azure services are supported?
Do I need Azure CLI installed?
What is AppLens?
How do I query Azure Resource Graph?
Developer Details
Author
microsoftLicense
MIT
Repository
https://github.com/microsoft/github-copilot-for-azure/tree/main/plugin/skills/azure-diagnostics/Ref
main
File structure