error-diagnostics-error-analysis

Safe

Diagnose Production Errors

This skill helps developers quickly diagnose and resolve production errors using systematic debugging techniques, distributed tracing analysis, and comprehensive observability patterns.

Supports: Claude Codex Code(CC)
🥉 74 Bronze
1. Download the skill ZIP

2. Import into Claude

Go to Settings → Capabilities → Skills → Import skill

3. Enable and start using

Try it

Using "error-diagnostics-error-analysis". Error: Database connection timeout after 30s in order-service

Expected result:

  • Analysis: Connection pool exhaustion detected
  • Root cause: Long-running queries holding all connections
  • Recommended fix: Implement query timeout and optimize N+1 patterns
  • Prevention: Add circuit breaker and connection pool monitoring

Using "error-diagnostics-error-analysis". Intermittent 502 errors in API gateway

Expected result:

  • Pattern: Errors occur during peak traffic windows
  • Correlation: New autoscaling policy deployed yesterday
  • Root cause: Backend services scaling slower than load balancer
  • Recommendation: Adjust scaling parameters and add health check validation

Security audit

Safe
v1 • 2/24/2026

All 108 static findings were evaluated and found to be false positives. The scanner misinterpreted markdown documentation patterns as security issues: backticks in code blocks were flagged as shell execution, example URLs were flagged as network exfiltration, and environment variable reads in example error-tracking code were flagged as credential access. The sensitive data deletion code (lines 751-752) is a security best practice that removes cookies/authorization headers before sending error reports. This is a legitimate error diagnostics skill providing observability documentation.

Files scanned: 2
Lines scanned: 1,194
Findings: 0
Total audits: 1
No security issues found
Audited by: claude

Quality score

Architecture: 38
Maintainability: 100
Content: 87
Community: 50
Security: 100
Specification compliance: 91

What you can build

Investigate Production Incidents

Analyze production errors, correlate with deployments, and identify root cause using distributed tracing and log analysis.

Debug Application Errors

Examine stack traces, identify error patterns, and implement fixes for application-level errors.

Improve System Observability

Design and implement comprehensive error tracking, monitoring, and alerting solutions for better incident detection.

Try these prompts

Basic Error Analysis
Analyze this error message and stack trace. Identify the likely cause and suggest a fix: $ERROR_MESSAGE
Distributed System Debugging
Debug this distributed system error. The error occurred in service $SERVICE_NAME with trace ID $TRACE_ID. Examine the distributed trace and identify which upstream service caused the failure.
Post-Incident Review
Conduct a post-incident review for this outage. Error pattern: $ERROR_PATTERN. Timeline: $TIMELINE. What were the contributing factors and what preventive measures would you recommend?
Observability Implementation
Design an observability implementation for a Node.js/Express application. Include error tracking setup with Sentry, distributed tracing with OpenTelemetry, and alerting rules for critical errors.

Best practices

  • Always correlate errors with deployments, configuration changes, and external events
  • Implement structured logging with correlation IDs for distributed tracing
  • Create retry logic with exponential backoff for transient failures
  • Establish error budgets and alerting thresholds based on user impact
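
The retry practice above can be sketched as follows. This is a minimal illustration, not code from the skill: `TransientError`, `retry_with_backoff`, and all default values are hypothetical names and numbers chosen for the example. It uses capped exponential backoff with full jitter, which is one common variant among several.

```python
import random
import time
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


class TransientError(Exception):
    """Hypothetical marker for failures that are worth retrying."""


def retry_with_backoff(
    operation: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 0.1,
    max_delay: float = 5.0,
    retry_on: Tuple[Type[BaseException], ...] = (TransientError,),
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call operation, retrying listed failures with capped exponential backoff."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except retry_on:
            if attempt == max_attempts:
                raise  # out of attempts: propagate the last error unchanged
            # The delay ceiling doubles each attempt and is capped at max_delay.
            ceiling = min(max_delay, base_delay * 2 ** (attempt - 1))
            # Full jitter: a random wait up to the ceiling avoids synchronized retries.
            sleep(random.uniform(0, ceiling))
    raise RuntimeError("unreachable")
```

Only exceptions listed in `retry_on` are retried; anything else propagates immediately, so permanent failures such as validation errors are not masked by repeated attempts.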

Avoid

  • Ignoring intermittent errors - they often indicate systemic issues
  • Implementing generic error handling without context-specific recovery
  • Sending raw error data to external systems without scrubbing sensitive information
  • Setting alerting thresholds too low, causing alert fatigue
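
As an illustration of scrubbing before an error report leaves the process, here is a minimal sketch. The event shape (a dict with a `request` entry holding `headers` and `cookies`), the `scrub_event` name, and the `SENSITIVE_HEADERS` list are assumptions made for the example, not the skill's actual code.

```python
import copy
from typing import Any, Dict

# Assumed list of sensitive header names; extend it for your own stack.
SENSITIVE_HEADERS = {"authorization", "cookie", "set-cookie", "x-api-key"}


def scrub_event(event: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of an error event with sensitive request data removed."""
    cleaned = copy.deepcopy(event)  # never mutate the caller's event
    request = cleaned.get("request")
    if isinstance(request, dict):
        # Drop parsed cookies entirely.
        request.pop("cookies", None)
        headers = request.get("headers")
        if isinstance(headers, dict):
            # Header names are case-insensitive, so compare in lowercase.
            request["headers"] = {
                name: value
                for name, value in headers.items()
                if name.lower() not in SENSITIVE_HEADERS
            }
    return cleaned
```

A function of this shape could be adapted to an SDK hook that runs before events are sent, such as the `before_send` callback in Sentry's Python SDK, which receives the event and a hint.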

Frequently asked questions

What information should I provide for effective error analysis?
Provide the complete error message, full stack trace, timestamps, affected service names, recent deployment history, and any relevant log excerpts.
How do I debug errors in a distributed system?
Use correlation IDs to trace requests across services, examine distributed tracing data, identify the failure point, and trace backward to find the root cause.
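
A minimal sketch of the correlation-ID idea, using only the Python standard library. The names `correlation_id`, `JsonFormatter`, `build_logger`, and `handle_request` are hypothetical, and the log messages are placeholders. A real service would typically read the incoming ID from a request header and forward it on outbound calls.

```python
import contextvars
import json
import logging
import uuid

# Holds the correlation ID for the request currently being handled.
correlation_id = contextvars.ContextVar("correlation_id", default="-")


class JsonFormatter(logging.Formatter):
    """Render each record as one JSON object that includes the correlation ID."""

    def format(self, record: logging.LogRecord) -> str:
        payload = {
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
            "correlation_id": correlation_id.get(),
        }
        if record.exc_info:
            payload["exception"] = self.formatException(record.exc_info)
        return json.dumps(payload)


def build_logger(name: str, stream) -> logging.Logger:
    """Create a logger that writes JSON lines to the given stream."""
    logger = logging.getLogger(name)
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonFormatter())
    logger.handlers = [handler]
    logger.setLevel(logging.INFO)
    logger.propagate = False
    return logger


def handle_request(logger: logging.Logger, incoming_id=None) -> str:
    """Reuse the caller's correlation ID if present, otherwise mint a new one."""
    token = correlation_id.set(incoming_id or uuid.uuid4().hex)
    try:
        logger.info("order lookup started")
        logger.error("database connection timeout")
        return correlation_id.get()
    finally:
        # Restore the previous value so IDs never leak between requests.
        correlation_id.reset(token)
```

Because every line carries the same `correlation_id` field, logs from different services can be joined on that field to reconstruct the path of a single request.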
What observability tools does this skill support?
The skill covers Sentry, DataDog, OpenTelemetry, Prometheus, Grafana, Jaeger, and other major observability platforms.
Can this skill help implement error tracking?
Yes, the implementation playbook includes code samples for Sentry, DataDog, and other error tracking SDK integration.
How do I prioritize errors for fixing?
Prioritize by user impact, error frequency, severity, and whether the error indicates a systemic issue or single-point failure.
What is the recommended error handling strategy?
Implement defense in depth: retry logic with backoff, circuit breakers, proper error propagation, comprehensive logging, and automated alerting.
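
One piece of that strategy, the circuit breaker, can be sketched as below. This is a simplified, single-threaded illustration under assumed names (`CircuitBreaker`, `CircuitOpenError`) and assumed thresholds. A production version would need locking and would usually count failures over a time window.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


class CircuitOpenError(Exception):
    """Raised when the breaker rejects a call without attempting it."""


class CircuitBreaker:
    def __init__(
        self,
        failure_threshold: int = 3,
        reset_timeout: float = 30.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self.failure_threshold = failure_threshold
        self.reset_timeout = reset_timeout
        self._clock = clock
        self._failures = 0
        self._opened_at: Optional[float] = None

    @property
    def state(self) -> str:
        if self._opened_at is None:
            return "closed"
        if self._clock() - self._opened_at >= self.reset_timeout:
            return "half-open"  # timeout elapsed: allow a trial call
        return "open"

    def call(self, operation: Callable[[], T]) -> T:
        if self.state == "open":
            raise CircuitOpenError("circuit open; failing fast")
        try:
            result = operation()
        except Exception:
            self._failures += 1
            # A failed trial call, or too many consecutive failures, (re)opens the circuit.
            if self._opened_at is not None or self._failures >= self.failure_threshold:
                self._opened_at = self._clock()
            raise
        # Any success closes the circuit and clears the failure count.
        self._failures = 0
        self._opened_at = None
        return result
```

Failing fast while the circuit is open keeps callers from piling more load onto a dependency that is already struggling, which complements retries rather than replacing them.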

Developer details

File structure

📁 resources/

📄 implementation-playbook.md

📄 SKILL.md