技能 service-mesh-observability

🔭

service-mesh-observability

Name: service-mesh-observability
Author: wshobson

安全 🌐 網路存取⚙️ 外部命令📁 檔案系統存取

快速設定服務網格可觀測性

也可從以下取得: sickn33

服務網格遙測資料很難在追蹤、指標和儀表板之間串接。本技能提供 Istio 和 Linkerd 可觀測性的現成範本和查詢。

支援: Claude Codex Code(CC)

📊 69 充足

下載技能 ZIP

在 Claude 中上傳

前往設定 → 功能 → 技能 → 上傳技能

開啟並開始使用

測試它

正在使用「service-mesh-observability」。 Give me a concise checklist to enable Istio observability with Prometheus and Jaeger.

預期結果:

Deploy Prometheus with the Istio scrape config and ServiceMonitor.
Enable tracing in Istio and point Zipkin to Jaeger collector.
Install Jaeger all-in-one and expose the UI port.
Add PromQL panels for request rate, errors, and P99 latency.

正在使用「service-mesh-observability」。 How do I check which services are making the most requests?

預期結果:

Use Linkerd viz top command: linkerd viz top deploy/my-app
Or query Prometheus: sum(rate(istio_requests_total[5m])) by (destination_service_name)
Check the Grafana Istio dashboards for visualized request rates by service.

正在使用「service-mesh-observability」。 Set up alerting for high error rates.

預期結果:

Create a PrometheusRule with expression: sum(rate(istio_requests_total{response_code=~"5.."}[5m])) by (destination_service_name) / sum(rate(istio_requests_total[5m])) by (destination_service_name) > 0.05
Set for: 5m threshold to avoid alert flapping.
Label with severity: critical and include service name in summary.

安全審計

安全

v4 • 1/17/2026

Pure documentation skill containing YAML templates, PromQL queries, and CLI examples for service mesh observability. All static findings are false positives: the scanner misinterpreted PromQL metric names (containing 'md5', 'sha' substrings) as weak crypto, flagged documentation links as network IOCs, and misidentified YAML field names as path traversal. The content is static documentation that matches its stated purpose exactly.

已掃描檔案

579

分析行數

發現項

審計總數

風險因素

🌐 網路存取 (12)

skill-report.json:6 SKILL.md:259 SKILL.md:261 SKILL.md:263 SKILL.md:380 SKILL.md:381 SKILL.md:382 SKILL.md:383 SKILL.md:280 SKILL.md:282 SKILL.md:284 SKILL.md:296

⚙️ 外部命令 (17)

SKILL.md:23-34 SKILL.md:34-49 SKILL.md:49-85 SKILL.md:85-89 SKILL.md:89-109 SKILL.md:109-113 SKILL.md:113-156 SKILL.md:156-160 SKILL.md:160-179 SKILL.md:179-183 SKILL.md:183-240 SKILL.md:240-244 SKILL.md:244-264 SKILL.md:264-268 SKILL.md:268-320 SKILL.md:320-324 SKILL.md:324-361

📁 檔案系統存取 (1)

SKILL.md:203

審計者: claude 查看審計歷史 →

品質評分

架構

100

可維護性

內容

社群

100

安全

規範符合性

你能建構什麼

建立網格監控

使用範本為新的服務網格串接 Prometheus、Grafana 和追蹤功能。

調查延遲高峰

應用 PromQL 查詢和追蹤設定來找出高延遲服務。

定義網格 SLO

使用黃金信號指導來為服務建立 SLO 和警報規則。

試試這些提示

快速入門

列出在新叢集中啟用 Istio 指標和追蹤的最低限度步驟和範本。

儀表板

提供按服務分類的請求率、錯誤率和 P99 延遲的關鍵 PromQL 查詢。

追蹤部署

提供用於分散式追蹤的 IstioOperator 和 Jaeger 部署範例。

完整堆疊

將 Prometheus、Grafana、Jaeger、Kiali 和 OTel 範本組合成階段性部署計劃。

最佳實務

在開發環境中高取樣追蹤，在生產環境中降低取樣以控制成本。
在所有服務中使用一致的追蹤上下文傳播。
使用 PrometheusRule 中定義的清晰閾值對黃金信號進行警報。

避免

在 Prometheus 上收集高基數標籤而沒有限制。
在生產環境中預設執行 100% 追蹤。
在沒有服務依賴關係和拓撲儀表板的情況下運作。

常見問題

這與 Istio 和 Linkerd 相容嗎？

是的，它提供 Istio 和 Linkerd 可觀測性工作流程的範例。

這個技能的限制是什麼？

它僅提供範本和指導，不會部署或驗證配置。

我可以與 OpenTelemetry 整合嗎？

是的，它包含 OpenTelemetry Collector 配置和 Istio Telemetry 範例。

它會存取我的資料或憑證嗎？

不，它包含靜態文檔，不會存取檔案或環境資料。

如果我的指標遺失怎麼辦？

驗證 Prometheus 刮取目標、服務標籤和 Istio 遙測設定是否正確。

它與供應商工具相比如何？

它是供應商中立的，使用常見的開源元件和查詢。

開發者詳情

作者

wshobson

授權

MIT

儲存庫

https://github.com/wshobson/agents/tree/main/plugins/cloud-infrastructure/skills/service-mesh-observability

引用

main

檔案結構

📄 SKILL.md