firecrawl-agent

Name: firecrawl-agent
Author: firecrawl

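Low risk ⚙️ External commands 🌐 Network access

Autonomously extract structured data from websites

Crawling multiple pages by hand is time-consuming and demands technical skill. This AI agent navigates complex websites on its own and extracts structured data as JSON, with no custom scraping code to write.

Supports: Claude, Codex, Code (CC)

📊 69 Adequate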

Download the skill ZIP

Upload it in Claude

Go to Settings → Capabilities → Skills → Upload skill

Open it and start using it

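Test it

Using "firecrawl-agent". Extract pricing plans from example.com

Expected result:

A JSON file containing a tiers array, where each tier has name, price, and features

Using "firecrawl-agent". Extract products using the schema {name, price, category}

Expected result:

A JSON array of product objects with normalized fields, ready for database import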
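
As a rough illustration of the database-import step, the sketch below loads an array of that shape into an in-memory SQLite table. The sample records, the `products` table name, and the `import_products` helper are hypothetical and not part of the skill; real output depends on the site and on the schema you supply.

```python
import json
import sqlite3

# Hypothetical agent output: a JSON array of product objects.
SAMPLE_OUTPUT = """
[
  {"name": "Widget", "price": 9.99, "category": "tools"},
  {"name": "Gadget", "price": 24.5, "category": "electronics"}
]
"""


def import_products(conn: sqlite3.Connection, raw_json: str) -> int:
    """Insert product objects into a `products` table and return the row count."""
    products = json.loads(raw_json)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS products (name TEXT, price REAL, category TEXT)"
    )
    # Named placeholders map each dict key onto the matching column.
    conn.executemany(
        "INSERT INTO products (name, price, category) "
        "VALUES (:name, :price, :category)",
        products,
    )
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM products").fetchone()[0]


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    print(import_products(connection, SAMPLE_OUTPUT))
```

Named placeholders make a missing key fail loudly at insert time, which is usually preferable to silently importing incomplete rows.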

Security audit

Low risk

v1 • 4/10/2026

Static analysis detected 19 patterns but all are false positives. The flagged 'external_commands' are documentation examples in SKILL.md showing CLI usage, not actual code execution. Path traversal references are documentation links to sibling skill files. Weak crypto flags hit on YAML frontmatter. The skill uses properly scoped allowed-tools (Bash firecrawl * and npx firecrawl *) with explicit command allowlisting. Network access is inherent to web scraping functionality and is legitimate.

Low-risk issues (1)

SKILL.md:5-7

External Command Execution

Skill executes firecrawl CLI commands via Bash. Commands are allowlisted with wildcards (firecrawl *, npx firecrawl *) which provides scope control but could be expanded by installing malicious firecrawl package versions.

Risk factors

⚙️ External commands (13)

SKILL.md:22-31 SKILL.md:31-37 SKILL.md:37-38 SKILL.md:38-39 SKILL.md:39-40 SKILL.md:40-41 SKILL.md:41-42 SKILL.md:42-43 SKILL.md:43-44 SKILL.md:44-48 SKILL.md:48-49 SKILL.md:49-50 SKILL.md:50-51

🌐 Network access

No specific locations recorded

Audited by: claude

Quality score

Architecture

100

Maintainability

Content

Community

Security

Spec compliance

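What you can build

Competitor pricing analysis

Extract pricing plans and feature comparisons from competitor websites into structured JSON for analysis.

E-commerce product cataloging

Extract product listings (name, price, and description) from online stores for inventory comparison.

Directory data aggregation

Extract business listings, contact information, and metadata from directory websites.

Try these prompts

Basic data extraction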

Extract all pricing tiers from https://example.com/pricing and save as JSON

Schema-based extraction

Extract products from https://store.example.com using schema: {"type":"object","properties":{"name":{"type":"string"},"price":{"type":"number"},"category":{"type":"string"}}}
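
Before relying on schema-based output, it can help to check a returned object against the same schema. The following is a minimal sketch that handles only the `type` and `properties` keywords used in the prompt above; the `find_type_errors` helper and the sample objects in the usage note are hypothetical. A full validator such as the `jsonschema` package covers the rest of the JSON Schema specification.

```python
import json

# Schema copied from the prompt above.
SCHEMA = json.loads(
    '{"type":"object","properties":{"name":{"type":"string"},'
    '"price":{"type":"number"},"category":{"type":"string"}}}'
)

# Python types accepted for the two JSON Schema types used here.
TYPE_MAP = {"string": (str,), "number": (int, float)}


def find_type_errors(item: dict, schema: dict) -> list[str]:
    """Return one message per property whose value has the wrong type."""
    errors = []
    for field, rule in schema["properties"].items():
        if field not in item:
            continue  # this schema declares no "required" list
        value = item[field]
        expected = TYPE_MAP[rule["type"]]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected):
            errors.append(
                f"{field}: expected {rule['type']}, got {type(value).__name__}"
            )
    return errors
```

For example, `find_type_errors({"name": "Widget", "price": "9.99"}, SCHEMA)` reports the price, because the value arrived as a string rather than a number.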

Focused page extraction

Get feature list from https://example.com/features and https://example.com/enterprise, output to features.json

Batch extraction with a credit limit

Extract all customer reviews from https://reviews.example.com with max-credits 50, wait for completion and pretty-print output

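Best practices

Always use the --wait flag to receive results inline rather than just a job ID
Provide a JSON schema to get a predictable, structured output format
Set a --max-credits limit to control what an agent run can spend
For single-page extraction tasks, use the simpler scrape command
Review the extracted JSON before using it in production workflows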
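
The flag-related practices above can be combined into one invocation. The sketch below only assembles an argument list and never runs it. The flags are the ones named on this page, but the overall layout is an assumption: an `agent` subcommand, the prompt as a positional argument, and `--schema` taking inline JSON are all unverified here, and `build_agent_command` is a hypothetical helper. Check the CLI's own help output before relying on it.

```python
import json
import shlex


def build_agent_command(prompt, schema=None, max_credits=None, output=None):
    """Assemble an argument list for an agent run; nothing is executed here."""
    # ASSUMPTION: an `agent` subcommand that takes the prompt positionally.
    args = ["firecrawl", "agent", prompt, "--wait"]
    if schema is not None:
        # ASSUMPTION: --schema accepts the schema as an inline JSON string.
        args += ["--schema", json.dumps(schema, separators=(",", ":"))]
    if max_credits is not None:
        args += ["--max-credits", str(max_credits)]
    if output is not None:
        args += ["-o", output]
    return args


if __name__ == "__main__":
    command = build_agent_command(
        "Extract all pricing tiers from https://example.com/pricing",
        max_credits=50,
        output="pricing.json",
    )
    # shlex.join quotes each argument so the line can be pasted into a shell.
    print(shlex.join(command))
```

Keeping the command as a list rather than a single string avoids quoting mistakes when the prompt or schema contains spaces or braces.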

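Avoid

Running the agent without the --wait flag, which makes it easy to lose track of job results
Using the agent for simple single-page scraping when scrape is enough
Omitting a schema while expecting a consistent output structure
Running against untrusted or complex sites without a credit limit

Frequently asked questions

How long does an extraction task take?

Agent tasks typically finish in 2-5 minutes, depending on site complexity and the number of pages that need to be navigated.

What is the difference between agent and scrape?

Agent autonomously navigates multi-page sites and works out where the data lives. Scrape extracts from a single specified page. Agent is more powerful, but it is slower and consumes more credits.

Do I need a Firecrawl API key?

Yes, the firecrawl CLI requires API credentials. Make sure FIRECRAWL_API_KEY is set in your environment before using this skill.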
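
A wrapper script can fail early when the key is missing instead of letting an agent run start and then error out. This is a minimal sketch: `get_api_key` is a hypothetical helper, and the only detail taken from this page is the `FIRECRAWL_API_KEY` variable name.

```python
import os


def get_api_key(env=os.environ) -> str:
    """Return FIRECRAWL_API_KEY, or raise a clear error if it is unset or blank."""
    key = env.get("FIRECRAWL_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "FIRECRAWL_API_KEY is not set; export it before running the skill."
        )
    return key
```

Passing the environment as a parameter keeps the check easy to exercise with a plain dictionary in tests.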

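How do I control costs?

Use the --max-credits option to set a credit limit for each agent run. This caps spending for that specific extraction task.

What format is the output?

The output is JSON. Pass --schema to get structured output that matches your schema; without it, the output is free-form JSON. Use -o filename.json to save to a file, or --pretty for human-readable formatting.

Can I extract from multiple URLs?

Yes, use the --urls option to provide starting URLs. The agent navigates from those entry points to find and extract the data you need.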

Developer details

Author: firecrawl
License: MIT
Repository: https://github.com/firecrawl/cli/tree/main/skills/firecrawl-agent/
Ref: main

File structure

📄 SKILL.md