agent-browser

Name: agent-browser
Author: skillssh

Safe ⚙️ External commands 🌐 Network access 📁 Filesystem access

Automate web browsing with AI agents

Also available from: inference-sh-8, inference-shell, inference-sh-skills, inf-sh, inference-sh-0, inference-sh-9, inferencesh, inferen-sh, inference-skills, vercel-labs, qu-skills, infsh-skills, toolshell, tul-sh, supercent-io

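AI agents need to interact with websites but lack browser capabilities. This skill provides headless browser automation through inference.sh, so that Claude, Codex, and Claude Code can navigate pages, fill in forms, take screenshots, and record sessions.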

Supports: Claude, Codex, Claude Code (CC)

🥉 72 Bronze

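Download the skill ZIP

Upload it in Claude

Go to Settings → Capabilities → Skills → Upload skill

Open it and start using it

Test it

Using "agent-browser". Open https://example.com and identify the login form elements

Expected result:

Page loaded successfully. Found 3 interactive elements: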
@e1 [input type='text'] placeholder='Username'
@e2 [input type='password'] placeholder='Password'
@e3 [button] 'Sign In'
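
Output in this shape is easy to turn into structured data before deciding which element to act on. The following is a minimal sketch, assuming every element line looks like the sample above (`@eN`, a bracketed tag, then optional detail). The `Element` class and `parse_snapshot` function are hypothetical helpers for illustration and are not part of the skill.

```python
import re
from dataclasses import dataclass

# Assumed line shape, taken from the sample output:
#   @e1 [input type='text'] placeholder='Username'
LINE_RE = re.compile(r"^(@e\d+)\s+\[([^\]]+)\]\s*(.*)$")


@dataclass
class Element:
    ref: str     # element reference such as "@e1"
    tag: str     # text inside the square brackets
    detail: str  # anything after the closing bracket


def parse_snapshot(text: str) -> list[Element]:
    """Collect every line that looks like an element reference."""
    elements = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            ref, tag, detail = match.groups()
            elements.append(Element(ref, tag, detail))
    return elements


sample = """Page loaded successfully. Found 3 interactive elements:
@e1 [input type='text'] placeholder='Username'
@e2 [input type='password'] placeholder='Password'
@e3 [button] 'Sign In'"""

elements = parse_snapshot(sample)
for el in elements:
    print(el.ref, "|", el.tag, "|", el.detail)
```

Lines that do not start with an `@eN` reference, such as the summary sentence, are skipped rather than treated as errors.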

Using "agent-browser". Fill in and submit the login form with test credentials

Expected result:

Form submitted. The page redirected to the dashboard.
@e1 [h1] 'Welcome, Test User'
@e2 [nav] 'Dashboard | Settings | Logout'
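Screenshot captured.

Using "agent-browser". Take a screenshot of the dashboard

Expected result:

Screenshot saved to dashboard-20240101.png
Page title: Dashboard | Size: 1280x720
The dashboard contains: navigation menu, user profile card, data table, action buttons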

Security audit

Safe

v1 • 4/22/2026

All static findings are false positives. The skill uses the inference.sh CLI (infsh) to control a headless browser via documented command invocations. External command detections are hardcoded API calls to a legitimate service. Network detections are target URLs for browsing, not exfiltration. Filesystem detections are documentation navigation (../) and standard device paths. Password/crypto detections are documentation showing credential input handling, not cryptography.

Files scanned

2,313

Lines analyzed

Findings

Total audits

Risk factors

⚙️ External commands (4)

SKILL.md:21-22 references/authentication.md:24-26 references/commands.md:10-11 templates/authenticated-session.sh:40-43

🌐 Network access (4)

SKILL.md:9 SKILL.md:37 references/authentication.md:25 references/commands.md:25

📁 Filesystem access (2)

SKILL.md:195-200 references/authentication.md:5

Audited by: claude

Quality score

Architecture

100

Maintainability

Content

Community

100

Security

Spec compliance

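What you can build

Research and data extraction

AI agents browse websites to gather information, extract structured data from pages, and compile research reports without manual browsing.

Automated form submission

AI agents fill in and submit web forms for tasks such as booking appointments, registering accounts, or completing batch data entry.

Browser testing

QA engineers use AI agents to navigate sites, take screenshots, and record test sessions to verify UI functionality.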

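Try these prompts

Basic page navigation

Use the agent-browser skill to open https://example.com and show all clickable elements on the page.

Form-filling workflow

Open the contact form at https://example.com/contact. Enter the name "John Doe" and the email "john@example.com", then submit the form. Take a screenshot of the result.

Data extraction from an authenticated session

Log in to https://app.example.com using credentials from environment variables. Navigate to the dashboard, extract all table data, and save a screenshot of the final page.

Multi-page research session

Record a video while browsing example.com/products. Click through 5 products, fill in the inquiry form for the last product, then close the session to save the recording.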

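Best practices

Always take a fresh snapshot after a page loads; element references expire
Store credentials in environment variables; never hardcode passwords in scripts
Close the session when finished; video recordings are saved only after close is called

Avoid

Do not cache element references across pages; always take a snapshot after navigating
Do not hardcode credentials; use environment variables such as $APP_USERNAME and $APP_PASSWORD
Do not skip the wait after an action; let the page load fully before interacting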
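
The credential advice can be illustrated with a short sketch. It assumes the variable names APP_USERNAME and APP_PASSWORD mentioned above. The `load_credentials` helper is hypothetical and is not part of the skill or its templates.

```python
import os


def load_credentials(env=os.environ):
    """Read login credentials from APP_USERNAME and APP_PASSWORD.

    Fails early with a clear message instead of falling back to a
    hardcoded value when a variable is unset or empty.
    """
    names = ("APP_USERNAME", "APP_PASSWORD")
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return env["APP_USERNAME"], env["APP_PASSWORD"]


# Demonstration with an explicit mapping, so the sketch runs without
# touching the real environment.
username, password = load_credentials(
    {"APP_USERNAME": "test-user", "APP_PASSWORD": "test-pass"}
)
print(username)
```

Passing the mapping as a parameter keeps the helper testable; in a real script the default `os.environ` would be used.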

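FAQ

What is inference.sh, and do I need to install it?

Yes, inference.sh is required. It provides the CLI (infsh) that runs the browser automation. Install it by following the instructions at raw.githubusercontent.com/inference-sh/skills/main/cli-install.md.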

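Why do element references (such as @e1) become invalid?

Element references become invalid after page navigation, DOM changes, or dynamic content loads. Always take a new snapshot after any of these events to get fresh references.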
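
One way to guard against acting on a stale reference is to track which references came from the most recent snapshot. This is a toy model under that assumption: `RefTracker` and `StaleRefError` are hypothetical names, and nothing here calls the skill itself.

```python
class StaleRefError(Exception):
    """Raised when a reference does not belong to the latest snapshot."""


class RefTracker:
    def __init__(self):
        self._valid = set()

    def record_snapshot(self, refs):
        """Replace the known references with those from a fresh snapshot."""
        self._valid = set(refs)

    def invalidate(self):
        """Call after navigation, a DOM change, or a dynamic content load."""
        self._valid.clear()

    def require(self, ref):
        """Return ref if it is still valid, otherwise raise StaleRefError."""
        if ref not in self._valid:
            raise StaleRefError(f"{ref} is stale; take a new snapshot first")
        return ref


tracker = RefTracker()
tracker.record_snapshot(["@e1", "@e2", "@e3"])
print(tracker.require("@e3"))  # accepted: @e3 came from the latest snapshot

tracker.invalidate()           # for example, the page navigated after a click
try:
    tracker.require("@e3")
except StaleRefError as exc:
    print(exc)
```

The model is deliberately strict: any invalidating event drops every reference, which matches the advice to re-snapshot rather than guess which references survived.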

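How do I handle logins on protected sites?

Use the agent-browser skill to automate the login flow once, then reuse the session ID for subsequent authenticated requests. The authentication.md reference file describes this pattern.

Can I record a browser session as video?

Yes. Enable record_video: true in the open function, then call close to retrieve the video file. Set show_cursor: true to display the cursor for clearer demos.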
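
As a rough illustration, the input for such an open call could be assembled as follows. Only `record_video` and `show_cursor` come from the answer above; the `url` field name and the `build_open_input` helper are assumptions made for this sketch, so check commands.md and video-recording.md for the actual schema.

```python
import json


def build_open_input(url, record_video=False, show_cursor=False):
    """Assemble the input for an open call as a plain dictionary."""
    return {
        "url": url,                    # assumed field name
        "record_video": record_video,  # named in the FAQ answer
        "show_cursor": show_cursor,    # named in the FAQ answer
    }


open_input = build_open_input(
    "https://example.com", record_video=True, show_cursor=True
)
print(json.dumps(open_input, indent=2))
```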

How do I upload a file through the browser?

Use the upload action with a file_paths array. The reference must point to a file input element. Example: {action: upload, ref: @e5, file_paths: ['/path/to/file.pdf']}
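
The example object can be built with a few sanity checks before it is sent. This is a sketch only: the keys `action`, `ref`, and `file_paths` come from the example above, while the `build_upload_action` helper and its validation rules are hypothetical.

```python
import json


def build_upload_action(ref, file_paths):
    """Build an upload action in the shape shown in the FAQ example."""
    if not ref.startswith("@e"):
        raise ValueError("ref should be an element reference such as @e5")
    if isinstance(file_paths, str):
        # A bare string would otherwise be split into single characters.
        raise TypeError("file_paths must be a list of paths, not a string")
    paths = list(file_paths)
    if not paths:
        raise ValueError("file_paths must contain at least one path")
    return {"action": "upload", "ref": ref, "file_paths": paths}


upload_action = build_upload_action("@e5", ["/path/to/file.pdf"])
print(json.dumps(upload_action))
```

The helper cannot confirm that the reference points to a file input element; that still has to be checked against the latest snapshot.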

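What happens if the browser session times out?

Sessions do not persist across server restarts. Handle errors gracefully and restart the workflow when necessary. If close is not called before the timeout, the video recording is lost.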
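
A restart-on-timeout loop that always attempts close might look like the sketch below. It assumes a session object with a `close()` method and a workflow that raises `TimeoutError` when the session expires. Both are stand-ins, since the real error type and session interface depend on how the skill is invoked.

```python
def run_with_retry(start_session, workflow, attempts=3):
    """Run workflow(session), restarting from scratch after a timeout."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for _ in range(attempts):
        session = start_session()
        try:
            return workflow(session)
        except TimeoutError as exc:
            last_error = exc   # remember the failure and try again
        finally:
            session.close()    # always attempt close so a recording can be saved
    raise last_error


# Stand-in session and workflow used only to exercise the loop.
class DemoSession:
    closed_count = 0

    def close(self):
        DemoSession.closed_count += 1


calls = {"n": 0}


def flaky_workflow(session):
    calls["n"] += 1
    if calls["n"] == 1:
        raise TimeoutError("session timed out")
    return "done"


result = run_with_retry(DemoSession, flaky_workflow)
print(result, DemoSession.closed_count)
```

Each attempt starts a new session instead of reusing the old one, which follows from the note that sessions do not survive a restart.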

Developer details

Author

skillssh

License

MIT

Repository

https://github.com/skillssh/skills/tree/main/tools/utilities/agent-browser/

Ref

main

File structure

📁 references/

📄 authentication.md

📄 commands.md

📄 proxy-support.md

📄 session-management.md

📄 snapshot-refs.md

📄 video-recording.md

📁 templates/

📄 authenticated-session.sh

📄 capture-workflow.sh

📄 form-automation.sh

📄 SKILL.md