المهارات agent-browser
🌐

agent-browser

مخاطر منخفضة ⚙️ الأوامر الخارجية🌐 الوصول إلى الشبكة📁 الوصول إلى نظام الملفات

Automate Browser Tasks with AI Agents

متاح أيضًا من: vercel-labs,inferencesh,inf-sh

This skill enables AI agents to automate browser interactions including navigation, form filling, data extraction, and visual recording through a simple @e reference system.

يدعم: Claude Codex Code(CC)
🥉 73 برونزي
1

تنزيل ZIP المهارة

2

رفع في Claude

اذهب إلى Settings → Capabilities → Skills → Upload skill

3

فعّل وابدأ الاستخدام

اختبرها

استخدام "agent-browser". Open https://example.com and get elements

النتيجة المتوقعة:

Session started with ID: abc123. Elements found: @e1 [a] "Example Domain", @e2 [h1] "Example Domain", @e3 [p] "This domain is for use..."

استخدام "agent-browser". Fill login form and submit

النتيجة المتوقعة:

Filled @e1 with 'user@example.com', filled @e2 with 'password123', clicked @e3. Navigation detected. Re-snapshot recommended.

استخدام "agent-browser". Take screenshot

النتيجة المتوقعة:

Screenshot captured: {base64 encoded image data}, dimensions: 1920x1080

التدقيق الأمني

مخاطر منخفضة
v1 • 2/15/2026

This is a legitimate browser automation skill using Playwright via inference.sh CLI. The static scanner flagged 606 potential issues, but upon evaluation, all findings are false positives: external_commands detections are bash command examples in documentation, network detections are legitimate test URLs, and the heuristic 'dangerous combination' flag describes expected browser automation behavior (network access + code execution + session management). This skill provides standard web automation functionality for AI agents.

10
الملفات التي تم فحصها
2,310
الأسطر التي تم تحليلها
6
النتائج
1
إجمالي عمليات التدقيق
مشكلات منخفضة المخاطر (3)
Documentation Contains Shell Command Examples
The skill documentation includes bash scripts showing CLI usage. Command substitution syntax is for examples, not actual execution.
Example URLs in Documentation
Documentation contains test URLs like example.com and google.com - standard practice for technical documentation.
Browser Session Management
Skill manages browser sessions including cookies and authentication state. This is core browser automation functionality, not credential access vulnerability.

عوامل الخطر

⚙️ الأوامر الخارجية (5)
🌐 الوصول إلى الشبكة (2)
📁 الوصول إلى نظام الملفات (1)
تم تدقيقه بواسطة: claude

درجة الجودة

45
الهندسة المعمارية
100
قابلية الصيانة
85
المحتوى
50
المجتمع
84
الأمان
91
الامتثال للمواصفات

ماذا يمكنك بناءه

Automated Form Submission

Fill and submit web forms programmatically for data entry automation

Web Content Extraction

Navigate websites and extract structured data for research or monitoring

Visual Browser Testing

Record browser interactions to create visual documentation or debug issues

جرّب هذه الموجهات

Open and Navigate
Use the agent-browser skill to open [URL] and get the element references for all interactive elements on the page.
Fill Form Fields
Using session [SESSION_ID], fill the form field with ref @e1 with the text '[TEXT]', then click the button with ref @e2.
Capture Screenshot
Take a full-page screenshot of the current session [SESSION_ID] and save it.
Extract Data After Action
Click element @e1 to navigate to the next page, then take a snapshot and extract all heading text from the new page.

أفضل الممارسات

  • Always re-snapshot after navigation to get fresh element references
  • Use explicit waits after actions that trigger async loading
  • Close sessions when done to free browser resources
  • Enable video recording only when needed for debugging

تجنب

  • Using element refs from a previous page without re-snapshotting
  • Not handling navigation after click actions
  • Leaking session IDs or not managing session lifecycle
  • Attempting to interact with elements that may not be visible

الأسئلة المتكررة

What is the @e reference system?
Elements on the page are returned with @e prefixes like @e1, @e2. These refs are used to identify which element to interact with. Refs change after each snapshot.
Why do I need to re-snapshot after navigation?
When you navigate to a new page, the DOM changes completely. Previous element refs become invalid and new refs must be fetched.
Can this skill bypass login pages?
The skill can automate login forms but cannot bypass authentication. It can help maintain logged-in sessions for subsequent operations.
Is video recording always enabled?
No, video recording must be explicitly enabled with record_video: true in the open function. It is off by default.
What proxy options are available?
The skill supports HTTP, HTTPS, and SOCKS5 proxies with optional authentication via proxy_username and proxy_password.
Can I execute custom JavaScript?
Yes, use the execute function to run JavaScript code in the browser context. The result is returned as a string.

تفاصيل المطور

المؤلف

inference-sh-0

الترخيص

MIT

مرجع

main