agent-browser

Name: agent-browser
Author: inference-sh-0

Low Risk: ⚙️ External commands, 🌐 Network access, 📁 Filesystem access

Automate Browser Tasks with AI Agents

Also available from: inference-sh-8, inference-shell, inference-sh-skills, inf-sh, inference-sh-9, inferencesh, skillssh, inferen-sh, inference-skills, vercel-labs, qu-skills, infsh-skills, toolshell, tul-sh, supercent-io

This skill enables AI agents to automate browser interactions including navigation, form filling, data extraction, and visual recording through a simple @e reference system.

Supports: Claude, Codex, Code (CC)

⚠️ 68 Poor

1. Download the skill ZIP.

2. Upload it in Claude: go to Settings → Capabilities → Skills → Upload skill.

3. Toggle it on and start using it.

Test it

Using "agent-browser". Open https://example.com and get elements

Expected outcome:

Session started with ID: abc123. Elements found: @e1 [a] "Example Domain", @e2 [h1] "Example Domain", @e3 [p] "This domain is for use..."
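The element list in the expected outcome above follows a regular shape (`@eN [tag] "text"`), so it can be split into structured records. The following is a minimal Python sketch that assumes the output is formatted exactly as in the sample; `parse_elements` and `ElementRef` are hypothetical names for illustration, not part of the skill.

```python
import re
from typing import NamedTuple


class ElementRef(NamedTuple):
    ref: str   # e.g. "@e1"
    tag: str   # e.g. "a"
    text: str  # visible text as reported in the snapshot


# Matches entries shaped like: @e1 [a] "Example Domain"
# The format is inferred from the sample output, not from a specification.
_ELEMENT_RE = re.compile(r'(@e\d+)\s+\[(\w+)\]\s+"([^"]*)"')


def parse_elements(output: str) -> list[ElementRef]:
    """Extract (ref, tag, text) records from a snapshot summary string."""
    return [ElementRef(*m.groups()) for m in _ELEMENT_RE.finditer(output)]


sample = (
    'Session started with ID: abc123. Elements found: '
    '@e1 [a] "Example Domain", @e2 [h1] "Example Domain", '
    '@e3 [p] "This domain is for use..."'
)
elements = parse_elements(sample)
for el in elements:
    print(el.ref, el.tag, el.text)
```

Keeping the ref, tag, and text together makes it easier to pick a target by tag or label instead of hard-coding a ref number.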

Using "agent-browser". Fill login form and submit

Expected outcome:

Filled @e1 with 'user@example.com', filled @e2 with 'password123', clicked @e3. Navigation detected. Re-snapshot recommended.

Using "agent-browser". Take screenshot

Expected outcome:

Screenshot captured: {base64 encoded image data}, dimensions: 1920x1080

Security Audit

Low Risk

v1 • 2/15/2026

This is a legitimate browser automation skill using Playwright via inference.sh CLI. The static scanner flagged 606 potential issues, but upon evaluation, all findings are false positives: external_commands detections are bash command examples in documentation, network detections are legitimate test URLs, and the heuristic 'dangerous combination' flag describes expected browser automation behavior (network access + code execution + session management). This skill provides standard web automation functionality for AI agents.

Files scanned

2,310

Lines analyzed

findings

Total audits

Low Risk Issues (3)

SKILL.md:34-37 SKILL.md:114-118 SKILL.md:213-215

Documentation Contains Shell Command Examples

The skill documentation includes bash scripts showing CLI usage. The command-substitution syntax appears only in these examples and is not executed by the skill itself.

SKILL.md:9-11 SKILL.md:20

Example URLs in Documentation

The documentation contains test URLs such as example.com and google.com, which is standard practice for technical documentation.

references/session-management.md:20-21 references/session-management.md:67-69

Browser Session Management

The skill manages browser sessions, including cookies and authentication state. This is core browser automation functionality, not a credential-access vulnerability.

Risk Factors

⚙️ External commands (5)

SKILL.md:34-37 SKILL.md:114-118 SKILL.md:213-215 references/authentication.md:20-21 references/commands.md:9-11

🌐 Network access (2)

SKILL.md:9-11 SKILL.md:35

📁 Filesystem access (1)

SKILL.md:162-163

Audited by: claude

Quality Score

Architecture

100

Maintainability

Content

Community

Security

Spec Compliance

What You Can Build

Automated Form Submission

Fill and submit web forms programmatically for data entry automation

Web Content Extraction

Navigate websites and extract structured data for research or monitoring

Visual Browser Testing

Record browser interactions to create visual documentation or debug issues

Try These Prompts

Open and Navigate

Use the agent-browser skill to open [URL] and get the element references for all interactive elements on the page.

Fill Form Fields

Using session [SESSION_ID], fill the form field with ref @e1 with the text '[TEXT]', then click the button with ref @e2.

Capture Screenshot

Take a full-page screenshot of the current session [SESSION_ID] and save it.

Extract Data After Action

Click element @e1 to navigate to the next page, then take a snapshot and extract all heading text from the new page.

Best Practices

Always re-snapshot after navigation to get fresh element references
Use explicit waits after actions that trigger async loading
Close sessions when done to free browser resources
Enable video recording only when needed for debugging
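The first and third practices above (re-snapshot after navigation, always close the session) can be sketched as a control-flow pattern. The Python example below uses `FakeBrowser`, an in-memory stand-in written for this illustration; it is not the skill's real API, and a real browser may silently act on the wrong element rather than raise an error when a stale ref is used.

```python
class StaleRefError(Exception):
    """Raised when a ref from an outdated snapshot is used."""


class FakeBrowser:
    """In-memory stand-in for a browser session (NOT the skill's real API)."""

    def __init__(self) -> None:
        self.closed = False
        self._generation = 0  # bumped on every simulated navigation

    def snapshot(self) -> dict[str, int]:
        # Map each ref to the page generation it was captured on.
        return {"@e1": self._generation, "@e2": self._generation}

    def click(self, ref: str, refs: dict[str, int]) -> None:
        if refs.get(ref) != self._generation:
            raise StaleRefError(f"{ref} belongs to an earlier page")
        self._generation += 1  # pretend every click navigates

    def close(self) -> None:
        self.closed = True


browser = FakeBrowser()
try:
    refs = browser.snapshot()
    browser.click("@e1", refs)   # navigation happens here
    refs = browser.snapshot()    # re-snapshot: the old refs are now stale
    browser.click("@e2", refs)   # safe, refs match the current page
finally:
    browser.close()              # release the session even if a step fails
```

Putting `close()` in a `finally` block means the session is released on both the success path and the error path.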

Avoid

Using element refs from a previous page without re-snapshotting
Not handling navigation after click actions
Leaking session IDs or not managing session lifecycle
Attempting to interact with elements that may not be visible

Frequently Asked Questions

What is the @e reference system?

Each snapshot returns the page's elements labeled with @e-prefixed refs such as @e1 and @e2. Pass a ref to an action to identify which element to interact with. Refs change after each snapshot.

Why do I need to re-snapshot after navigation?

When you navigate to a new page, the DOM changes completely. Previous element refs become invalid and new refs must be fetched.

Can this skill bypass login pages?

The skill can automate login forms but cannot bypass authentication. It can help maintain logged-in sessions for subsequent operations.

Is video recording always enabled?

No. Video recording is off by default and must be explicitly enabled with record_video: true in the open function.
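As a rough illustration of opting in, the Python sketch below builds a JSON input for an open call. Only `record_video` comes from the answer above; the `url` key, the flat JSON shape, and the helper name `build_open_input` are assumptions made for this example and should be checked against the skill's own documentation.

```python
import json


def build_open_input(url: str, record_video: bool = False) -> str:
    """Serialize arguments for an open call as JSON.

    `record_video` is named in the FAQ; the `url` key and the flat
    JSON shape are assumptions made for this sketch.
    """
    payload = {"url": url, "record_video": record_video}
    return json.dumps(payload)


# Recording stays off unless the caller asks for it.
default_input = build_open_input("https://example.com")
recording_input = build_open_input("https://example.com", record_video=True)
print(default_input)
print(recording_input)
```

Defaulting the flag to `False` in the helper mirrors the off-by-default behavior, so recording only happens when requested for debugging.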

What proxy options are available?

The skill supports HTTP, HTTPS, and SOCKS5 proxies with optional authentication via proxy_username and proxy_password.
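A small validation helper can catch an unsupported proxy scheme before a session is opened. In the Python sketch below, `proxy_username` and `proxy_password` are the names given above; the `proxy_url` key and the helper `build_proxy_options` are hypothetical, and the credentials shown are placeholders.

```python
from typing import Optional
from urllib.parse import urlsplit

# Schemes listed in the answer above.
SUPPORTED_SCHEMES = {"http", "https", "socks5"}


def build_proxy_options(
    proxy_url: str,
    username: Optional[str] = None,
    password: Optional[str] = None,
) -> dict:
    """Validate the proxy scheme and collect optional credentials.

    The `proxy_url` key is an assumption; only `proxy_username` and
    `proxy_password` are named in the source text.
    """
    scheme = urlsplit(proxy_url).scheme.lower()
    if scheme not in SUPPORTED_SCHEMES:
        raise ValueError(f"unsupported proxy scheme: {scheme!r}")

    options = {"proxy_url": proxy_url}
    if username is not None:
        options["proxy_username"] = username
    if password is not None:
        options["proxy_password"] = password
    return options


# Placeholder host and credentials for illustration only.
anonymous = build_proxy_options("socks5://127.0.0.1:1080")
authenticated = build_proxy_options(
    "http://proxy.example.com:8080", username="user", password="secret"
)
```

In practice the credentials would come from an environment variable or secret store, not from literals in the script.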

Can I execute custom JavaScript?

Yes. Use the execute function to run JavaScript in the browser context; the result is returned as a string.
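Because the result comes back as a string, one workable pattern is to have the script return `JSON.stringify(...)` and decode it on the caller's side. The Python sketch below shows only the decoding half with a simulated return value; how the script is passed to the execute function, and whether it must be an expression or use `return`, are not specified here, and `decode_execute_result` is a hypothetical helper.

```python
import json

# JavaScript intended for the page. JSON.stringify makes the string result
# unambiguous to decode. How this is handed to the execute function is an
# assumption left open in this sketch.
SCRIPT = """
JSON.stringify(
  Array.from(document.querySelectorAll("h1, h2")).map(h => h.textContent.trim())
)
"""


def decode_execute_result(raw: str):
    """Decode a string result, falling back to the raw text if it is not JSON."""
    try:
        return json.loads(raw)
    except json.JSONDecodeError:
        return raw


# Simulated return value; a real one would come from the execute function.
raw_result = '["Example Domain"]'
headings = decode_execute_result(raw_result)
print(headings)
```

The fallback branch keeps plain-text results usable when a script returns something other than JSON.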