스킬 gemini-computer-use

📦

gemini-computer-use

Name: gemini-computer-use
Author: am-will

중간 위험 ⚡ 스크립트 포함🌐 네트워크 접근🔑 환경 변수⚙️ 외부 명령어

Gemini 브라우저 제어 에이전트 구축

브라우저 자동화 에이전트를 안전하고 일관되게 연결하는 일은 어렵습니다. 이 스킬은 Gemini Computer Use 워크플로를 위한 설정 단계와 Playwright 루프를 제공합니다.

지원: Claude Codex Code(CC)

⚠️ 50 나쁨

스킬 ZIP 다운로드

Claude에서 업로드

설정 → 기능 → 스킬 → 스킬 업로드로 이동

토글을 켜고 사용 시작

Agent용 리소스

AI Agent, 크롤러 또는 스크립트가 전체 페이지 대신 깔끔한 컨텍스트가 필요할 때 이 링크를 사용하세요.

Markdown 상세 GET /skills/am-will-gemini-computer-use.md 서명된 manifest GET /api/skills/am-will-gemini-computer-use/manifest 서명된 lockfile GET /api/skills/am-will-gemini-computer-use/lockfile

테스트해 보기

"gemini-computer-use" 사용 중입니다. 웹사이트에서 최신 공개 블로그 게시물 제목을 찾는 브라우저 작업을 실행하세요.

예상 결과:

에이전트를 실행하기 위한 환경 설정, 시작 URL, 프롬프트, 턴 제한이 포함된 짧은 터미널 세션 계획입니다.

"gemini-computer-use" 사용 중입니다. 에이전트가 텍스트를 입력하거나 키보드 단축키를 사용할 수 없도록 제한하세요.

예상 결과:

권장 명령 구성과, 입력 및 키 조합 작업을 모델이 사용할 수 없게 된다는 설명입니다.

"gemini-computer-use" 사용 중입니다. 인증된 관리자 페이지에서 에이전트를 사용할 수 있는지 평가하세요.

예상 결과:

스크린샷 공유, 계정 권한, 샌드박스 프로필, 제외된 작업, 필요한 사람의 확인을 다루는 위험 중심 체크리스트입니다.

보안 감사

중간 위험

v2 • 6/28/2026

Static findings for Markdown backticks, weak cryptography, and certificate files are false positives after context review. The confirmed risk is legitimate but elevated: the Python script sends browser screenshots and URLs to Gemini, then executes model-selected browser actions through Playwright.

스캔된 파일

475

분석된 줄 수

발견 사항

총 감사 수

중간 위험 문제 (3)

scripts/computer_use_agent.py:251-270 scripts/computer_use_agent.py:309-329 scripts/computer_use_agent.py:359-369

Browser Screenshots Sent to External AI Service

The script captures browser screenshots and sends them with the user prompt to the Gemini API. This is required for the skill, but it can expose sensitive page content, account data, or private URLs if used on authenticated sites.

scripts/computer_use_agent.py:147-248 references/google-computer-use.md:8-21

Model-Directed Browser Actions

The script executes Gemini function calls as Playwright actions, including navigation, clicking, typing, key combinations, scrolling, and drag operations. This can affect websites or accounts when the browser is authenticated.

scripts/computer_use_agent.py:130-158 scripts/computer_use_agent.py:244-246 SKILL.md:46-50

Safety Confirmation Depends on Model Signal

The script prompts for confirmation only when the model includes a safety_decision that requires confirmation. There is no independent local policy check for risky clicks, typed text, downloads, purchases, or account changes.

낮은 위험 문제 (4)

env.example:2 scripts/computer_use_agent.py:72-79 scripts/computer_use_agent.py:277-279

Environment Variable Access Is Expected

The static API key findings are true for environment access but not evidence of hardcoded or stolen credentials. The sample env file contains empty placeholders, and the script uses GEMINI_API_KEY to initialize the Google client.

SKILL.md:12-16 SKILL.md:20-24 SKILL.md:29-34 references/google-computer-use.md:3-6

Markdown Backtick Command Findings Are False Positives

The Ruby or shell backtick findings in documentation are Markdown formatting or ordinary setup commands. They do not indicate hidden runtime code execution inside the skill files.

scripts/computer_use_agent.py:37 scripts/computer_use_agent.py:194-201 scripts/computer_use_agent.py:234-238 SKILL.md:3 references/google-computer-use.md:6

Weak Crypto and Certificate Findings Are False Positives

The static weak cryptography and certificate or key-file detections do not match cryptographic operations. The flagged script lines are argument descriptions, mouse coordinates, and keyboard actions.

env.example:8-10 scripts/computer_use_agent.py:281-291 SKILL.md:38-42

Configurable Browser Executable Needs User Trust

The script can launch a browser executable path provided through COMPUTER_USE_BROWSER_EXECUTABLE. This is useful for custom Chromium browsers, but users should avoid sourcing untrusted env files.

위험 요인

⚡ 스크립트 포함 (1)

scripts/computer_use_agent.py:1-377

🌐 네트워크 접근 (4)

scripts/computer_use_agent.py:13-14 scripts/computer_use_agent.py:251-270 scripts/computer_use_agent.py:309-329 SKILL.md:30-33

🔑 환경 변수 (5)

env.example:2 env.example:6 env.example:10 scripts/computer_use_agent.py:72-79 scripts/computer_use_agent.py:281-289

⚙️ 외부 명령어 (4)

SKILL.md:12-16 SKILL.md:20-24 SKILL.md:29-34 scripts/computer_use_agent.py:291

감지된 패턴

Networked Agent Loop With ScreenshotsAutomated UI Control From Model Output

감사자: codex 감사 이력 보기 →

품질 점수

아키텍처

100

유지보수성

콘텐츠

커뮤니티

보안

사양 준수

만들 수 있는 것

브라우저 에이전트 프로토타입 만들기

Gemini가 페이지를 검사하고 Chromium 브라우저를 조작할 수 있게 하는 작동 가능한 개념 증명을 구축합니다.

Computer Use 워크플로 테스트

제어된 브라우저 작업을 실행하고 스크린샷, 함수 호출, 응답이 루프를 통해 어떻게 이동하는지 검사합니다.

안전 제어 평가

더 넓은 배포 전에 제외된 작업, 확인 프롬프트, 샌드박스 브라우저 프로필을 실험합니다.

이 프롬프트를 사용해 보세요

간단한 브라우저 작업 실행

이 스킬을 사용해 공개 웹사이트에서 Gemini Computer Use를 실행하세요. [URL]에서 시작해 [task]를 완료하고, [turn count]턴을 넘지 않게 사용하세요.

위험한 작업 제한

[goal]을 위해 Computer Use 에이전트를 구성하세요. 다음 작업을 제외하세요: [actions]. 제외 설정이 브라우저 동작을 어떻게 바꾸는지 설명하세요.

에이전트 루프 조정

[project need]에 맞게 Playwright 에이전트 루프를 업데이트하세요. 스크린샷 피드백, 함수 응답, 안전 확인 처리는 유지하세요.

배포 계획 검토

[site type]에서 이 Computer Use 에이전트를 실행하려는 제 계획을 검토하세요. 개인정보 위험, 브라우저 샌드박스 설정, 제외된 작업, 사람의 승인 지점을 식별하세요.

모범 사례

필요한 최소 계정 권한만 가진 새 프로필에서 브라우저를 실행하세요.
작업에 필요하지 않은 입력, 탐색, 클릭에는 제외 작업을 사용하세요.
비밀 정보, 고객 데이터, 결제 또는 되돌릴 수 없는 계정 작업이 있는 페이지는 피하세요.

피하기

기본적으로 민감한 계정에 로그인된 브라우저 프로필 안에서 에이전트를 실행하지 마세요.
신뢰할 수 없는 저장소나 공유 폴더에서 env 파일을 가져오지 마세요.
모델 안전 확인을 완전한 정책 제어로 간주하지 마세요.

자주 묻는 질문

이 스킬에는 실행 가능한 코드가 포함되어 있나요?

예. Playwright를 통해 Gemini Computer Use 루프를 실행하는 Python 스크립트가 포함되어 있습니다.

어떤 AI 도구가 이 스킬을 사용할 수 있나요?

메타데이터에는 Claude, Codex, Claude Code가 지원 도구로 나열되어 있습니다.

API 키가 필요한가요?

예. 스크립트는 실행 전에 환경에 GEMINI_API_KEY가 있을 것으로 예상합니다.

브라우저 데이터를 제 컴퓨터 밖으로 보내나요?

예. 프롬프트, 스크린샷, 현재 페이지 컨텍스트를 Gemini API로 보냅니다.

로그인된 브라우저 세션을 제어할 수 있나요?

제공한 모든 브라우저 컨텍스트를 조작할 수 있습니다. 제한된 액세스 권한이 있는 별도 프로필을 사용하세요.

게시해도 안전한가요?

브라우저 자동화와 스크린샷 공유가 예상되는 기능이지만 민감하므로, 중간 위험 경고와 함께 게시할 수 있습니다.

개발자 세부 정보

작성자

am-will

라이선스

MIT

리포지토리

https://github.com/am-will/codex-skills/tree/main/skills/gemini-computer-use/

참조

main

파일 구조

📁 references/

📄 google-computer-use.md

📁 scripts/

📄 computer_use_agent.py

📄 env.example

📄 SKILL.md