📦

Audit History

ai-visual-accuracy-check - 6 audits

Audit version 6

Latest Medium Risk

Jun 28, 2026, 04:03 AM

The static analyzer's Ruby backtick, weak cryptography, and system reconnaissance findings are false positives caused by Markdown code fences, filenames, and ordinary prose. The skill still has medium operational risk because its intended workflow reads local HTML and image files, renders screenshots with a headless browser, and sends visual content to Claude for analysis.

1
Files scanned
385
Lines analyzed
8
findings
codex
Audited by
Medium Risk Issues (2)
Third-Party AI Visual Analysis Sends Document Images
The workflow instructs the assistant to attach the original PDF PNG and rendered HTML screenshot to Claude for comparison. This is legitimate for the skill, but it may expose sensitive document pages to an external AI service if users provide confidential PDFs.
Local File Rendering and Report Output
The skill reads local HTML and PDF page image files, renders HTML through a headless browser, and saves a report under an output directory. This is expected behavior, but users should scope input and output paths to the project workspace.
Low Risk Issues (3)
Static Ruby Backtick Findings Are Markdown False Positives
The external command detections point to Markdown inline code, fenced code blocks, examples, diagrams, and file paths. I did not find Ruby code, shell backtick execution, or user-controlled command construction in the reviewed file.
Weak Cryptography Findings Are Textual False Positives
The weak cryptography detections correspond to the word AI and descriptive visual reasoning text, not cryptographic algorithms or hashing code. No evidence found of MD5, SHA-1, DES, or similar weak cryptographic usage.
System Reconnaissance Findings Are Documentation False Positives
The scanner flagged ordinary validation and comparison prose as reconnaissance. No evidence found of OS discovery, environment probing, host enumeration, or network scanning instructions.

Risk Factors

Detected Patterns

Headless Browser Automation

Audit version 5

Safe

Jan 16, 2026, 02:35 PM

Pure prompt-based skill with no executable code. All 37 static findings are false positives: markdown backtick formatting was misidentified as shell execution, documentation references as file access, and API image attachment as malicious upload. This is a legitimate visual accuracy validation tool that sends images to Claude API for comparison.

2
Files scanned
565
Lines analyzed
1
findings
claude
Audited by
No security issues found

Audit version 4

Safe

Jan 16, 2026, 02:35 PM

Pure prompt-based skill with no executable code. All 37 static findings are false positives: markdown backtick formatting was misidentified as shell execution, documentation references as file access, and API image attachment as malicious upload. This is a legitimate visual accuracy validation tool that sends images to Claude API for comparison.

2
Files scanned
565
Lines analyzed
1
findings
claude
Audited by
No security issues found

Audit version 3

Safe

Jan 10, 2026, 09:53 AM

Pure prompt-based skill with no executable code. All described behaviors (image comparison, Claude API calls, file operations) are legitimate for visual accuracy validation. No obfuscation, no suspicious patterns, no credential theft or exfiltration.

1
Files scanned
385
Lines analyzed
0
findings
claude
Audited by
No security issues found

Audit version 2

Safe

Jan 10, 2026, 09:53 AM

Pure prompt-based skill with no executable code. All described behaviors (image comparison, Claude API calls, file operations) are legitimate for visual accuracy validation. No obfuscation, no suspicious patterns, no credential theft or exfiltration.

1
Files scanned
385
Lines analyzed
0
findings
claude
Audited by
No security issues found

Audit version 1

Safe

Jan 10, 2026, 09:53 AM

Pure prompt-based skill with no executable code. All described behaviors (image comparison, Claude API calls, file operations) are legitimate for visual accuracy validation. No obfuscation, no suspicious patterns, no credential theft or exfiltration.

1
Files scanned
385
Lines analyzed
0
findings
claude
Audited by
No security issues found