Skills ai-html-generate

🎨

ai-html-generate

Name: ai-html-generate
Author: AbeJitsu

Safe 🌐 Network access📁 Filesystem access⚙️ External commands

Convert PDF Pages to Semantic HTML

Manually converting PDF pages to semantic HTML is time-consuming and error-prone. This skill uses AI to automatically generate semantic HTML5 from PDF pages with three sources of context for high accuracy.

Supports: Claude Codex Code(CC)

📊 69 Adequate

Download the skill ZIP

Upload in Claude

Go to Settings → Capabilities → Skills → Upload skill

Toggle on and start using

Test it

Using "ai-html-generate". Convert Chapter 2, Page 16 to semantic HTML

Expected outcome:

Generates 04_page_16.html with semantic HTML5 structure
Includes page-container wrapper and page-content main element
Applies chapter-header with chapter-number and chapter-title
Creates section-navigation with nav-item elements
Uses proper heading hierarchy (h1 → h2 → h3 → h4)
Wraps bullet lists in ul.bullet-list with li.bullet-item
Logs generation metadata to 05_generation_metadata.json

Security Audit

Safe

v5 • 1/16/2026

This skill consists solely of documentation describing how to use AI for PDF-to-HTML conversion. SKILL.md is a markdown file containing prompts and instructions - not executable code. All 59 static findings are false positives: backticks are markdown code formatting, file paths are documentation examples, and there is no actual cryptographic code, shell execution, or file operations in this skill. The skill contains no network operations, no credential access, and no command execution capabilities.

Files scanned

713

Lines analyzed

findings

Total audits

Risk Factors

Audited by: claude View Audit History →

Quality Score

Architecture

100

Maintainability

Content

Community

100

Security

Spec Compliance

What You Can Build

Document Accessibility

Convert textbook chapters to accessible HTML for screen readers and web display with proper semantic markup.

PDF Content Migration

Migrate PDF documentation to web-ready HTML with consistent class names and structured formatting.

Digital Publishing

Transform printed materials into digital formats with proper heading hierarchies and semantic elements.

Try These Prompts

Basic Generation

Convert the attached PDF page to semantic HTML5. Use the PNG image for layout, the JSON data for text accuracy, and the ASCII preview for structure.

With Coverage Check

Generate HTML that includes every word from the source JSON. Do not add any bridging text or transitional phrases between pages.

Semantic Structure

Create semantic HTML with proper heading hierarchy (h1 to h4), list structures, and CSS classes. Use classes like page-container, section-heading, paragraph, and bullet-list.

Multi-Modal Context

Analyze the PNG image for visual layout, use the JSON for exact text content, and follow the ASCII preview for element relationships. Generate HTML5 that accurately recreates the page.

Best Practices

Run the verification script after each page generation to catch AI hallucination early
Use all three input sources (PNG, JSON, ASCII) for complete context
Check coverage percentage is between 99-100% before proceeding to the next page
Do not consolidate chapters until all individual pages pass the verification gate

Avoid

Skipping the text verification step and proceeding to consolidation
Accepting coverage above 100% which indicates the AI added invented content
Consolidating pages with any coverage below 95%
Allowing the AI to add bridging text between page boundaries

Frequently Asked Questions

Which AI models support this skill?

This skill works with Claude, Codex, and Claude Code. It requires vision capabilities for the PNG image input and text generation for the HTML output.

What happens if text coverage is below 95%?

Coverage below 95% indicates missing content. Regenerate the page and re-run verification. Never proceed with pages that fail the quality gate.

Can this skill process scanned PDFs without text layers?

No. This skill requires rich_extraction.json from a previous text extraction skill. Scanned images need OCR processing first.

Is my document data sent to external servers?

Document data is only sent to the configured AI API endpoint. No data is collected or stored by this skill itself. See your AI provider's privacy policy.

Why does the verification sometimes show over 100% coverage?

Coverage above 100% means the AI added words not present in the source. This is hallucination. Regenerate with stricter boundary instructions.

How is this skill different from other PDF converters?

This skill uses three sources of context (visual, textual, structural) instead of a single input. It applies semantic CSS classes and includes validation gates to prevent errors.

Developer Details

Author

AbeJitsu

License

MIT

Repository

https://github.com/AbeJitsu/Game-Settings-Panel/tree/main/.claude/skills/calypso/ai-html-generate

Ref

main

File structure

📄 SKILL.md