📄

pdf

Name: pdf
Author: 92Bilal26

Niedriges Risiko ⚡ Enthält Skripte📁 Dateisystemzugriff

处理 PDF 文档

Auch verfügbar von: ArtemisAI,sickn33,Azeem-2,92Bilal26,anthropics,ZhanlinCui,AutumnsGrove,DYAI2025,K-Dense-AI,davila7,Cam10001110101,ComposioHQ

PDF 文档难以通过编程方式进行编辑。此技能提供了提取文本和表格、创建新 PDF、合并和拆分文档以及自动填写 PDF 表单的工具。

Unterstützt: Claude Codex Code(CC)

🥉 72 Bronze

Die Skill-ZIP herunterladen

In Claude hochladen

Gehe zu Einstellungen → Fähigkeiten → Skills → Skill hochladen

Einschalten und loslegen

Teste es

Verwendung von "pdf". 从 invoice.pdf 提取文本

Erwartetes Ergebnis:

Invoice #12345 - 3 pages
Page 1: Bill to: Acme Corp, Total: $1,250.00
Page 2: Line items (3 items)
Page 3: Payment terms and signature line

Sicherheitsaudit

Niedriges Risiko

v3 • 1/10/2026

Legitimate PDF processing toolkit using standard Python libraries. No network connectivity, no credential access, no code execution vulnerabilities. Scripts only read and write user-specified PDF files.

Gescannte Dateien

900

Analysierte Zeilen

befunde

Gesamtzahl Audits

Risikofaktoren

⚡ Enthält Skripte (8)

scripts/check_fillable_fields.py:1-13 scripts/extract_form_field_info.py:1-153 scripts/convert_pdf_to_images.py:1-36 scripts/fill_fillable_fields.py:1-115 scripts/fill_pdf_form_with_annotations.py:1-108 scripts/create_validation_image.py:1-42 scripts/check_bounding_boxes.py:1-71 scripts/check_bounding_boxes_test.py:1-227

📁 Dateisystemzugriff (6)

scripts/extract_form_field_info.py:140-145 scripts/convert_pdf_to_images.py:22-24 scripts/fill_fillable_fields.py:13-14 scripts/fill_pdf_form_with_annotations.py:32-33 scripts/create_validation_image.py:13-14 scripts/check_bounding_boxes.py:67

Auditiert von: claude Audit-Verlauf anzeigen →

Qualitätsbewertung

Architektur

100

Wartbarkeit

Inhalt

Community

Sicherheit

Spezifikationskonformität

Was du bauen kannst

提取表格数据

从 PDF 报告中提取结构化表格，转换为 CSV 或 Excel 格式进行分析

自动填写表单

自动填写包含姓名和地址等一致信息的 PDF 表单

合并文档

将多个 PDF 文档合并为一个文件，便于分发

Probiere diese Prompts

提取文本

Extract all text from the PDF file located at [path]. Show the total number of pages and provide a summary of the content.

提取表格

Extract all tables from the PDF report at [path] and save them to an Excel file with each table on a separate sheet.

合并 PDF

Merge these PDF files into a single document: [list of files]. Save the result as merged.pdf.

填写 PDF 表单

Fill out the fillable PDF form at [input.pdf] with the values in field_values.json and save the result as [output.pdf].

Bewährte Verfahren

在处理前验证输入 PDF 是否受密码保护
将 PDF 转换为图像时使用高 DPI 设置以获得更好的质量
尝试填写 PDF 表单前验证表单字段值

Vermeiden

不要假设所有 PDF 都有可提取的文本（扫描文档需要 OCR）
不要尝试在不使用注释方法的情况下填写不可填写的 PDF 表单
没有正确密码不要处理加密的 PDF

Häufig gestellte Fragen

此技能可以从扫描的 PDF 中提取文本吗？

扫描的 PDF 包含图像而非文本。首先使用 pytesseract 等 OCR 工具将扫描的页面转换为文本。

支持哪些 PDF 表单字段类型？

可填写的 PDF 表单支持文本字段、复选框、单选组和选择/下拉字段。

我可以合并不同页面大小的 PDF 吗？

可以，pypdf 可以合并不同页面大小的 PDF。可能需要调整内容以实现一致呈现。

处理 PDF 时我的数据安全吗？

所有处理都在本地进行。文件从您指定的路径读取并写入。没有数据发送到外部服务器。

为什么文本提取不完整？

某些 PDF 使用非标准编码或图像而非文本。对于复杂文档，请尝试替代提取方法或 OCR。

这与在线 PDF 工具相比如何？

此技能提供程序化控制、批处理和隐私保护，因为文件永远不会像在线服务那样离开您的系统。

Entwicklerdetails

Autor

92Bilal26

Lizenz

Proprietary. LICENSE.txt has complete terms

Repository

https://github.com/92Bilal26/TaskPilotAI/tree/main/.claude/skills/pdf

Ref

main

Dateistruktur

📁 scripts/

📄 check_bounding_boxes_test.py

📄 check_bounding_boxes.py

📄 check_fillable_fields.py

📄 convert_pdf_to_images.py

📄 create_validation_image.py

📄 extract_form_field_info.py

📄 fill_fillable_fields.py

📄 fill_pdf_form_with_annotations.py

📄 forms.md

📄 LICENSE.txt

📄 reference.md

📄 SKILL.md