📄

pdf

Name: pdf
Author: ComposioHQ

低リスク ⚡ スクリプトを含む

Claude で PDF ドキュメントを処理する

こちらからも入手できます: ArtemisAI,sickn33,Azeem-2,92Bilal26,92Bilal26,anthropics,ZhanlinCui,AutumnsGrove,DYAI2025,K-Dense-AI,davila7,Cam10001110101

PDF からデータを抽出したり、フォームに入力したり、ドキュメントを結合したりする必要がありますか？Claude は Python ライブラリとコマンドラインツールを使って PDF を読み取り、作成し、変更できます。

対応: Claude Codex Code(CC)

🥉 73 ブロンズ

スキルZIPをダウンロード

Claudeでアップロード

設定 → 機能 → スキル → スキルをアップロードへ移動

オンにして利用開始

テストする

「pdf」を使用しています。 Read document.pdf and tell me what it contains

期待される結果:

Document: document.pdf (12 pages)
Contains: Invoice #2024-001 for Acme Corp
Metadata: Created 2024-12-15, Author: J. Smith
No fillable form fields detected
Extracted 3 tables (financial summary on page 5)

「pdf」を使用しています。 Fill out my application form

期待される結果:

Analyzed form fields: 12 text fields, 3 checkboxes, 2 radio groups
Filled 17 fields with provided values
Output saved to: filled_application.pdf
Validation: All fields populated correctly

セキュリティ監査

低リスク

v4 • 1/16/2026

Legitimate PDF manipulation toolkit containing documentation and Python scripts using standard PDF libraries (pypdf, pdfplumber, reportlab). Static findings are false positives: 143 patterns detected in markdown files are code examples in documentation, not executable code. The Python scripts in scripts/ directory are standard PDF processing utilities with no malicious behavior.

スキャンされたファイル

2,192

解析された行数

検出結果

総監査数

リスク要因

⚡ スクリプトを含む (4)

scripts/check_bounding_boxes.py:1-71 scripts/extract_form_field_info.py:1-153 scripts/fill_fillable_fields.py:1-115 scripts/convert_pdf_to_images.py:1-36

監査者: claude 監査履歴を表示 →

品質スコア

アーキテクチャ

100

保守性

コンテンツ

コミュニティ

セキュリティ

仕様準拠

作れるもの

論文からデータを抽出

分析や参照のために研究論文から表やテキストを抽出

PDF フォームを自動入力

JSON 設定でフィールド値を指定してフォームを入力

契約書類を処理

複数ページの契約書を分割し、条項を抽出し、修正済み版を結合

これらのプロンプトを試す

基本的な抽出

Read the PDF at [file.pdf] and tell me how many pages it has, then extract all the text

表の抽出

Extract all tables from [report.pdf] and save them to a CSV file

フォーム入力

Fill out the form in [application.pdf] using these field values: name=John Smith, email=john@example.com

ドキュメントの組み立て

Merge all PDFs in the [input] folder into one file, then split it into separate files by chapter

ベストプラクティス

テキスト抽出には pdfplumber、ドキュメント操作には pypdf、コマンドライン操作には qpdf を使用する
下流処理で精度が重要な場合は座標付きで表を抽出する
メモリ問題を避けるために大きな PDF は処理前に分割する

回避

プログラムでの抽出が速い場合は手作業でのコピー＆ペーストをしない
入力可能なフォームフィールドと視覚的なテキストボックスを混同しない（別のアプローチが必要です）
印刷用に設計された PDF を編集しようとしない（代わりに print-to-PDF ワークフローを使用する）

よくある質問

どの AI ツールがこのスキルに対応していますか？

Claude、Codex、Claude Code に対応しています。pypdf と pdfplumber をインストールした Python 環境が必要です。

必要な Python パッケージは何ですか？

pip で pypdf、pdfplumber、reportlab、pdf2image をインストールします。一部の CLI ツールには poppler-utils が必要です。

入力可能な PDF フォームはどうやって埋めますか？

extract_form_field_info.py を実行してフィールド一覧を出力し、field_values.json にデータを作成してから fill_fillable_fields.py を実行します。

PDF を処理するとき、データは安全ですか？

はい。すべての処理はローカルマシンで行われます。データは外部サーバーに送信されません。

PDF のテキスト抽出が空になるのはなぜですか？

PDF はスキャン画像の可能性があります。OCR には pdf2image と一緒に pytesseract を使用するか、先に画像に変換してください。

他の PDF ツールと何が違いますか？

このスキルは検証、座標マッピング、フォーム処理を含む構造化されたワークフローを Claude に提供します。

開発者の詳細

作成者

ComposioHQ

ライセンス

Proprietary. LICENSE.txt has complete terms

リポジトリ

https://github.com/ComposioHQ/awesome-claude-skills/tree/master/document-skills/pdf

参照

master

ファイル構成

📁 scripts/

📄 check_bounding_boxes_test.py

📄 check_bounding_boxes.py

📄 check_fillable_fields.py

📄 convert_pdf_to_images.py

📄 create_validation_image.py

📄 extract_form_field_info.py

📄 fill_fillable_fields.py

📄 fill_pdf_form_with_annotations.py

📄 forms.md

📄 LICENSE.txt

📄 reference.md

📄 SKILL.md