Process PDF documents automatically
Also available from: 92Bilal26,92Bilal26,davila7,ArtemisAI,AutumnsGrove,Azeem-2,K-Dense-AI,DYAI2025,ComposioHQ,Cam10001110101
PDF documents require manual effort to extract data, fill forms, or combine multiple files. This skill provides programmatic tools to extract text and tables, create new PDFs, merge and split documents, and fill forms automatically using established Python libraries.
Download the skill ZIP
Upload in Claude
Go to Settings → Capabilities → Skills → Upload skill
Toggle on and start using
Test it
Using "pdf". Extract the text and tables from the quarterly report PDF
Expected outcome:
- - Found 12 pages in the PDF
- - Extracted 245 paragraphs of text
- - Identified 3 tables with structured data
- - Table 1: Revenue by Region (4 rows x 3 columns)
- - Table 2: Expenses by Category (6 rows x 3 columns)
- - Table 3: Growth Metrics (5 rows x 2 columns)
Using "pdf". Create a welcome document with company letterhead and branding
Expected outcome:
- - Created 3-page PDF document
- - Added company logo and letterhead on page 1
- - Included 2 paragraphs of welcome text
- - Added page numbers on all pages
- - Applied brand colors to headers
Using "pdf". Fill out the tax form with the provided employee information
Expected outcome:
- - Verified form has 15 fillable fields
- - Filled text fields: name, address, SSN, income
- - Set 3 checkbox fields
- - Selected radio group option for filing status
- - Saved completed form to completed_tax_form.pdf
Security Audit
SafeOfficial Anthropic skill from github.com/anthropics/skills repository. Contains only PDF processing scripts using established Python libraries (pypdf, pdfplumber, reportlab). No network calls, no external command execution, no sensitive filesystem access beyond specified PDF files.
Risk Factors
⚡ Contains scripts (2)
📁 Filesystem access (1)
Quality Score
What You Can Build
Automate form data entry
Automatically fill PDF forms with data from other sources to reduce manual data entry work.
Extract tables from reports
Extract tabular data from PDF reports for analysis in spreadsheets or databases.
Combine case documents
Merge multiple PDF documents into organized case files with proper page ordering.
Try These Prompts
Extract all text from input.pdf and save it to output.txt using pdfplumber.
Merge doc1.pdf, doc2.pdf, and doc3.pdf into merged.pdf using pypdf.
Fill the fields in form.pdf using the data from values.json. First check if the PDF has fillable fields.
Create a new PDF report with title, table of quarterly sales data, and page numbers using reportlab.
Best Practices
- Use pdfplumber for text extraction as it preserves layout better than pypdf
- Validate form field values before filling to catch errors early
- Convert PDFs to images first when filling non-fillable forms to ensure accurate placement
Avoid
- Do not attempt to fill complex non-fillable forms without visual analysis
- Do not skip bounding box validation when placing annotations
- Do not process password-protected PDFs without obtaining the password
Frequently Asked Questions
Which PDF libraries work with Claude and Claude Code?
What are the size limits for PDF processing?
Can this skill integrate with existing document workflows?
Is my data safe when processing PDFs?
Why is my text extraction returning empty results?
How does this compare to other PDF tools?
Developer Details
File structure