pdf-analyze
Process PDF documents for extraction and form filling
متاح أيضًا من: 21pounder
PDF documents often contain important data that is difficult to access programmatically. This skill provides Claude with comprehensive tools to extract text and tables, fill forms, merge documents, and convert PDFs to images for analysis.
تنزيل ZIP المهارة
رفع في Claude
اذهب إلى Settings → Capabilities → Skills → Upload skill
فعّل وابدأ الاستخدام
اختبرها
استخدام "pdf-analyze". Extract text from report.pdf and list all tables found
النتيجة المتوقعة:
- Extracted 24 pages from report.pdf
- Found 5 tables:
- - Table 1: 'Revenue by Region' (page 3)
- - Table 2: 'Q4 Performance Metrics' (page 7)
- - Table 3: 'Customer Demographics' (page 12)
- - Table 4: 'Year-over-Year Growth' (page 18)
- - Table 5: 'Projected 2025 Targets' (page 22)
- Saved extracted_text.txt (45 KB) and tables to tables_20250110.xlsx
التدقيق الأمني
مخاطر منخفضةThis is a legitimate PDF processing toolkit containing utility scripts for extracting text, filling forms, and manipulating documents. All code uses standard Python PDF libraries with no network access, no credential theft patterns, and no obfuscation. The skill's behavior aligns with its stated purpose.
درجة الجودة
ماذا يمكنك بناءه
Extract tables from reports
Pull structured data from financial reports, research papers, and statistical documents into CSV or Excel format.
Automate form completion
Fill out PDF forms programmatically with validated data for applications, surveys, and official documents.
Build PDF processing workflows
Create document processing pipelines that merge, split, and transform PDFs for applications and services.
جرّب هذه الموجهات
Extract all text from document.pdf using pdfplumber and save it to extracted_text.txt
Check if application_form.pdf has fillable form fields, and if so, list all field names and types
Extract all tables from quarterly_report.pdf and save them to an Excel file with one sheet per table
Fill in the following fields in application_form.pdf using data from field_values.json and save to completed_form.pdf
أفضل الممارسات
- Validate form field values before submission to catch errors early
- Convert PDF to images first when working with non-fillable forms to visually verify annotation placement
- Use the bounding box validation script to ensure annotations do not overlap or obscure existing content
تجنب
- Skipping the form field validation step before filling PDFs
- Not converting non-fillable PDFs to images for visual analysis first
- Using hardcoded file paths instead of parameters for reusability
الأسئلة المتكررة
Which Python libraries does this skill use?
What are the system requirements?
How do I fill a scanned PDF that is not fillable?
Is my data safe when processing PDFs?
Why does my filled PDF show annotations in the wrong position?
How is this different from using pdf-lib in JavaScript?
تفاصيل المطور
المؤلف
21pounderالترخيص
Proprietary. LICENSE.txt has complete terms
المستودع
https://github.com/21pounder/terminalAgent/tree/main/deepresearch/.claude/skills/pdf-analyzeمرجع
main