Compétences azure-ai-formrecognizer-java
📦

azure-ai-formrecognizer-java

Sûr

Extract Data from Documents with Azure Form Recognizer

Automate document processing by extracting text, tables, and structured data from forms, invoices, and receipts. Build Java applications that leverage Azure Document Intelligence prebuilt models or create custom document analyzers.

Prend en charge: Claude Codex Code(CC)
🥉 74 Bronze
1

Télécharger le ZIP du skill

2

Importer dans Claude

Allez dans Paramètres → Capacités → Skills → Importer un skill

3

Activez et commencez Ă  utiliser

Tester

Utilisation de "azure-ai-formrecognizer-java". Analyze receipt from restaurant image

Résultat attendu:

Extracted fields: Merchant='The Grand Bistro', TransactionDate=2024-01-15, Total=87.42, Subtotal=78.50, Tax=8.92. Line items: Pasta Carbonara ($24.00), Caesar Salad ($18.00), Tiramisu ($12.00), Wine ($24.50). Confidence scores above 0.95 for all fields.

Utilisation de "azure-ai-formrecognizer-java". Extract table from quarterly report PDF

Résultat attendu:

Table detected: 12 rows x 5 columns. Headers: Quarter, Revenue, Expenses, Profit, Growth. Data extracted with cell positions and confidence scores. Table structure preserved with row/column indices for spreadsheet export.

Utilisation de "azure-ai-formrecognizer-java". Process batch of mixed document types

Résultat attendu:

Document 1: Classified as Invoice (98% confidence) - Vendor: ABC Supply, Amount: $1,250.00. Document 2: Classified as Receipt (96% confidence) - Merchant: Office Depot, Total: $89.99. Document 3: Classified as Contract (94% confidence) - 8 pages detected.

Audit de sécurité

Sûr
v1 • 2/24/2026

All static analysis findings are false positives. The file is Markdown documentation containing Java SDK code examples. No executable code, command injection, or malicious patterns detected. External command detections confused markdown code fences with shell syntax. URL references are documentation examples for Azure endpoint configuration.

1
Fichiers analysés
347
Lignes analysées
0
résultats
1
Total des audits
Aucun problème de sécurité trouvé
Audité par: claude

Score de qualité

38
Architecture
100
Maintenabilité
87
Contenu
50
Communauté
100
Sécurité
91
Conformité aux spécifications

Ce que vous pouvez construire

Automated Invoice Processing

Extract vendor names, invoice dates, line items, and totals from supplier invoices. Integrate with accounts payable systems to automate data entry and reduce manual processing time.

Receipt Data Capture

Build expense reporting applications that extract merchant names, transaction dates, and itemized purchases from receipt images. Enable mobile receipt scanning for employee reimbursements.

Custom Form Digitization

Create custom models for organization-specific forms like purchase orders, contracts, or survey responses. Train the model on sample documents and deploy automated extraction workflows.

Essayez ces prompts

Quick Start: Analyze a Document
Create a Java program using Azure Document Intelligence SDK to extract text and tables from a PDF file. Use the prebuilt-layout model and print the results to console.
Receipt Analysis with Field Extraction
Write Java code to analyze a receipt image using the prebuilt-receipt model. Extract merchant name, transaction date, and total amount with confidence scores. Handle cases where fields may be missing.
Custom Model for Invoice Processing
Generate Java code to build a custom document model for invoice extraction. Include training from Azure Blob Storage, model composition from multiple invoice templates, and error handling for training failures.
Document Classification Pipeline
Create a complete Java solution that classifies incoming documents as invoices, receipts, or contracts using a custom classifier, then routes each type to the appropriate prebuilt model for detailed extraction. Include async polling and retry logic.

Bonnes pratiques

  • Use DefaultAzureCredential for production deployments to leverage managed identities and avoid hardcoding credentials
  • Implement polling with appropriate timeouts for long-running analysis operations on large documents
  • Validate document format and size before submission to avoid unnecessary API calls and costs

Éviter

  • Do not hardcode API keys or endpoints in source code; use environment variables or Azure Key Vault
  • Avoid synchronous blocking calls in high-throughput scenarios; use async patterns for scalability
  • Do not process documents without validating file types; unsupported formats will fail analysis

Foire aux questions

What document formats does Azure Form Recognizer support?
Azure Document Intelligence supports PDF, JPEG, PNG, BMP, and TIFF formats. PDF documents can be up to 500 pages. Maximum file size is 500 MB for PDF and 20 MB for images.
How accurate are the prebuilt models?
Prebuilt models typically achieve 95%+ accuracy on standard documents. Accuracy varies by document quality, language, and field type. Custom models trained on your specific documents can achieve higher accuracy for specialized forms.
What is the difference between prebuilt and custom models?
Prebuilt models work out-of-box for common document types like invoices and receipts. Custom models are trained on your specific document templates for fields not covered by prebuilt models, such as proprietary forms or specialized layouts.
How do I handle documents in multiple languages?
Azure Document Intelligence supports over 70 languages automatically. Prebuilt models detect language from document content. For custom models, include training samples in all target languages for best results.
What happens if the analysis operation fails?
Failed operations return an error with status code and message. Common errors include invalid document format, unsupported language, or service throttling. Implement retry logic with exponential backoff for transient failures.
Can I process documents stored in Azure Blob Storage directly?
Yes, you can pass a SAS URL to beginAnalyzeDocumentFromUrl instead of uploading file bytes. This is more efficient for large documents and reduces bandwidth usage. Ensure the SAS token has read permissions and adequate validity period.

Détails du développeur

Structure de fichiers

đź“„ SKILL.md