Fähigkeiten Azure AI Content Understanding SDK for Python
📦

Azure AI Content Understanding SDK for Python

Sicher

Extract content from documents, images, audio, and video with Azure AI

Transform unstructured media files into structured, searchable content for RAG applications and automated workflows. Use prebuilt analyzers or create custom field extraction models with the Azure AI Content Understanding SDK.

UnterstĂĽtzt: Claude Codex Code(CC)
📊 69 Angemessen
1

Die Skill-ZIP herunterladen

2

In Claude hochladen

Gehe zu Einstellungen → Fähigkeiten → Skills → Skill hochladen

3

Einschalten und loslegen

Teste es

Verwendung von "Azure AI Content Understanding SDK for Python". Analyze a research paper PDF using prebuilt-documentSearch

Erwartetes Ergebnis:

Document analysis complete. Extracted 15 pages of markdown content including 3 figures, 2 tables, and 47 paragraphs. Content structured with proper headings and citations preserved.

Verwendung von "Azure AI Content Understanding SDK for Python". Transcribe a 30-minute meeting recording

Erwartetes Ergebnis:

Audio transcription complete. 1,247 words across 89 timestamped phrases. Key topics detected: project timeline, budget review, resource allocation. Full transcript available with speaker segmentation.

Verwendung von "Azure AI Content Understanding SDK for Python". Extract invoice fields from vendor PDF

Erwartetes Ergebnis:

Custom analyzer extracted: vendor_name='Acme Corporation', invoice_total=15420.50, invoice_date='2026-01-15', line_items=[{description: 'Software License', amount: 12000}, {description: 'Support', amount: 3420.50}]

Sicherheitsaudit

Sicher
v1 • 2/24/2026

This skill contains documentation for the Azure AI Content Understanding SDK, an official Microsoft Azure service. Static analysis scanned 0 files with 0 security issues detected (risk score: 0/100). The SKILL.md file provides usage examples for document, image, audio, and video analysis using legitimate Azure SDK methods. No executable code, network calls, or dangerous patterns present.

0
Gescannte Dateien
0
Analysierte Zeilen
0
befunde
1
Gesamtzahl Audits
Keine Sicherheitsprobleme gefunden
Auditiert von: claude

Qualitätsbewertung

38
Architektur
100
Wartbarkeit
87
Inhalt
30
Community
100
Sicherheit
74
Spezifikationskonformität

Was du bauen kannst

RAG Document Indexing

Convert PDF documents, research papers, and technical manuals into markdown format for retrieval-augmented generation systems.

Meeting Intelligence

Transcribe recorded meetings and webinars with speaker timestamps for searchable meeting notes and action item extraction.

Invoice Processing Automation

Extract vendor names, invoice totals, and line items from supplier invoices using custom field extraction models.

Probiere diese Prompts

Basic Document Analysis
Analyze this PDF document using the prebuilt-documentSearch analyzer and return the extracted markdown content. Document URL: {url}
Audio Transcription with Timestamps
Transcribe this audio file and provide all phrases with their start and end timestamps. Audio URL: {url}. Format output as: [start - end]: text
Video Content Summary
Analyze this video to extract key frames, transcript phrases, and generate a summary. Video URL: {url}. Return: 1) Key frame descriptions with timestamps, 2) Full transcript, 3) Executive summary
Custom Field Extraction
Create a custom analyzer with fields: {field_schema}. Then analyze this document: {url} and extract values for each defined field. Return results as structured JSON with field names and extracted values.

Bewährte Verfahren

  • Use begin_analyze with AnalyzeInput for all analysis operations and await poller.result() for completion
  • Access extracted content via result.contents[0] and check the content.kind property for type-specific handling
  • Prefer async client with azure.identity.aio for high-throughput scenarios requiring concurrent analysis jobs

Vermeiden

  • Do not call analyze() directly - always use begin_analyze() which returns a poller for long-running operations
  • Avoid accessing result.fields without first verifying the analyzer was configured with a field_schema
  • Do not use sync client for batch processing workflows that require analyzing multiple files concurrently

Häufig gestellte Fragen

What file formats does Content Understanding support?
Documents: PDF, images (JPEG, PNG), Office docs. Audio: MP3, WAV, M4A. Video: MP4, MOV. The prebuilt analyzers are optimized for specific content types.
How long do analysis operations take?
Documents typically complete in seconds. Audio and video analysis are long-running operations that may take several minutes depending on media length and complexity.
Can I use this skill with local files?
The examples show URL-based inputs. For local files, you must upload to accessible storage (Azure Blob Storage recommended) and provide the URL to the analyzer.
What is the difference between sync and async clients?
Sync client uses blocking calls suitable for scripts and low-throughput scenarios. Async client (aio) enables concurrent operations and is recommended for high-throughput batch processing.
How do I create a custom analyzer?
Use client.create_analyzer() with an analyzer_id, description, base_analyzer_id, and field_schema defining the fields to extract. The custom analyzer persists for repeated use.
What authentication methods are supported?
DefaultAzureCredential supports managed identity, service principal, CLI credentials, and environment-based authentication. Configure CONTENTUNDERSTANDING_ENDPOINT environment variable with your resource URL.

Entwicklerdetails

Dateistruktur

đź“„ SKILL.md