Fähigkeiten Azure AI Content Understanding SDK for Python

📦

Azure AI Content Understanding SDK for Python

Name: Azure AI Content Understanding SDK for Python
Author: sickn33

Sicher

Extract content from documents, images, audio, and video with Azure AI

Transform unstructured media files into structured, searchable content for RAG applications and automated workflows. Use prebuilt analyzers or create custom field extraction models with the Azure AI Content Understanding SDK.

Unterstützt: Claude Codex Code(CC)

⚠️ 68 Schlecht

Die Skill-ZIP herunterladen

In Claude hochladen

Gehe zu Einstellungen → Fähigkeiten → Skills → Skill hochladen

Einschalten und loslegen

Teste es

Verwendung von "Azure AI Content Understanding SDK for Python". Analyze a research paper PDF using prebuilt-documentSearch

Erwartetes Ergebnis:

Document analysis complete. Extracted 15 pages of markdown content including 3 figures, 2 tables, and 47 paragraphs. Content structured with proper headings and citations preserved.

Verwendung von "Azure AI Content Understanding SDK for Python". Transcribe a 30-minute meeting recording

Erwartetes Ergebnis:

Audio transcription complete. 1,247 words across 89 timestamped phrases. Key topics detected: project timeline, budget review, resource allocation. Full transcript available with speaker segmentation.

Verwendung von "Azure AI Content Understanding SDK for Python". Extract invoice fields from vendor PDF

Erwartetes Ergebnis:

Custom analyzer extracted: vendor_name='Acme Corporation', invoice_total=15420.50, invoice_date='2026-01-15', line_items=[{description: 'Software License', amount: 12000}, {description: 'Support', amount: 3420.50}]

Sicherheitsaudit

Sicher

v1 • 2/24/2026

This skill contains documentation for the Azure AI Content Understanding SDK, an official Microsoft Azure service. Static analysis scanned 0 files with 0 security issues detected (risk score: 0/100). The SKILL.md file provides usage examples for document, image, audio, and video analysis using legitimate Azure SDK methods. No executable code, network calls, or dangerous patterns present.

Gescannte Dateien

Analysierte Zeilen

befunde

Gesamtzahl Audits

Keine Sicherheitsprobleme gefunden

Auditiert von: claude

Qualitätsbewertung

Architektur

100

Wartbarkeit

Inhalt

Community

100

Sicherheit

Spezifikationskonformität

Was du bauen kannst

RAG Document Indexing

Convert PDF documents, research papers, and technical manuals into markdown format for retrieval-augmented generation systems.

Meeting Intelligence

Transcribe recorded meetings and webinars with speaker timestamps for searchable meeting notes and action item extraction.

Invoice Processing Automation

Extract vendor names, invoice totals, and line items from supplier invoices using custom field extraction models.

Probiere diese Prompts

Basic Document Analysis

Analyze this PDF document using the prebuilt-documentSearch analyzer and return the extracted markdown content. Document URL: {url}

Audio Transcription with Timestamps

Transcribe this audio file and provide all phrases with their start and end timestamps. Audio URL: {url}. Format output as: [start - end]: text

Video Content Summary

Analyze this video to extract key frames, transcript phrases, and generate a summary. Video URL: {url}. Return: 1) Key frame descriptions with timestamps, 2) Full transcript, 3) Executive summary

Custom Field Extraction

Create a custom analyzer with fields: {field_schema}. Then analyze this document: {url} and extract values for each defined field. Return results as structured JSON with field names and extracted values.

Bewährte Verfahren

Use begin_analyze with AnalyzeInput for all analysis operations and await poller.result() for completion
Access extracted content via result.contents[0] and check the content.kind property for type-specific handling
Prefer async client with azure.identity.aio for high-throughput scenarios requiring concurrent analysis jobs

Vermeiden

Do not call analyze() directly - always use begin_analyze() which returns a poller for long-running operations
Avoid accessing result.fields without first verifying the analyzer was configured with a field_schema
Do not use sync client for batch processing workflows that require analyzing multiple files concurrently

Häufig gestellte Fragen

What file formats does Content Understanding support?

Documents: PDF, images (JPEG, PNG), Office docs. Audio: MP3, WAV, M4A. Video: MP4, MOV. The prebuilt analyzers are optimized for specific content types.

How long do analysis operations take?

Documents typically complete in seconds. Audio and video analysis are long-running operations that may take several minutes depending on media length and complexity.

Can I use this skill with local files?

The examples show URL-based inputs. For local files, you must upload to accessible storage (Azure Blob Storage recommended) and provide the URL to the analyzer.

What is the difference between sync and async clients?

Sync client uses blocking calls suitable for scripts and low-throughput scenarios. Async client (aio) enables concurrent operations and is recommended for high-throughput batch processing.

How do I create a custom analyzer?

Use client.create_analyzer() with an analyzer_id, description, base_analyzer_id, and field_schema defining the fields to extract. The custom analyzer persists for repeated use.

What authentication methods are supported?

DefaultAzureCredential supports managed identity, service principal, CLI credentials, and environment-based authentication. Configure CONTENTUNDERSTANDING_ENDPOINT environment variable with your resource URL.

Entwicklerdetails

Autor

sickn33

Lizenz

MIT

Repository

https://github.com/sickn33/antigravity-awesome-skills/tree/main/skills/azure-ai-contentunderstanding-py

Ref

main

Dateistruktur

📄 SKILL.md