스킬 Azure AI Content Understanding SDK for Python

📦

Azure AI Content Understanding SDK for Python

Name: Azure AI Content Understanding SDK for Python
Author: sickn33

안전

Extract content from documents, images, audio, and video with Azure AI

Transform unstructured media files into structured, searchable content for RAG applications and automated workflows. Use prebuilt analyzers or create custom field extraction models with the Azure AI Content Understanding SDK.

지원: Claude Codex Code(CC)

⚠️ 68 나쁨

스킬 ZIP 다운로드

Claude에서 업로드

설정 → 기능 → 스킬 → 스킬 업로드로 이동

토글을 켜고 사용 시작

테스트해 보기

"Azure AI Content Understanding SDK for Python" 사용 중입니다. Analyze a research paper PDF using prebuilt-documentSearch

예상 결과:

Document analysis complete. Extracted 15 pages of markdown content including 3 figures, 2 tables, and 47 paragraphs. Content structured with proper headings and citations preserved.

"Azure AI Content Understanding SDK for Python" 사용 중입니다. Transcribe a 30-minute meeting recording

예상 결과:

Audio transcription complete. 1,247 words across 89 timestamped phrases. Key topics detected: project timeline, budget review, resource allocation. Full transcript available with speaker segmentation.

"Azure AI Content Understanding SDK for Python" 사용 중입니다. Extract invoice fields from vendor PDF

예상 결과:

Custom analyzer extracted: vendor_name='Acme Corporation', invoice_total=15420.50, invoice_date='2026-01-15', line_items=[{description: 'Software License', amount: 12000}, {description: 'Support', amount: 3420.50}]

보안 감사

안전

v1 • 2/24/2026

This skill contains documentation for the Azure AI Content Understanding SDK, an official Microsoft Azure service. Static analysis scanned 0 files with 0 security issues detected (risk score: 0/100). The SKILL.md file provides usage examples for document, image, audio, and video analysis using legitimate Azure SDK methods. No executable code, network calls, or dangerous patterns present.

스캔된 파일

분석된 줄 수

발견 사항

총 감사 수

보안 문제를 찾지 못했습니다

감사자: claude

품질 점수

아키텍처

100

유지보수성

콘텐츠

커뮤니티

100

보안

사양 준수

만들 수 있는 것

RAG Document Indexing

Convert PDF documents, research papers, and technical manuals into markdown format for retrieval-augmented generation systems.

Meeting Intelligence

Transcribe recorded meetings and webinars with speaker timestamps for searchable meeting notes and action item extraction.

Invoice Processing Automation

Extract vendor names, invoice totals, and line items from supplier invoices using custom field extraction models.

이 프롬프트를 사용해 보세요

Basic Document Analysis

Analyze this PDF document using the prebuilt-documentSearch analyzer and return the extracted markdown content. Document URL: {url}

Audio Transcription with Timestamps

Transcribe this audio file and provide all phrases with their start and end timestamps. Audio URL: {url}. Format output as: [start - end]: text

Video Content Summary

Analyze this video to extract key frames, transcript phrases, and generate a summary. Video URL: {url}. Return: 1) Key frame descriptions with timestamps, 2) Full transcript, 3) Executive summary

Custom Field Extraction

Create a custom analyzer with fields: {field_schema}. Then analyze this document: {url} and extract values for each defined field. Return results as structured JSON with field names and extracted values.

모범 사례

Use begin_analyze with AnalyzeInput for all analysis operations and await poller.result() for completion
Access extracted content via result.contents[0] and check the content.kind property for type-specific handling
Prefer async client with azure.identity.aio for high-throughput scenarios requiring concurrent analysis jobs

피하기

Do not call analyze() directly - always use begin_analyze() which returns a poller for long-running operations
Avoid accessing result.fields without first verifying the analyzer was configured with a field_schema
Do not use sync client for batch processing workflows that require analyzing multiple files concurrently

자주 묻는 질문

What file formats does Content Understanding support?

Documents: PDF, images (JPEG, PNG), Office docs. Audio: MP3, WAV, M4A. Video: MP4, MOV. The prebuilt analyzers are optimized for specific content types.

How long do analysis operations take?

Documents typically complete in seconds. Audio and video analysis are long-running operations that may take several minutes depending on media length and complexity.

Can I use this skill with local files?

The examples show URL-based inputs. For local files, you must upload to accessible storage (Azure Blob Storage recommended) and provide the URL to the analyzer.

What is the difference between sync and async clients?

Sync client uses blocking calls suitable for scripts and low-throughput scenarios. Async client (aio) enables concurrent operations and is recommended for high-throughput batch processing.

How do I create a custom analyzer?

Use client.create_analyzer() with an analyzer_id, description, base_analyzer_id, and field_schema defining the fields to extract. The custom analyzer persists for repeated use.

What authentication methods are supported?

DefaultAzureCredential supports managed identity, service principal, CLI credentials, and environment-based authentication. Configure CONTENTUNDERSTANDING_ENDPOINT environment variable with your resource URL.

개발자 세부 정보

작성자

sickn33

라이선스

MIT

리포지토리

https://github.com/sickn33/antigravity-awesome-skills/tree/main/skills/azure-ai-contentunderstanding-py

참조

main

파일 구조

📄 SKILL.md