Azure Speech to Text REST API for Short Audio
Transcribe Audio with Azure Speech to Text REST API
Transcribe short audio files (up to 60 seconds) using the Azure Speech to Text REST API, without installing the Speech SDK. This skill provides ready-to-use Python code for authentication, audio formatting, and error handling.
Download the skill ZIP
Upload it in Claude
Go to Settings → Features → Skills → Upload skill
Turn it on and start using it
Test it
Using "Azure Speech to Text REST API for Short Audio". Transcribe audio.wav using Azure Speech to Text with detailed format
Expected result:
Transcription successful!
Display Text: What's the weather like today?
Confidence: 90.5%
The audio has been transcribed using Azure Speech to Text REST API with detailed response format.
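A minimal sketch of how output like the above might be derived from a detailed-format response body. The field names `RecognitionStatus`, `NBest`, `Confidence`, `Display`, and `DisplayText` follow the service's JSON response; the function name `summarize_detailed_result` and the sample body are hypothetical, made up here to mirror the example.

```python
import json
from typing import Optional


def summarize_detailed_result(payload: str) -> dict:
    """Extract the top hypothesis from a detailed-format response body."""
    data = json.loads(payload)
    nbest = data.get("NBest") or []
    # Pick the hypothesis with the highest confidence, if any were returned.
    best: Optional[dict] = max(
        nbest, key=lambda h: h.get("Confidence", 0.0), default=None
    )
    return {
        "status": data.get("RecognitionStatus"),
        "display_text": best.get("Display") if best else data.get("DisplayText"),
        "confidence": best.get("Confidence") if best else None,
    }


# Hypothetical sample body, shaped like the expected result above.
sample = json.dumps({
    "RecognitionStatus": "Success",
    "NBest": [
        {"Confidence": 0.905, "Display": "What's the weather like today?"}
    ],
})

result = summarize_detailed_result(sample)
print(f"Display Text: {result['display_text']}")
print(f"Confidence: {result['confidence']:.1%}")
```

When `NBest` is empty or absent, the sketch falls back to the top-level `DisplayText` and reports no confidence.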
Using "Azure Speech to Text REST API for Short Audio". How do I authenticate with Azure Speech to Text using bearer tokens?
Expected result:
You can authenticate using bearer tokens for improved security. The token is valid for 10 minutes. Here's how to implement it:
1. Get access token from the token endpoint
2. Use the token in the Authorization header
3. Cache the token and renew before expiration
This approach is recommended for production applications.
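The three steps above can be sketched roughly as follows, using only the standard library. The token endpoint and the `Ocp-Apim-Subscription-Key` header are the ones the service documents; the names `fetch_token` and `TokenCache`, the 9-minute default, and the injectable `fetch` and `clock` parameters are assumptions made for this illustration, not part of the skill.

```python
import time
import urllib.request
from typing import Callable, Optional


def fetch_token(region: str, key: str) -> str:
    """Step 1: exchange a resource key for a short-lived access token."""
    url = f"https://{region}.api.cognitive.microsoft.com/sts/v1.0/issueToken"
    req = urllib.request.Request(
        url,
        data=b"",  # the endpoint expects a POST with an empty body
        method="POST",
        headers={"Ocp-Apim-Subscription-Key": key},
    )
    with urllib.request.urlopen(req, timeout=10) as resp:
        return resp.read().decode("utf-8")


class TokenCache:
    """Step 3: reuse a token until shortly before its 10-minute expiry."""

    def __init__(
        self,
        fetch: Callable[[], str],
        ttl_seconds: float = 9 * 60,  # renew one minute early (assumption)
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch
        self._ttl = ttl_seconds
        self._clock = clock
        self._token: Optional[str] = None
        self._expires_at = 0.0

    def get(self) -> str:
        now = self._clock()
        if self._token is None or now >= self._expires_at:
            self._token = self._fetch()
            self._expires_at = now + self._ttl
        return self._token

    def auth_header(self) -> dict:
        """Step 2: the header to attach to recognition requests."""
        return {"Authorization": f"Bearer {self.get()}"}
```

A caller might construct it as `TokenCache(lambda: fetch_token(region, key))` and merge `auth_header()` into each request's headers. Passing the fetch function in keeps the cache testable without network access.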
Security audit
Safe: This is a prompt-only documentation skill that provides guidance for integrating with the Azure Speech to Text REST API. Static analysis found no suspicious patterns, no executable code, and no risk factors. The skill consists of documentation and code examples for legitimate API integration. No security concerns were identified.
What you can build
Transcribe Voice Memos
Convert short voice recordings to text for note-taking and documentation
Automated Call Transcription
Transcribe short phone call recordings for analysis and records
Multilingual Content Transcription
Transcribe audio content in multiple languages using Azure language support
Try these prompts
Use the Azure Speech to Text REST API skill to transcribe the audio file at path [AUDIO_FILE_PATH] to text. Use language [LANGUAGE_CODE] (e.g., en-US).
Use the Azure Speech to Text REST API skill to transcribe [AUDIO_FILE_PATH] using detailed format to get confidence scores. Language: [LANGUAGE]. Handle errors gracefully.
Use the Azure Speech to Text REST API skill to transcribe [AUDIO_FILE_PATH] asynchronously. Show how to implement the async version with aiohttp for better performance.
Use the Azure Speech to Text REST API skill to write a transcription function that handles all RecognitionStatus values (Success, NoMatch, InitialSilenceTimeout, BabbleTimeout, Error) with appropriate responses.
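As a rough illustration of the last prompt, the sketch below maps each `RecognitionStatus` value to an outcome. The status names come from the prompt itself; the function name `interpret_response` and the message wording are hypothetical, and the fallback to `NBest` is an assumption for bodies that carry no top-level `DisplayText`.

```python
import json
from typing import Tuple

# Messages for the non-success statuses named in the prompt above.
STATUS_MESSAGES = {
    "NoMatch": "Speech was detected, but no words were matched.",
    "InitialSilenceTimeout": "The audio started with only silence.",
    "BabbleTimeout": "The audio started with only noise.",
    "Error": "The recognition service reported an internal error.",
}


def interpret_response(body: str) -> Tuple[bool, str]:
    """Return (ok, text_or_message) for a recognition response body."""
    data = json.loads(body)
    status = data.get("RecognitionStatus", "")
    if status == "Success":
        text = data.get("DisplayText")
        if text is None and data.get("NBest"):
            # Fall back to the first hypothesis when only NBest is present.
            text = data["NBest"][0].get("Display", "")
        return True, text or ""
    # Unknown statuses are reported rather than silently ignored.
    message = STATUS_MESSAGES.get(status, f"Unrecognized status: {status!r}")
    return False, message
```

Returning a pair rather than raising keeps the caller in control of how each outcome is surfaced, for example by retrying on `Error` but not on `NoMatch`.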
Best practices
- Use WAV PCM format at 16kHz mono for best recognition accuracy
- Cache bearer tokens for 9 minutes to avoid repeated authentication
- Enable chunked transfer encoding for lower latency on larger files
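A minimal sketch of how these practices might translate into a request, under stated assumptions. The endpoint path and the `Content-Type` value follow the service's documented short-audio API; the helper names `build_request` and `iter_chunks` and the 4096-byte chunk size are hypothetical. Authentication headers are deliberately left out.

```python
import io
import urllib.parse
from typing import BinaryIO, Dict, Iterator, Tuple


def build_request(
    region: str, language: str = "en-US", response_format: str = "detailed"
) -> Tuple[str, Dict[str, str]]:
    """Build the recognition URL and format headers for 16 kHz PCM WAV."""
    query = urllib.parse.urlencode(
        {"language": language, "format": response_format}
    )
    url = (
        f"https://{region}.stt.speech.microsoft.com"
        f"/speech/recognition/conversation/cognitiveservices/v1?{query}"
    )
    headers = {
        "Content-Type": "audio/wav; codecs=audio/pcm; samplerate=16000",
        "Accept": "application/json",
        # "Transfer-Encoding: chunked" is normally added by the HTTP client
        # when the body is streamed, so it is not set by hand here.
    }
    return url, headers


def iter_chunks(stream: BinaryIO, chunk_size: int = 4096) -> Iterator[bytes]:
    """Yield the audio in fixed-size pieces so the body can be streamed."""
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            return
        yield chunk


# Demo with in-memory bytes standing in for an audio file.
url, headers = build_request("westus")
sizes = [len(c) for c in iter_chunks(io.BytesIO(b"\x00" * 10000))]
print(url)
print(sizes)
```

Many HTTP clients switch to chunked transfer encoding when given a generator such as `iter_chunks(open(path, "rb"))` as the request body, which is why the generator form is used here.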
Avoid
- Do not send audio files longer than 60 seconds - use the Batch Transcription API instead
- Do not use this for real-time streaming - use the Speech SDK's streaming support
- Do not hardcode API keys in source code - use environment variables
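The checks above could be enforced before any request is sent, along the lines of this sketch. It uses the standard-library `wave` module to confirm 16 kHz mono audio and the 60-second limit. The function names and the environment variable names `AZURE_SPEECH_KEY` and `AZURE_SPEECH_REGION` are assumptions chosen for illustration, not names the skill prescribes.

```python
import os
import wave
from typing import Mapping, Tuple

MAX_SECONDS = 60  # limit stated for the short-audio REST API


def check_wav(source) -> float:
    """Return the duration in seconds, or raise ValueError if unsuitable.

    `source` may be a file path or a binary file-like object.
    """
    with wave.open(source, "rb") as wav:
        channels = wav.getnchannels()
        rate = wav.getframerate()
        duration = wav.getnframes() / float(rate)
    if channels != 1 or rate != 16000:
        raise ValueError(
            f"expected 16 kHz mono, got {rate} Hz with {channels} channel(s)"
        )
    if duration > MAX_SECONDS:
        raise ValueError(
            f"audio is {duration:.1f}s; the limit is {MAX_SECONDS}s"
        )
    return duration


def load_credentials(env: Mapping[str, str] = os.environ) -> Tuple[str, str]:
    """Read the key and region from the environment (hypothetical names)."""
    try:
        return env["AZURE_SPEECH_KEY"], env["AZURE_SPEECH_REGION"]
    except KeyError as missing:
        raise RuntimeError(f"missing environment variable: {missing}") from None
```

Failing early on an oversized or misformatted file avoids spending a request on audio that should go through batch transcription instead.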
Frequently asked questions
What audio formats does Azure Speech to Text REST API support?
Do I need to install the Azure Speech SDK to use this skill?
How do I get Azure Speech to Text credentials?
What is the difference between simple and detailed response format?
Can I transcribe audio in languages other than English?
How do I handle authentication errors?
Developer details
Author
sickn33
License
MIT
Repository
https://github.com/sickn33/antigravity-awesome-skills/tree/main/skills/azure-speech-to-text-rest-py
Ref
main
File structure
📄 SKILL.md