Name: ocr
Author: Bae-ChangHyun

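Safe 📁 File system access

Convert PDFs and images to Markdown

Extracting text from PDF documents and images usually takes several steps and tools. This skill uses Claude's vision capability to convert scanned documents and images into clean Markdown while preserving the document's structure.

Supported on: Claude, Codex, Claude Code (CC)

📊 70 Adequate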

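Download the skill ZIP

Upload it in Claude

Go to Settings → Capabilities → Skills → Upload skill

Open it and start using it

Try it

Using "ocr". Convert this scanned quarterly report to Markdown

Expected result: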

OCR processing complete for quarterly_report.pdf
Output: quarterly_report.md (12 pages)
Content types detected: Tables, headings, bullet lists, images
Files saved to: /path/to/quarterly_report.md

Using "ocr". Extract only the code snippets from this screenshot

Expected result:

Extracted 3 code blocks from screenshot.png
Output: screenshot.md
Languages detected: Python, JavaScript, SQL

Using "ocr". Batch process these 50 document scans

Expected result:

Processed 47 of 50 files successfully
3 files skipped (size limits exceeded)
Output files saved to original folder

Security audit

Safe

v5 • 1/16/2026

Legitimate OCR document processing skill using Claude vision. Static findings are false positives triggered by benign patterns in documentation (file paths, directory names, shell examples). The skill only uses system Read/Write tools, makes no network calls, and uses Task agents for context isolation. No malicious intent detected.

Files scanned

542

Lines analyzed

Findings

Total audits

Risk factors

📁 File system access (4)

SKILL.md:86-88 SKILL.md:121-124 SKILL.md:148-154 SKILL.md:234-237

Audited by: claude

Quality score

Architecture

100

Maintainability

Content

Community

100

Security

Spec compliance

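What you can build

Convert research papers

Extract text from academic papers for citation and analysis work.

Digitize documents

Convert scanned contracts and forms into searchable text files.

Extract text from images

Extract text from screenshots for use in documentation and blog posts.

Try these prompts

Basic PDF conversion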

/ocr /path/to/document.pdf

Extract tables only

/ocr /path/to/report.pdf "Extract only the tables"

Batch image processing

/ocr /path/to/images/folder/

Page-by-page output

/ocr /path/to/long-document.pdf

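Best practices

Use clear custom instructions to focus extraction on specific content
For large documents, verify that complex tables render correctly
Keep the original files as a reference when processing complex layouts

Avoid

Do not assume OCR preserves complex formatting perfectly
Do not process sensitive documents before verifying output accuracy
Do not use on password-protected or encrypted PDF files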

FAQ

What file sizes are supported?

Files must fit within Claude's vision limits. Large PDFs may trigger a 413 error (HTTP "Content Too Large").
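Since oversized files are the failure mode named here, a pre-flight size check can flag them before a batch run. The following is a minimal sketch, not part of the skill itself: the function name `partition_by_size` and the threshold `ASSUMED_MAX_BYTES` are hypothetical, and the listing does not state the actual limit, so the value should be replaced with whatever limit applies to your setup.

```python
from pathlib import Path

# Hypothetical threshold: the listing does not state the real limit.
ASSUMED_MAX_BYTES = 30 * 1024 * 1024


def partition_by_size(paths, max_bytes=ASSUMED_MAX_BYTES):
    """Split paths into (within_limit, too_large) by on-disk size."""
    within_limit, too_large = [], []
    for path in map(Path, paths):
        if path.stat().st_size <= max_bytes:
            within_limit.append(path)
        else:
            too_large.append(path)
    return within_limit, too_large
```

Files in the second list could be split or compressed before being passed to `/ocr`, rather than being skipped silently.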

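Does this work on scanned PDFs?

Yes. Scanned PDFs that contain page images are processed with Claude's vision capability.

Can it extract handwritten text?

Handwriting recognition accuracy varies. Simple, clear handwriting works best.

How are complex tables handled?

Tables are converted to Markdown format. Very complex layouts may need manual review.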
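To make the target format concrete, here is a small sketch of what a Markdown (pipe) table looks like when built from rows of cells. It is an illustration only: `to_markdown_table` is a hypothetical helper, not something the skill exposes, and the skill produces its tables through Claude's vision rather than through code like this.

```python
def to_markdown_table(header, rows):
    """Render a header and rows of cells as a Markdown pipe table."""

    def fmt(cells):
        # Escape literal pipes so they do not split a cell.
        escaped = (str(c).replace("|", "\\|") for c in cells)
        return "| " + " | ".join(escaped) + " |"

    lines = [fmt(header), "| " + " | ".join("---" for _ in header) + " |"]
    lines.extend(fmt(row) for row in rows)
    return "\n".join(lines)


print(to_markdown_table(["Quarter", "Revenue"], [["Q1", 10], ["Q2", 12]]))
```

Merged cells, nested headers, and multi-line cells have no direct equivalent in this format, which is one reason very complex layouts call for manual review.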

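Is my data sent anywhere external?

Processing uses only Claude's vision capability. No data is sent to third-party services.

What is the difference between unified and page-by-page output?

Unified mode merges all pages into a single file. Page-by-page mode creates a separate file for each page.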
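The two modes can be pictured with a short sketch. This is an assumption-laden illustration rather than the skill's implementation: the function names and the `_page_001.md` naming scheme are hypothetical, and `pages` stands for a list of per-page Markdown strings that have already been extracted.

```python
from pathlib import Path


def write_unified(pages, out_path):
    """Unified mode: join every page into one Markdown file."""
    out = Path(out_path)
    out.write_text("\n\n".join(pages), encoding="utf-8")
    return [out]


def write_per_page(pages, out_dir, stem):
    """Page-by-page mode: one Markdown file per page."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for number, page in enumerate(pages, start=1):
        # Hypothetical naming scheme, e.g. report_page_001.md
        target = out_dir / f"{stem}_page_{number:03d}.md"
        target.write_text(page, encoding="utf-8")
        written.append(target)
    return written
```

A single file is easier to search and share, while separate files make it simpler to re-check or re-run one problematic page of a long document.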

Developer details

Author

Bae-ChangHyun

License

MIT

Repository

https://github.com/Bae-ChangHyun/cc-plugins-bch/tree/main/plugins/utils/skills/ocr

Ref

main

File structure

📄 SKILL.md