Back to all skills
๐Ÿ“ธ
Web & Browser

Screenshot & OCR

Capture a web page or a screen region and extract its text as structured data.

4.5rating
11,700 installs
screenshot-ocr + ocr-and-documents
Pro+ required

About this skill

Capture a full web page, a region, or a running app window, then run OCR tuned for on-screen text. Returns clean text or structured JSON for common layouts (tables, forms, invoices, receipts). Handles retina screenshots, dark mode, and non-Latin scripts. Use it on anything where copy-paste isn't available.

What it does

  • Full-page, region, or app-window capture
  • OCR tuned for on-screen text
  • Structured output for tables and forms
  • Retina and dark-mode support
  • Non-Latin script support

Use cases

  • Capture a web table that blocks copy-paste into a CSV
  • Extract text from a legacy app window for analysis
  • Build a searchable archive from a folder of screenshots