返回顶部
s

screen-vision

macOS screen OCR & click automation via Apple Vision + ScreenCaptureKit. Capture any window or screen region, extract text with coordinates, find text, and click on it — all from the terminal.

作者: admin | 来源: ClawHub
源自
ClawHub
版本
V 1.2.0
安全检测
已通过
380
下载量
0
收藏
概述
安装方式
版本历史

screen-vision

# screen-vision macOS screen OCR & click automation CLI powered by Apple Vision + ScreenCaptureKit. ## IMPORTANT: Auto-Setup (run first) Before running any screen-vision command, check if the binary exists. If not, run the setup script: ```bash command -v screen-vision &>/dev/null || bash "${CLAUDE_SKILL_DIR}/setup.sh" ``` This installs `screen-vision` (via Homebrew or source build) and `cliclick` automatically. ## Requirements - macOS 14.0+ (Sonoma) - Screen Recording permission (System Settings > Privacy & Security > Screen Recording) ## Commands | Command | Description | Output | |---------|-------------|--------| | `screen-vision ocr [--app NAME]` | Full OCR | JSON array `[{text, x, y, w, h, confidence}]` | | `screen-vision list [--app NAME]` | OCR list | Human-readable text with coordinates | | `screen-vision find "text" [--app NAME]` | Find text | JSON `{text, x, y, found}` | | `screen-vision has "text" [--app NAME]` | Check text exists | Exit code 0 (found) / 1 (not found) | | `screen-vision tap "text" [--app NAME] [--retry N]` | Find + click | JSON `{text, x, y, tapped}` | | `screen-vision wait "text" [--timeout SEC]` | Poll until text appears | JSON `{text, x, y, found}` | ## Capture Priority ``` --region x,y,w,h > --app "AppName" > full screen (default) ``` ## Usage Patterns ### OCR a specific app window ```bash screen-vision list --app "Safari" ``` ### Check if text is visible (for conditionals) ```bash screen-vision has "Submit" --app "MyApp" && echo "Found" || echo "Not found" ``` ### Click on text with retry ```bash screen-vision tap "OK" --app "MyApp" --retry 3 ``` ### Wait for text to appear (e.g. loading complete) ```bash screen-vision wait "Complete" --timeout 30 ``` ### Full screen OCR as JSON (pipe to jq) ```bash screen-vision ocr | jq '.[].text' ``` ## $ARGUMENTS Handling Parse the user's request to determine which command to run: - "화면에 뭐 있어?" / "what's on screen?" → `screen-vision list` - "~찾아" / "find ~" → `screen-vision find "text"` - "~클릭해" / "click ~" → `screen-vision tap "text"` - "~보여?" / "is ~ visible?" → `screen-vision has "text"` - "~뜰 때까지 기다려" / "wait for ~" → `screen-vision wait "text"`

标签

skill ai

通过对话安装

该技能支持在以下平台通过对话安装:

OpenClaw WorkBuddy QClaw Kimi Claude

方式一:安装 SkillHub 和技能

帮我安装 SkillHub 和 screen-vision-1776124696 技能

方式二:设置 SkillHub 为优先技能安装源

设置 SkillHub 为我的优先技能安装源,然后帮我安装 screen-vision-1776124696 技能

通过命令行安装

skillhub install screen-vision-1776124696

下载 Zip 包

⬇ 下载 screen-vision v1.2.0

文件大小: 2.84 KB | 发布时间: 2026-4-14 10:36

v1.2.0 最新 2026-4-14 10:36
setup.sh now downloads pre-built binary on Apple Silicon (no Xcode/brew required)

Archiver·手机版·闲社网·闲社论坛·羊毛社区· 多链控股集团有限公司 · 苏ICP备2025199260号-1

Powered by Discuz! X5.0   © 2024-2025 闲社网·线报更新论坛·羊毛分享社区·http://xianshe.com

p2p_official_large
返回顶部