← 返回 Skills 市场
durtydhiana

Airpoint 1.3.16

作者 DurtyDhiana · GitHub ↗ · v1.0.0 · MIT-0
darwin ⚠ suspicious
226
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install airpoint-1-3-16
功能描述
Control a Mac through natural language — open apps, click buttons, read the screen, type text, manage windows, and automate multi-step tasks via Airpoint's A...
使用说明 (SKILL.md)

Airpoint — AI Computer Use for macOS

Airpoint gives you an AI agent that can see and control a Mac — open apps, click UI elements, read on-screen text, type, scroll, drag, and manage windows. You give it a natural-language instruction and it carries out the task autonomously by perceiving the screen (accessibility tree + screenshots + visual locator), planning actions, executing them, and verifying the result.

Everything runs through the airpoint CLI.

Requirements

  • macOS (Apple Silicon or Intel)
  • Airpoint app — must be running. Download from airpoint.app.
  • Airpoint CLI — the airpoint command must be on PATH. Install it from the Airpoint app: Settings → Plugins → Install CLI.

Setup

Before using Airpoint's AI agent, the user must configure it in the Airpoint app (Settings → Assistant):

  1. AI model API key (required). Set an API key for the chosen provider:
    • OpenAI (recommended): model gpt-5.1 with reasoning effort low gives the best balance of cost, speed, and quality.
    • Anthropic and Google Gemini are also supported.
  2. Gemini API key (recommended). Even when using OpenAI or Anthropic as the primary model, a Google Gemini API key enables the visual locator — a secondary model (gemini-3-flash-preview) that finds UI targets on screen by analyzing screenshots. Without it, the agent relies on the accessibility tree only.
  3. macOS permissions. The app prompts on first launch, but verify these are granted in System Settings → Privacy & Security:
    • Accessibility — required for mouse/keyboard control.
    • Screen Recording — required for screenshots and screen perception.
    • Camera is only needed for hand tracking (not for the AI agent).
  4. Custom instructions (optional). In Settings → Assistant, add custom instructions to tailor the agent's behavior (e.g., preferred language, apps to avoid, workflows to follow).

If the user reports that airpoint ask fails or the agent can't see the screen, ask them to verify steps 1–3 above.

How to use

  1. Run airpoint ask "\x3Cyour instruction>" to send a task to the on-device agent.
  2. The command blocks until the agent finishes (up to 5 minutes) and returns:
    • A text summary of what the agent did and the result.
    • One or more screenshot file paths showing the screen state after the task.
  3. Read the text output to confirm whether the task succeeded.
  4. If screenshots were returned, show the last screenshot to the user as visual confirmation of the result.
  5. If something went wrong or the task is stuck, run airpoint stop to cancel.

Example flow:

> airpoint ask "open Safari and search for 'OpenClaw'"
Opened Safari, typed 'OpenClaw' into the address bar, and pressed Enter.
The search results page is now displayed.

1 screenshot(s) saved to session abc123
  └ screenshots/step_3.png (/Users/you/Library/Application Support/com.medhuelabs.airpoint/sessions/abc123/screenshots/step_3.png)

After receiving this, show the screenshot to the user so they can see what happened.

Commands

Ask the AI agent to do something (primary command)

This is the most important command. It sends a natural-language task to Airpoint's built-in computer-use agent which can see the screen, move the mouse, click, type, scroll, open apps via Spotlight, manage windows, and verify its own actions.

# Synchronous — waits for the agent to finish (up to 5 min) and returns output
airpoint ask "open Safari and go to github.com"
airpoint ask "what's on my screen right now?"
airpoint ask "find the Slack notification and read it"
airpoint ask "open System Settings and enable Dark Mode"
airpoint ask "open Mail, find the latest email from John, and summarize it"

# Fire-and-forget — returns immediately
airpoint ask "open Spotify and play my liked songs" --no-wait

# Show the assistant panel on screen while running
airpoint ask "open System Settings and enable Dark Mode" --show-panel

Stop a running task

airpoint stop

Cancels the currently running assistant task. Use this if a task is stuck or taking too long.

Capture a screenshot

airpoint see

Returns a screenshot of the current display. Useful for verifying state before or after issuing an ask command.

Check status

airpoint status
airpoint status --json

Returns app version and current state (tracking active, etc.).

Hand tracking (secondary)

Airpoint also supports hands-free cursor control via camera-based hand tracking. These commands start/stop that feature:

airpoint tracking on
airpoint tracking off
airpoint tracking        # show current state

Read or change settings

airpoint settings list             # all current settings
airpoint settings list --json      # machine-readable
airpoint settings get cursor.sensitivity
airpoint settings set cursor.sensitivity 1.5

Common settings: cursor.sensitivity (default 1.0), cursor.acceleration (default true), scroll.sensitivity (default 1.0), scroll.inertia (default true).

System vitals

airpoint vitals          # CPU, RAM, temperature
airpoint vitals --json

Launch the app

airpoint open            # opens/focuses the Airpoint macOS app

Tips

  • Use airpoint ask for almost everything. The agent can read the screen, interact with any app, and chain multi-step workflows autonomously.
  • Always use --json when you need to parse output programmatically.
  • The agent can answer questions about what's on screen ("what app is in the foreground?", "read the error message in this dialog").
  • Airpoint is a notarized, code-signed macOS app. Download it from airpoint.app.
安全使用建议
This skill is an instruction shim for the Airpoint macOS app and appears coherent, but you should still exercise standard caution before enabling control of your screen and input. Verify you obtained the Airpoint app/CLI from the official site (https://airpoint.app) and that the 'airpoint' binary is the signed, expected executable. Be aware the app requires Accessibility and Screen Recording permissions — granting those lets the app (and the agent it runs) see and control your UI. The app also asks you to supply an LLM API key inside its settings; consider using a limited or replaceable key and monitor/rotate it if you stop using the app. If you need higher assurance, inspect the installed CLI binary (code signature, checksums) or run it in a controlled account before granting broad access.
功能分析
Type: OpenClaw Skill Name: airpoint-1-3-16 Version: 1.0.0 The skill bundle describes a high-privilege macOS automation tool that requires Accessibility and Screen Recording permissions to control the user's computer. Several indicators suggest the documentation is untrustworthy or deceptive: it references non-existent AI models (e.g., 'gpt-5.1' and 'gemini-3-flash-preview') in SKILL.md, and the _meta.json file contains a future-dated publication timestamp (March 2026). While there is no explicit code for data exfiltration in the provided files, the use of fake technical specifications to encourage the installation of a third-party binary (airpoint.app) and the inputting of API keys is a significant red flag.
能力评估
Purpose & Capability
Name/description (control a Mac via Airpoint) align with declared requirements: macOS and the 'airpoint' CLI. No unrelated binaries, env vars, or config paths are requested.
Instruction Scope
SKILL.md directs the agent to run the 'airpoint' CLI commands (ask, see, status, stop, settings, etc.) and to rely on the Airpoint app for model API keys and permissions. It does not instruct reading unrelated files, exporting secrets, or contacting unexpected endpoints. It appropriately warns about macOS accessibility and screen-recording permissions.
Install Mechanism
Instruction-only skill with no install spec or bundled code. This is the lowest-risk install model; the skill assumes the user installs the official Airpoint CLI separately via the app.
Credentials
No environment variables or credentials are required by the skill. The SKILL.md does advise placing an AI model API key inside the Airpoint app (expected for a product that uses LLMs/vision models). No unrelated secrets are requested.
Persistence & Privilege
Skill is not 'always:true' and is user-invocable only. The skill does not request to modify other skills or system-wide settings. It relies on the existing Airpoint app which itself requires accessibility/screen-recording privileges (documented).
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install airpoint-1-3-16
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /airpoint-1-3-16 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
- Initial release of Airpoint skill for macOS. - Control your Mac using natural-language instructions via the `airpoint` CLI. - Supports opening apps, clicking UI elements, reading on-screen text, typing, managing windows, and automating multi-step tasks. - Requires Airpoint app and CLI; supports integration with OpenAI, Gemini, and Anthropic models. - Includes screenshot feedback, hands-free cursor control, and customizable instructions. - Provides commands for assistant interaction, hand tracking, status, settings, and system vitals.
元数据
Slug airpoint-1-3-16
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Airpoint 1.3.16 是什么?

Control a Mac through natural language — open apps, click buttons, read the screen, type text, manage windows, and automate multi-step tasks via Airpoint's A... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 226 次。

如何安装 Airpoint 1.3.16?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install airpoint-1-3-16」即可一键安装,无需额外配置。

Airpoint 1.3.16 是免费的吗?

是的,Airpoint 1.3.16 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Airpoint 1.3.16 支持哪些平台?

Airpoint 1.3.16 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(darwin)。

谁开发了 Airpoint 1.3.16?

由 DurtyDhiana(@durtydhiana)开发并维护,当前版本 v1.0.0。

💬 留言讨论