← 返回 Skills 市场
haoruilee

Evaluate Agent-Native

作者 0x4C33 · GitHub ↗ · v1.4.0 · MIT-0
cross-platform ⚠ suspicious
277
总下载
0
收藏
0
当前安装
5
版本数
在 OpenClaw 中安装
/install evaluate-agent-native
功能描述
Evaluate whether a service qualifies as "agent-native" using the five hard criteria from the awesome-agent-native-services standard. Also checks for the bonu...
使用说明 (SKILL.md)

Skill: evaluate-agent-native

Use this skill to rigorously evaluate whether a service qualifies as "agent-native" according to the five-criterion standard. Also check for the URL Onboarding signal — the strongest indicator that a service was truly built for agents from day one.

The gold standard: URL Onboarding

Before applying the five criteria, ask the highest-level question:

Can an agent join and start using this service by reading a single URL?

Services that answer YES are exhibiting the strongest possible form of agent-nativeness. They have internalized the agent as first-class user so deeply that the onboarding flow itself is machine-readable:

# The full agent onboarding in one instruction:
Read \x3Curl> and follow the instructions.

Examples:

  • Moltbook: Read https://www.moltbook.com/skill.md — complete registration, heartbeat, posting, DM protocol
  • Ensue / autoresearch@home: Read https://raw.githubusercontent.com/mutable-state-inc/autoresearch-at-home/master/collab.md — complete swarm joining, claiming, publishing protocol

This is qualitatively different from:

  • An SDK that a human developer installs (requires human coding time)
  • An MCP server that a human adds to a config file (requires human config edit)
  • A REST API that requires API key setup (requires human account creation)

URL Onboarding means the agent itself handles all of this — reading, understanding, and executing the join sequence autonomously.

Mark URL Onboarding as a strong bonus signal and highlight it prominently in the evaluation report.


When to activate

Activate when the user asks:

  • "Is [service] agent-native?"
  • "Does [service] qualify for the awesome list?"
  • "I want to add [service] — does it meet the criteria?"
  • "What's the difference between agent-native and agent-adapted?"
  • "Why isn't [service] on the list?"
  • "Does [service] have URL Onboarding?"

The five hard criteria

A service must pass all five to qualify as agent-native. Evaluate each one explicitly.

Criterion 1 — Agent-First Positioning

Test: Does the official homepage or documentation explicitly identify AI agents as the primary consumer?

Evidence to look for:

  • Homepage headline naming AI agents
  • Documentation framing agents as the core user
  • Product name or tagline that only makes sense for agents

Red flags:

  • "Now with AI agent support" (agents are an add-on)
  • "Build apps, workflows, and agents" (agents are one of many outputs)

Criterion 2 — Agent-Specific Primitives

Test: Does the API expose at least one primitive with no meaningful human-facing equivalent?

Questions to ask:

  • What is the core API object? Agent inbox? KYA token? Claim? Heartbeat? Or generic inbox/token/task?
  • Would this primitive exist if agents didn't exist?
  • Is the output format optimized for LLM consumption or human reading?

Pass examples: agent inbox, KYA identity token, approval gate with context-window injection, claim_experiment(), heartbeat protocol, publish_hypothesis().

Fail examples: a REST API that sends emails (humans use it too), a webhook any server can receive.

Criterion 3 — Autonomy-Compatible Control Plane

Test: Can an agent complete a full task loop without a human clicking anything?

Questions to ask:

  • Can the agent provision its own credentials?
  • Can the agent initiate, execute, and complete the action without a human redirect?
  • Does the service provide agent-appropriate constraint mechanisms?

Criterion 4 — Machine-to-Machine Integration Surface

Test: Is the primary interface an SDK, REST API, MCP server, webhook, or machine-readable URL?

Questions to ask:

  • Can an agent use this service without a human ever opening a browser?
  • Is there a URL, SDK, REST API, or MCP server documented as the primary integration path?

Note: A service that exposes a machine-readable skill.md or protocol URL (URL Onboarding) passes this criterion with exceptional strength.

Criterion 5 — Agent Identity / Delegation Semantics

Test: Does the service distinguish (a) agent's own identity, (b) delegated user permissions, (c) audit trail?


Bonus signals (check all that apply)

Signal Weight Evidence to look for
URL Onboarding ⭐⭐⭐ Highest Service hosts a machine-readable skill.md / protocol doc an agent reads and follows to self-register
Dedicated agent identity model High Agent gets its own credential/wallet/token
MCP server published Medium Official MCP server with documented tools
Agent Skills (SKILL.md) published Medium npx skills add org/repo works
Per-agent state / memory / session Medium State isolated by agent instance
Audit / trajectory artifacts Medium Machine-readable evidence of agent actions

How to test for URL Onboarding:

  1. Look for a skill.md, SKILL.md, collab.md, or similar machine-readable protocol file hosted at the service's domain or GitHub.
  2. Ask: could an agent read that URL and complete the full registration/onboarding sequence autonomously?
  3. Try the instruction: Read \x3Curl> and follow the instructions — does it work?

Classification decision tree

Does the service pass all 5 criteria?
├── YES → agent-native ✅
│   └── Does it also have URL Onboarding?
│       ├── YES → agent-native ⭐ (URL Onboarding — highest tier)
│       └── NO  → agent-native (standard)
└── NO
    ├── Originally built for humans, agent interfaces added later?
    │   └── YES → agent-adapted ⚠️
    └── Helps humans BUILD agents?
        └── YES → agent-builder ❌

Evaluation output format

## Evaluation: {Service Name}
**Website:** {url}

### URL Onboarding Check ⭐
**Has URL Onboarding:** YES / NO
**Onboarding instruction (if YES):** Read {url} and follow the instructions to {join/register/participate}
**Notes:** {what the agent gets by reading that URL}

---

### Criterion 1 — Agent-First Positioning
**Result:** PASS / FAIL / PARTIAL
**Evidence:** "{exact quote}" — {source URL}

### Criterion 2 — Agent-Specific Primitives
**Result:** PASS / FAIL / PARTIAL
**Evidence:** {primitive name and description}
**No human equivalent because:** {explanation}

### Criterion 3 — Autonomy-Compatible Control Plane
**Result:** PASS / FAIL / PARTIAL
**Evidence:** {how agents operate without human confirmation}

### Criterion 4 — Machine-to-Machine Integration Surface
**Result:** PASS / FAIL / PARTIAL
**Evidence:** {URL, SDK, API, MCP details}

### Criterion 5 — Agent Identity / Delegation Semantics
**Result:** PASS / FAIL / PARTIAL / N/A
**Evidence:** {identity model details}

---

### Bonus signals
- [ ] URL Onboarding ⭐⭐⭐ — agent joins by reading one URL
- [ ] Dedicated agent identity model
- [ ] MCP server published
- [ ] Agent Skills (SKILL.md) published
- [ ] Per-agent state/memory/session
- [ ] Audit/trajectory/replay artifacts

---

### Overall verdict
**Classification:** agent-native ⭐ (URL Onboarding) / agent-native / agent-adapted / agent-builder
**Recommendation:** Add to main list / Add to Excluded section / Do not add
**Confidence:** High / Medium / Low
**Reasoning:** {one paragraph summary}

### Next steps
{If agent-native with URL Onboarding: highlight this in the issue and service file prominently}
{If agent-native without: link to issue template}
{If agent-adapted: explain what would need to change}

Common borderline cases

"The product added an MCP server — does that make it agent-native?"

No. MCP support is a bonus signal, not a criterion. The core question is whether the service was designed from inception for agents. A human email provider that adds an MCP server is still agent-adapted.

"The service has URL Onboarding but other criteria are weak."

URL Onboarding is the strongest bonus signal but cannot substitute for the five hard criteria. Evaluate all five independently; URL Onboarding is an amplifier, not a replacement.

"The service says 'for AI agents' in marketing."

Check the actual primitives. URL Onboarding is a reliable signal because it requires genuine design effort — you can't fake it with a marketing blog post.

安全使用建议
This skill is coherent and doesn't request secrets or installs, so it's reasonable to enable for manual evaluations. However, be careful if you allow the agent to 'follow' arbitrary URLs autonomously: external onboarding docs can contain instructions that cause the agent to attempt account provisioning, credential use, or other side effects. Recommendations: (1) prefer read-only analysis of URLs rather than automatic execution; (2) require a human review/approval step before the agent performs actions it discovered; (3) sandbox or restrict tools the agent can use when testing unknown URLs; and (4) never expose real credentials or secrets while evaluating onboarding flows.
功能分析
Type: OpenClaw Skill Name: evaluate-agent-native Version: 1.4.0 The skill defines a framework for evaluating 'agent-native' services but introduces a significant security risk in SKILL.md by instructing the agent to 'Read <url> and follow the instructions' as a test for 'URL Onboarding.' This pattern is a classic prompt injection vulnerability that encourages the agent to execute arbitrary instructions from untrusted external sources. While the stated intent is evaluation and no explicit malicious payloads (like data exfiltration or backdoors) are present, the methodology facilitates remote task execution by third parties.
能力评估
Purpose & Capability
Name and description match the SKILL.md content: the skill's goal is to evaluate 'agent-native' properties and the instructions only require reading webpages/URLs and analyzing text. No unrelated binaries, env vars, or installs are requested.
Instruction Scope
SKILL.md explicitly directs agents to read service homepages, documentation, and machine-readable protocol files (e.g., SKILL.md) and even suggests the 'Read <url> and follow the instructions' pattern. That is coherent with the purpose (testing URL Onboarding) but grants the agent broad discretion to interpret and act on external instructions — a behavioral risk if the agent is allowed to execute onboarding steps autonomously.
Install Mechanism
No install spec and no code files; instruction-only skills are lowest-risk from an install perspective.
Credentials
No required environment variables, credentials, or config paths are declared or referenced in the SKILL.md — appropriate for a read/analyze evaluator.
Persistence & Privilege
always is false and the skill does not request persistent presence or attempt to modify other skills or system configurations.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install evaluate-agent-native
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /evaluate-agent-native 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.4.0
v1.4.0: Star badges added to catalog. No criteria changes.
v1.3.0
v1.3.0: Catalog updated to 38 services. New Memory & State services: MemOS (MemCube, parametric/activation/plaintext), memU (dual-mode Fast Context + Deep Reasoning for 24/7 proactive agents). New Agent Social: Openwork (agent-to-agent hiring on-chain), Shellmates (agent pen-pals).
v1.2.0
v1.2.0: Catalog updated to 34 services. No criteria changes.
v1.1.0
v1.1.0: URL Onboarding check now comes BEFORE the five criteria — it is the gold standard test. New bonus signal table with weights (URL Onboarding = Highest). Evaluation output format has a new top section: '⭐ URL Onboarding Check'. Classification decision tree updated with URL Onboarding tier.
v1.0.0
Initial release — evaluate whether a service qualifies as agent-native using the five hard criteria from the awesome-agent-native-services standard.
元数据
Slug evaluate-agent-native
版本 1.4.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 5
常见问题

Evaluate Agent-Native 是什么?

Evaluate whether a service qualifies as "agent-native" using the five hard criteria from the awesome-agent-native-services standard. Also checks for the bonu... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 277 次。

如何安装 Evaluate Agent-Native?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install evaluate-agent-native」即可一键安装,无需额外配置。

Evaluate Agent-Native 是免费的吗?

是的,Evaluate Agent-Native 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Evaluate Agent-Native 支持哪些平台?

Evaluate Agent-Native 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Evaluate Agent-Native?

由 0x4C33(@haoruilee)开发并维护,当前版本 v1.4.0。

💬 留言讨论