← Back to Skills Marketplace
test
by
Jarrett817
· GitHub ↗
· v2.0.0
· MIT-0
90
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install test-11
Description
Guide for creating high-quality MCP (Model Context Protocol) servers that enable LLMs to interact with external services through well-designed tools. Use whe...
Usage Guidance
This skill includes working code that will call an external LLM (Anthropic/Claude) and an evaluation harness that instructs the agent to print detailed summaries of tool inputs and outputs. Yet the skill metadata declares no required environment variables or API keys. Before installing or enabling this skill:
- Treat it as suspicious until you confirm what credentials it needs. The evaluation script likely requires an Anthropic API key (or similar) to function; do not provide secret keys unless you audit the code and accept the risk.
- Audit the scripts (evaluation.py and connections.py) yourself: look for where network calls are made and where data is logged or returned. The EVALUATION_PROMPT explicitly asks for tool inputs/outputs and summaries: that can expose secrets returned by tools.
- If you plan to run evaluations, only run them against test/read-only environments containing non-sensitive data, and never point the harness at production systems or supply real account tokens.
- Consider asking the skill author to: (1) declare required env vars (e.g., ANTHROPIC_API_KEY) in the metadata, (2) remove or make optional any prompts that require verbatim tool input/output logging, and (3) limit what is printed in reports to non-sensitive metadata.
If you cannot verify these changes or audit the code yourself, avoid enabling the skill with real credentials or running it against live services.
Capability Analysis
Type: OpenClaw Skill
Name: test-11
Version: 2.0.0
The skill bundle is a comprehensive toolkit for developing and evaluating Model Context Protocol (MCP) servers. It includes detailed markdown guides (SKILL.md and reference files) and Python scripts (scripts/connections.py, scripts/evaluation.py) for connecting to and testing MCP servers using the Anthropic API. The code utilizes the official 'mcp' and 'anthropic' libraries, following standard patterns for MCP client/server interaction. No malicious behavior, such as data exfiltration or unauthorized access, was detected. The capability to execute local commands via the stdio transport is a core feature of the protocol and is appropriately exposed through user-controlled CLI arguments in the evaluation script. The instructions provided to the AI agent are well-aligned with the stated purpose of building and testing high-quality integrations.
Capability Assessment
Purpose & Capability
Name/description present a developer guide for MCP servers — that matches the included documentation and helper code. However, the shipped runtime code (scripts/evaluation.py) instantiates an Anthropic client and calls remote LLM APIs, and scripts/requirements.txt lists 'anthropic' and 'mcp'. The SKILL metadata declares no required env vars or primary credential despite the code clearly needing API credentials to call an LLM service. This is an incoherence: a documentation/guide skill would not normally embed direct runtime code that requires provider credentials without declaring them.
Instruction Scope
The SKILL.md and embedded evaluation harness (EVALUATION_PROMPT and evaluation.py) instruct the agent to (a) call tools and include detailed summaries of each step, inputs provided, and outputs received, and (b) include feedback and final responses wrapped in XML-like tags. Requiring the assistant to verbatim report tool inputs/outputs increases the risk of leaking any sensitive data returned by tools. The evaluation/reference docs also say 'At NO stage should you READ the code of the MCP server implementation itself', which conflicts with the fact the package includes implementation code and an evaluation harness – ambiguous scope. Overall, the instructions are broader than a passive guide and could lead to exfiltration of sensitive data if used against real services.
Install Mechanism
There is no install spec (instruction-only install), and all files are shipped in the skill bundle. No external installer or download-from-URL steps are present, so there is no high-risk install mechanism. requirements.txt lists Python deps (anthropic, mcp) which is expected for the provided scripts.
Credentials
The code uses Anthropic() (Anthropic SDK) which typically requires an API key (e.g., ANTHROPIC_API_KEY) or similar credential, but requires.env/primaryEnv are empty — the skill does not declare required credentials. That omission is a meaningful mismatch. Additionally, scripts/connections.py supports passing environment variables into subprocess stdio connections; the evaluation flow collects and prints tool inputs/outputs — without declared environment constraints this could enable secrets to be passed into tool calls and then captured in reports.
Persistence & Privilege
The skill does not request always:true, does not declare system-wide config modifications, and is user-invocable. It does allow the agent to call the included code autonomously (default), which is normal. No unusual persistence or cross-skill config edits are present.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install test-11 - After installation, invoke the skill by name or use
/test-11 - Provide required inputs per the skill's parameter spec and get structured output
Version History
v2.0.0
test-11 1.0.0
- Initial release.
- Provides a comprehensive guide for building high-quality MCP servers.
- Covers research, implementation, testing, and evaluation phases.
- Includes documentation links and best practices for both TypeScript and Python SDKs.
v1.0.0
Initial release of MCP server builder skill
- Provides comprehensive step-by-step guide for planning, implementing, reviewing, and evaluating MCP (Model Context Protocol) servers.
- Includes references for both TypeScript and Python MCP server development with practical setup and pattern examples.
- Features actionable best practices for tool design, schema definitions, error handling, and response formatting.
- Supplies evaluation guidelines and an XML template for crafting complex, verifiable test questions.
- Bundles key reference documents and best practice guides for rapid access during development.
Metadata
Frequently Asked Questions
What is test?
Guide for creating high-quality MCP (Model Context Protocol) servers that enable LLMs to interact with external services through well-designed tools. Use whe... It is an AI Agent Skill for Claude Code / OpenClaw, with 90 downloads so far.
How do I install test?
Run "/install test-11" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is test free?
Yes, test is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does test support?
test is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created test?
It is built and maintained by Jarrett817 (@jarrett817); the current version is v2.0.0.
More Skills