← Back to Skills Marketplace
okradze

Crawl By Desearch

by okradze · GitHub ↗ · v1.0.1
cross-platform ✓ Security Clean
877
Downloads
1
Stars
4
Active Installs
2
Versions
Install in OpenClaw
/install desearch-crawl
Description
Crawl/scrape and extract content from any webpage URL. Returns the page content as clean text or raw HTML. Use this when you need to read the full contents o...
README (SKILL.md)

Crawl Webpage By Desearch

Extract content from any webpage URL. Returns clean text or raw HTML.

Quick Start

  1. Get an API key from https://console.desearch.ai
  2. Set environment variable: export DESEARCH_API_KEY='your-key-here'

Usage

# Crawl a webpage (returns clean text by default)
scripts/desearch.py crawl "https://en.wikipedia.org/wiki/Artificial_intelligence"

# Get raw HTML
scripts/desearch.py crawl "https://example.com" --crawl-format html

Options

Option Description
--crawl-format Output content format: text (default) or html

Examples

Read a documentation page

scripts/desearch.py crawl "https://docs.python.org/3/tutorial/index.html"

Get raw HTML for analysis

scripts/desearch.py crawl "https://example.com/page" --crawl-format html

Response

Example (format=text, truncated, default)

Artificial intelligence (AI) is the capability of computational systems to perform tasks that typically require human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making...

Example (format=html, truncated)

\x3C!DOCTYPE html>
\x3Chtml>
  \x3Chead>\x3Ctitle>Artificial intelligence - Wikipedia\x3C/title>\x3C/head>
  \x3Cbody>
    \x3Cp>Artificial intelligence (AI) is the capability of computational systems...\x3C/p>
  \x3C/body>
\x3C/html>

Notes

  • Response is plain text or raw HTML — not JSON.
  • Default format is text. Use --crawl-format html only when you need to inspect page structure.
  • Prefer text format to avoid bloating the agent context with markup.

Errors

Status 401, Unauthorized (e.g., missing/invalid API key)

{
  "detail": "Invalid or missing API key"
}

Status 402, Payment Required (e.g., balance depleted)

{
  "detail": "Insufficient balance, please add funds to your account to continue using the service."
}

Resources

Usage Guidance
This skill is a thin client for the Desearch API and appears coherent. Before installing, confirm you trust desearch.ai and that you are comfortable sending target URLs (and their contents) to that external service. Treat the DESEARCH_API_KEY like any API secret: use least-privilege keys if supported, rotate keys periodically, and avoid using a key that has broader account permissions than necessary. Note the SKILL.md says the response is plain text/HTML but the script may return JSON objects from the API and pretty-print them — this is informational only. If you need offline/local crawling or want guarantees about sensitive content, do not send private pages to a third-party API.
Capability Analysis
Type: OpenClaw Skill Name: desearch-crawl Version: 1.0.1 The skill bundle is benign. The `SKILL.md` provides clear, non-malicious instructions for using a web crawling service, including setting an API key. The `scripts/desearch.py` script acts as a client for the `api.desearch.ai` service, securely retrieving the `DESEARCH_API_KEY` from environment variables and directing all network traffic to the legitimate Desearch API endpoint. There is no evidence of data exfiltration, malicious execution, persistence mechanisms, or prompt injection attempts against the agent.
Capability Assessment
Purpose & Capability
Name/description (crawl/scrape pages) match the included CLI script which calls https://api.desearch.ai/web/crawl. The only required secret is DESEARCH_API_KEY, which is appropriate for a hosted crawl API.
Instruction Scope
SKILL.md instructs the agent to call the Desearch API and set DESEARCH_API_KEY; the included script does exactly that. Minor inconsistency: SKILL.md states responses are plain text or raw HTML (not JSON), but the script will pretty-print any JSON object returned by the API. This is a benign mismatch in how results are presented.
Install Mechanism
There is no install step; the skill is instruction-only with a small included Python script that uses only the standard library (urllib). No downloads or archive extraction are performed.
Credentials
Only one environment variable (DESEARCH_API_KEY) is required and is directly used to authorize requests to the stated API. No unrelated secrets, config paths, or excessive permissions are requested.
Persistence & Privilege
The skill does not request always:true and does not modify other skills or system-wide settings. Default autonomous invocation is allowed (platform default) but not combined with other concerning privileges.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install desearch-crawl
  3. After installation, invoke the skill by name or use /desearch-crawl
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
- Expanded documentation with new "Quick Start", "Response", and "Errors" sections. - Added example outputs for both text and HTML formats. - Clarified that the response is plain text or raw HTML, not JSON. - Included error response examples for common status codes. - Added links to the API reference and Desearch Console for further resources.
v1.0.0
desearch-crawl 1.0.0 initial release - Introduces the ability to crawl or scrape any webpage URL. - Returns webpage content as either clean text (default) or raw HTML. - Simple command-line interface and usage documented. - Requires a DESEARCH_API_KEY environment variable for authentication.
Metadata
Slug desearch-crawl
Version 1.0.1
License
All-time Installs 5
Active Installs 4
Total Versions 2
Frequently Asked Questions

What is Crawl By Desearch?

Crawl/scrape and extract content from any webpage URL. Returns the page content as clean text or raw HTML. Use this when you need to read the full contents o... It is an AI Agent Skill for Claude Code / OpenClaw, with 877 downloads so far.

How do I install Crawl By Desearch?

Run "/install desearch-crawl" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Crawl By Desearch free?

Yes, Crawl By Desearch is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Crawl By Desearch support?

Crawl By Desearch is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Crawl By Desearch?

It is built and maintained by okradze (@okradze); the current version is v1.0.1.

💬 Comments