/install desearch-crawl
Crawl Webpage By Desearch
Extract content from any webpage URL. Returns clean text or raw HTML.
Quick Start
- Get an API key from https://console.desearch.ai
- Set environment variable:
export DESEARCH_API_KEY='your-key-here'
Usage
# Crawl a webpage (returns clean text by default)
scripts/desearch.py crawl "https://en.wikipedia.org/wiki/Artificial_intelligence"
# Get raw HTML
scripts/desearch.py crawl "https://example.com" --crawl-format html
Options
| Option | Description |
|---|---|
--crawl-format |
Output content format: text (default) or html |
Examples
Read a documentation page
scripts/desearch.py crawl "https://docs.python.org/3/tutorial/index.html"
Get raw HTML for analysis
scripts/desearch.py crawl "https://example.com/page" --crawl-format html
Response
Example (format=text, truncated, default)
Artificial intelligence (AI) is the capability of computational systems to perform tasks that typically require human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making...
Example (format=html, truncated)
\x3C!DOCTYPE html>
\x3Chtml>
\x3Chead>\x3Ctitle>Artificial intelligence - Wikipedia\x3C/title>\x3C/head>
\x3Cbody>
\x3Cp>Artificial intelligence (AI) is the capability of computational systems...\x3C/p>
\x3C/body>
\x3C/html>
Notes
- Response is plain text or raw HTML — not JSON.
- Default format is
text. Use--crawl-format htmlonly when you need to inspect page structure. - Prefer
textformat to avoid bloating the agent context with markup.
Errors
Status 401, Unauthorized (e.g., missing/invalid API key)
{
"detail": "Invalid or missing API key"
}
Status 402, Payment Required (e.g., balance depleted)
{
"detail": "Insufficient balance, please add funds to your account to continue using the service."
}
Resources
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install desearch-crawl - After installation, invoke the skill by name or use
/desearch-crawl - Provide required inputs per the skill's parameter spec and get structured output
What is Crawl By Desearch?
Crawl/scrape and extract content from any webpage URL. Returns the page content as clean text or raw HTML. Use this when you need to read the full contents o... It is an AI Agent Skill for Claude Code / OpenClaw, with 877 downloads so far.
How do I install Crawl By Desearch?
Run "/install desearch-crawl" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Crawl By Desearch free?
Yes, Crawl By Desearch is completely free (open-source). You can download, install and use it at no cost.
Which platforms does Crawl By Desearch support?
Crawl By Desearch is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Crawl By Desearch?
It is built and maintained by okradze (@okradze); the current version is v1.0.1.