Kekik Crawler

Name: Kekik Crawler
Author: keyiflerolsun

By Ömer Faruk Sancak · GitHub · v0.1.0-rc1

cross-platform ⚠ suspicious

Total downloads: 726
Current installs: 4
Versions: 1

Install in OpenClaw

/install kekik-crawler

Description

Scrapling-only, deterministic web crawler with clean SRP architecture, presets, checkpointing, and JSONL/report outputs.

Usage (SKILL.md)

Kekik Crawler

A Scrapling-based, browserless crawler.

Quick Start

pip install -r requirements.txt
python main.py --urls https://example.org

Presets

python main.py --queries "Ömer Faruk Sancak" keyiflerolsun --preset person-research --out outputs/person.jsonl --report outputs/person-report.json
python main.py --queries "Ömer Faruk Sancak" keyiflerolsun --preset deep-research --out outputs/deep.jsonl --report outputs/deep-report.json

Notes

Output files are under outputs/
Main entrypoint: main.py
Orchestration: core/crawl_runner.py
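
Since results are written as JSONL (one JSON object per line), a small reader can be handy for inspecting them. The following is a minimal sketch, not part of the skill itself: the helper name iter_jsonl is hypothetical, and no assumption is made about which fields each record contains.

```python
import json
from pathlib import Path
from typing import Iterator


def iter_jsonl(path: str) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of a JSONL file.

    Hypothetical helper: the record schema is not documented on this page,
    so records are returned as plain dictionaries without validation.
    """
    with Path(path).open(encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines between records
            try:
                yield json.loads(line)
            except json.JSONDecodeError as exc:
                # Report the offending line so a truncated file is easy to spot.
                raise ValueError(f"{path}:{line_number}: invalid JSON") from exc
```

For example, `for record in iter_jsonl("outputs/person.jsonl"): print(record)` would print each record produced by the person-research preset command above.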

Security recommendations

This package appears to be a straightforward local crawler, but review a few things before running:

1) The presets (person-research, deep-research) set no_robots=True, so they ignore robots.txt; only use them on targets where that is acceptable.
2) The plugin system loads and executes Python files from the plugins directory; inspect any plugins you use, and do not point plugin_dir at an untrusted path.
3) The runtime fetches pages and writes output, cache, and checkpoint files under outputs/; run it in an isolated directory or container if you want to limit side effects.
4) The instructions require installing dependencies from PyPI with pip (notably scrapling); verify package versions and provenance.

If you want lower risk, run the tests (pytest) and review the included plugins and requirements before executing the crawler against external or internal networks.
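
Because the research presets skip robots.txt, it may be worth checking a target's rules yourself before crawling. The sketch below uses Python's standard urllib.robotparser and works on robots.txt text you have already fetched; the function name is_allowed is hypothetical and this is not how the skill itself handles robots.txt.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url.

    Hypothetical pre-flight check; it parses text supplied by the caller
    and performs no network access of its own.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)
```

A caller could fetch https://example.org/robots.txt by any means, pass its body to is_allowed along with the URL to be crawled, and skip the URL when the result is False.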

Functional analysis

Type: OpenClaw Skill
Name: kekik-crawler
Version: 0.1.0-rc1

The skill is a web crawler that performs legitimate web fetching and local file storage. However, it is classified as 'suspicious' due to a significant Remote Code Execution (RCE) vulnerability. The `core/plugin_manager.py` module dynamically loads Python files from a user-specified `plugin_dir` (via `importlib.util.spec_from_file_location` and `spec.loader.exec_module`). While the default `plugin_dir` is 'plugins/' within the skill bundle, an attacker who can control the `--plugins` argument in `main.py` could point it to an arbitrary directory containing malicious Python code, leading to arbitrary code execution.

There is no evidence of intentional malicious behavior such as data exfiltration, persistence, or prompt injection against the agent in the provided code or documentation.
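
To illustrate the loading pattern described in this analysis, here is a minimal sketch of directory-based plugin loading with the same importlib calls, plus one possible mitigation: refusing any plugin directory outside a trusted root. This is not the code in core/plugin_manager.py; the function load_plugins and its trusted_root parameter are assumptions made for the example.

```python
import importlib.util
from pathlib import Path
from types import ModuleType


def load_plugins(plugin_dir: str, trusted_root: str) -> list[ModuleType]:
    """Load every *.py file in plugin_dir, but only if it lies under trusted_root.

    Hypothetical sketch (Python 3.9+). exec_module runs each file's
    top-level code, which is why the directory must be trusted.
    """
    root = Path(trusted_root).resolve()
    directory = Path(plugin_dir).resolve()
    # Resolve both paths first so symlinks and ".." cannot escape the root.
    if not directory.is_relative_to(root):
        raise ValueError(f"plugin_dir {directory} is outside trusted root {root}")

    modules = []
    for path in sorted(directory.glob("*.py")):
        spec = importlib.util.spec_from_file_location(f"plugin_{path.stem}", path)
        if spec is None or spec.loader is None:
            continue  # not importable as a Python source file
        module = importlib.util.module_from_spec(spec)
        spec.loader.exec_module(module)  # executes the plugin file
        modules.append(module)
    return modules
```

A containment check like this only narrows where code may be loaded from; anything placed inside the trusted root still runs with the crawler's full privileges, so reviewing the plugins remains necessary.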

Capability assessment

✓ Purpose & Capability

The code implements a scrapling-based crawler and matches the name/description: crawl orchestration, fetcher, plugins, checkpointing, JSONL outputs and a report. No unrelated env vars, binaries, or external services are requested.

ℹ Instruction Scope

SKILL.md instructs running pip install -r requirements.txt and then main.py, which will fetch arbitrary web pages, write output, cache, and checkpoint files, and load plugins from a plugin directory. The runtime can fetch robots.txt and search engine pages (DuckDuckGo/Bing/Yahoo/Brave). Plugin loading uses importlib with spec.loader.exec_module (executing plugin code); this is expected for a plugin system, but worth reviewing.

ℹ Install Mechanism

No registry install spec is provided; the README/SKILL.md asks to run pip install -r requirements.txt (selectolax, tenacity, orjson, scrapling). Installing from PyPI is normal but carries the usual supply-chain risk (verify packages/versions). There are no downloads from arbitrary URLs or archives in the manifest.
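
One lightweight way to act on the "verify packages/versions" advice is to record which versions actually got installed. The sketch below uses the standard importlib.metadata module; the helper name installed_versions is hypothetical, and it only reports versions, it does not verify provenance or hashes.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_versions(packages: list[str]) -> dict[str, Optional[str]]:
    """Map each distribution name to its installed version, or None if absent.

    Hypothetical helper for auditing an environment after pip install.
    """
    result: dict[str, Optional[str]] = {}
    for name in packages:
        try:
            result[name] = version(name)
        except PackageNotFoundError:
            result[name] = None  # package is not installed in this environment
    return result
```

Calling `installed_versions(["selectolax", "tenacity", "orjson", "scrapling"])` after installation gives a record that can be compared against the versions you reviewed.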

✓ Credentials

The skill declares no required environment variables or credentials, which is proportionate for a web crawler. It writes local files (outputs/, SQLite cache) and does not request unrelated secrets.

✓ Persistence & Privilege

The skill sets always:false and makes no modifications to other skills or system-wide configs. It stores checkpoints, cache, and reports under outputs/ (normal). Note: the research presets intentionally set no_robots=True, which increases crawling aggressiveness but is an operational choice rather than a permission request.

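How to use

Make sure OpenClaw is installed (locally or via Docker).
Enter the install command in the chat box: /install kekik-crawler
Once installed, invoke the skill by name or trigger it with /kekik-crawler
Provide the required inputs according to the skill's parameter description to get structured output.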

Version history

v0.1.0-rc1

First release candidate: Scrapling-only deterministic crawler with SRP architecture, presets, tests, and reporting.

Metadata

Slug kekik-crawler

Version 0.1.0-rc1

License: not listed

Total installs 4

Current installs 4

Versions 1

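FAQ

What is Kekik Crawler?

Scrapling-only, deterministic web crawler with clean SRP architecture, presets, checkpointing, and JSONL/report outputs. It is an AI agent skill plugin for Claude Code / OpenClaw, with 726 total downloads to date.

How do I install Kekik Crawler?

Run the command "/install kekik-crawler" in the OpenClaw or Claude Code chat box to install it in one step; no additional configuration is needed.

Is Kekik Crawler free?

Yes, Kekik Crawler is completely free (free and open source) and can be freely downloaded, installed, and used.

Which platforms does Kekik Crawler support?

Kekik Crawler is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed Kekik Crawler?

It is developed and maintained by Ömer Faruk Sancak (@keyiflerolsun); the current version is v0.1.0-rc1.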
