Kekik Crawler
Author: Ömer Faruk Sancak · v0.1.0-rc1
Total downloads: 726
Favorites: 0
Current installs: 4
Versions: 1
Install in OpenClaw
/install kekik-crawler
Description
Scrapling-only, deterministic web crawler with clean SRP architecture, presets, checkpointing, and JSONL/report outputs.
Usage (SKILL.md)
Kekik Crawler
A Scrapling-based, browserless crawler.
Quick Start
pip install -r requirements.txt
python main.py --urls https://example.org
Presets
python main.py --queries "Ömer Faruk Sancak" keyiflerolsun --preset person-research --out outputs/person.jsonl --report outputs/person-report.json
python main.py --queries "Ömer Faruk Sancak" keyiflerolsun --preset deep-research --out outputs/deep.jsonl --report outputs/deep-report.json
Notes
- Output files are under outputs/
- Main entrypoint: main.py
- Orchestration: core/crawl_runner.py
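Since the crawler writes JSONL under outputs/, a quick way to inspect a run is to read the file line by line. The sketch below is only an illustration: the record schema is not documented here, so it assumes nothing about field names and simply counts records and tallies whatever top-level keys appear. The function name `summarize_jsonl` is hypothetical and not part of the project.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_jsonl(path):
    """Count records and tally top-level keys in a JSONL file.

    Hypothetical helper: it makes no assumption about the crawler's
    record schema and only reports what it finds.
    """
    records = 0
    key_counts = Counter()
    with Path(path).open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if not line:
                continue  # tolerate blank lines between records
            obj = json.loads(line)  # one JSON document per line
            records += 1
            if isinstance(obj, dict):
                key_counts.update(obj.keys())
    return records, key_counts
```

For example, after running one of the presets, `summarize_jsonl("outputs/person.jsonl")` would return the record count and a `Counter` of the keys present in the records.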
Security recommendations
This package appears to be a straightforward local crawler, but review a few things before running:
1) Presets (person-research, deep-research) set no_robots=True, so they ignore robots.txt; only use them against targets where that is acceptable.
2) The plugin system loads and executes Python files from the plugins directory. Inspect any plugins you use, and do not point plugin_dir at an untrusted path.
3) The runtime will fetch pages and write outputs/cache/checkpoint files under outputs/; run in an isolated directory or container if you want to limit side effects.
4) The instructions require pip installing dependencies from PyPI (notably scrapling); verify package versions and provenance.
If you want lower risk, run the tests (pytest) and review the included plugins and requirements before executing the crawler against external or internal networks.
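Because the research presets skip robots.txt, one option is to check a target's rules yourself before crawling. The following is a minimal sketch using the standard library's `urllib.robotparser`; it is independent of the crawler, and the helper name `allowed_by_robots` and the sample rules are made up for illustration.

```python
from urllib.robotparser import RobotFileParser


def allowed_by_robots(robots_txt, url, user_agent="*"):
    """Return True if robots_txt permits user_agent to fetch url.

    Hypothetical pre-flight check; it parses robots.txt text that the
    caller has already obtained and performs no network access itself.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Made-up rules for illustration only.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""
```

With the sample rules, `allowed_by_robots(SAMPLE_ROBOTS, "https://example.org/private/data")` is rejected while a URL outside /private/ is allowed. Against a live site, `RobotFileParser.set_url()` followed by `read()` fetches the real file instead.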
Functional analysis
Type: OpenClaw Skill
Name: kekik-crawler
Version: 0.1.0-rc1
The skill is a web crawler that performs legitimate web fetching and local file storage. However, it is classified as 'suspicious' due to a significant Remote Code Execution (RCE) vulnerability. The `core/plugin_manager.py` module dynamically loads Python files from a user-specified `plugin_dir` (via `importlib.util.spec_from_file_location` and `spec.loader.exec_module`). While the default `plugin_dir` is 'plugins/' within the skill bundle, an attacker who can control the `--plugins` argument in `main.py` could point it to an arbitrary directory containing malicious Python code, leading to arbitrary code execution. There is no evidence of intentional malicious behavior such as data exfiltration, persistence, or prompt injection against the agent in the provided code or documentation.
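To make the risk concrete, here is a sketch of the general loading pattern described above, with one possible mitigation added: refusing any plugin directory that resolves outside a trusted root. This is not the code in `core/plugin_manager.py`; the function `load_plugins` and its `trusted_root` parameter are assumptions introduced for illustration, and `Path.is_relative_to` requires Python 3.9 or newer.

```python
import importlib.util
from pathlib import Path


def load_plugins(plugin_dir, trusted_root):
    """Load *.py files from plugin_dir, refusing paths outside trusted_root.

    Hypothetical sketch, not the project's implementation.
    """
    root = Path(trusted_root).resolve()
    directory = Path(plugin_dir).resolve()  # also collapses ".." segments
    if not directory.is_relative_to(root):  # Python 3.9+
        raise ValueError(f"{directory} is outside trusted root {root}")

    modules = {}
    for path in sorted(directory.glob("*.py")):
        spec = importlib.util.spec_from_file_location(path.stem, path)
        if spec is None or spec.loader is None:
            continue  # not importable as a module
        module = importlib.util.module_from_spec(spec)
        # exec_module runs the file's top-level code with full privileges,
        # which is why an untrusted directory means arbitrary code execution.
        spec.loader.exec_module(module)
        modules[path.stem] = module
    return modules
```

The path check narrows where code can come from, but it does not sandbox the plugins themselves; anything inside the trusted directory still runs with the crawler's privileges.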
Capability assessment
Purpose & Capability
The code implements a scrapling-based crawler and matches the name/description: crawl orchestration, fetcher, plugins, checkpointing, JSONL outputs and a report. No unrelated env vars, binaries, or external services are requested.
Instruction Scope
SKILL.md instructs running pip install -r requirements.txt and then main.py, which will fetch arbitrary web pages, write outputs/cache/checkpoint files, and load plugins from a plugin directory. The runtime can fetch robots.txt and search engine pages (DuckDuckGo/Bing/Yahoo/Brave). Plugin loading goes through importlib (spec.loader.exec_module), which executes plugin code; this is expected for a plugin system, but worth reviewing.
Install Mechanism
No registry install spec is provided; the README/SKILL.md asks to run pip install -r requirements.txt (selectolax, tenacity, orjson, scrapling). Installing from PyPI is normal but carries the usual supply-chain risk (verify packages/versions). There are no downloads from arbitrary URLs or archives in the manifest.
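As a small aid for the version check suggested above, the sketch below reports which versions of the four listed dependencies are installed in the current environment, using the standard library's `importlib.metadata`. The helper name `installed_versions` is hypothetical, and the check only reports versions; it does not verify provenance or compare against pinned requirements.

```python
from importlib.metadata import PackageNotFoundError, version

# Dependency names as listed in the review above.
REQUIRED = ("selectolax", "tenacity", "orjson", "scrapling")


def installed_versions(names=REQUIRED):
    """Map each distribution name to its installed version, or None."""
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None  # not installed in this environment
    return found
```

Calling `installed_versions()` after `pip install -r requirements.txt` gives a dictionary that can be compared by eye against the versions you expect.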
Credentials
The skill declares no required environment variables or credentials, which is proportionate for a web crawler. It writes local files (outputs/, SQLite cache) and does not request unrelated secrets.
Persistence & Privilege
always:false and no modifications to other skills or system-wide configs. The skill stores checkpoints, cache, and reports under outputs/ (normal). Note: presets intentionally set no_robots=True for research presets, which increases crawling aggressiveness but is an operational choice rather than a permission request.
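How to use
- Make sure OpenClaw is installed (locally or via Docker)
- Enter the install command in the chat box: /install kekik-crawler
- Once installation finishes, invoke the Skill by name or trigger it with /kekik-crawler
- Provide the inputs required by the Skill's parameter description to get structured output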
Version history
v0.1.0-rc1
First release candidate: Scrapling-only deterministic crawler with SRP architecture, presets, tests, and reporting.
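FAQ
What is Kekik Crawler?
Scrapling-only, deterministic web crawler with clean SRP architecture, presets, checkpointing, and JSONL/report outputs. It is an AI agent Skill plugin for Claude Code / OpenClaw, with 726 total downloads so far.
How do I install Kekik Crawler?
Run the command "/install kekik-crawler" in the OpenClaw or Claude Code chat box to install it in one step; no extra configuration is required.
Is Kekik Crawler free?
Yes. Kekik Crawler is completely free (free and open source) and can be freely downloaded, installed, and used.
Which platforms does Kekik Crawler support?
Kekik Crawler is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who develops Kekik Crawler?
It is developed and maintained by Ömer Faruk Sancak (@keyiflerolsun); the current version is v0.1.0-rc1.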