← 返回 Skills 市场
h8296699

Real Estate Spider

作者 h8296699 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
110
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install real-estate-spider
功能描述
专业爬取中国房产中介网站(安居客、搜房网、贝壳找房、链家)数据的通用爬虫技能,包含反爬虫策略和自动数据提取功能
安全使用建议
This skill appears to do what it claims (crawler + anti‑bot workarounds), but it contains steps that handle and store sensitive session cookies, recommends using proxies and third‑party CAPTCHA‑solving services, and writes session/cookie files to disk. Before installing or running it: 1) Review the shell scripts and Python code line‑by‑line (especially any code that saves or loads session files, or performs network uploads). 2) Do not paste real authentication cookies or API keys into the scripts unless you understand the risks; prefer manual CAPTCHA handling. 3) Run initially in an isolated environment (VM/container) and with non‑privileged user account. 4) Replace example proxy endpoints and CAPTCHA service URLs with vetted providers only if necessary, and store API keys in a secure place (and add explicit env var support). 5) Verify legality and website terms of service for scraping target sites in your jurisdiction. If you are not comfortable auditing the code, treat this skill as high‑risk and avoid providing real credentials or cookies.
功能分析
Type: OpenClaw Skill Name: real-estate-spider Version: 1.0.0 This bundle is a functional real estate data crawler designed for Chinese platforms such as Lianjia, Beike, and Anjuke. It utilizes standard Python scraping libraries (requests, BeautifulSoup) and the OpenClaw agent-browser tool to extract property details while managing anti-bot measures like User-Agent rotation, random delays, and session persistence. The code logic in main.py and the scripts (e.g., real_estate_crawler.py) is transparently aligned with the stated purpose, and no evidence of data exfiltration, credential theft, or unauthorized system access was found.
能力评估
Purpose & Capability
Name/description, required binaries (python3, agent-browser), and included Python and shell scripts all align with a web‑crawling skill for Anjuke/Ke/Lianjia/Soufun. The presence of agent-browser scripts and Python crawlers is expected for the claimed functionality.
Instruction Scope
SKILL.md and included scripts explicitly instruct the agent to: set cookies (including names like lianjia_ssid), save and restore browser session files, simulate device fingerprints, rotate proxies, capture screenshots, and optionally send captcha images to third‑party CAPTCHA solving endpoints. Those actions go beyond simple data collection because they encourage reuse of authenticated sessions and transmission of potentially sensitive artifacts to external services.
Install Mechanism
No remote install/download URLs or package installers are used; the skill is distributed as source files and shell scripts (no install spec). That lowers supply‑chain risk compared to arbitrary remote downloads. The scripts do, however, call 'agent-browser' and rely on a local Python environment.
Credentials
The skill requests no declared environment variables or credentials, but its behavior relies on sensitive data: saved session files and cookies, optional CAPTCHA API keys, and proxy endpoints. The SKILL.md and config reference using/setting cookie values and an API key for captcha solving (example Authorization header), yet no env var is declared for that key—this mismatch and the fact that session/cookie files will be written to disk are proportionality and secrecy handling concerns.
Persistence & Privilege
The skill is not always-enabled and is user-invocable; it runs shell commands and spawns agent-browser subprocesses (main.py uses subprocess for scripts). That behavior is expected for this class of skill, but because it reads/writes session files and can execute shell scripts, run it with the same care you would give any code that manipulates cookies or spawns processes.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install real-estate-spider
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /real-estate-spider 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of Real Estate Spider – a universal crawler for major Chinese real estate sites. - Supports data extraction from Anjuke, Soufun, Beike (ke.com), and Lianjia. - Includes anti-crawling strategies: browser fingerprint simulation, random delay, session & cookie management, optional proxy IP, and captcha handling. - Extracts core real estate info: price, area, location, type, decoration, and year built. - Allows export in JSON, CSV, Excel, and supports visualization. - Provides usage scripts for both Python and agent-browser automation.
元数据
Slug real-estate-spider
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Real Estate Spider 是什么?

专业爬取中国房产中介网站(安居客、搜房网、贝壳找房、链家)数据的通用爬虫技能,包含反爬虫策略和自动数据提取功能. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 110 次。

如何安装 Real Estate Spider?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install real-estate-spider」即可一键安装,无需额外配置。

Real Estate Spider 是免费的吗?

是的,Real Estate Spider 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Real Estate Spider 支持哪些平台?

Real Estate Spider 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Real Estate Spider?

由 h8296699(@h8296699)开发并维护,当前版本 v1.0.0。

💬 留言讨论