← 返回 Skills 市场
665
总下载
0
收藏
7
当前安装
2
版本数
在 OpenClaw 中安装
/install zhihu-fetcher
功能描述
知乎数据获取 - 极简设计,支持三级认证降级(Browser Profile → File Cookie → Fallback),确保数据可靠获取 Zhihu Data Fetcher - Minimalist design with three-level auth fallback (Browser Prof...
安全使用建议
Plain-language checklist and recommendations before installing or running:
- Do not run the scripts blindly. This package expects you to run Node.js scripts and Python scripts locally — ensure you have Node and Python installed and run them in a safe environment.
- Remove or replace any cookie/session values that are pre-filled in config/fallback-sources.json. Treat the included cookie-like strings as potentially sensitive or stale — do not reuse them. Prefer to leave cookie fields empty and fill them yourself from a browser you control.
- If you only need unauthenticated/fallback data, configure the skill to use fallback-only mode (or run only the fallback snippet) so you avoid storing session cookies altogether.
- The browser console snippet (browser-research.js) collects document.cookie and other environment details. Only paste/run that code in a browser you trust and where you are comfortable exposing those values locally. Never paste it into a remote console provided by an untrusted party.
- Inspect the code locally before use. The code currently performs only direct requests to zhihu.com and configured fallback URLs (e.g., GitHub raw). If you see any network calls to unknown endpoints (especially remote servers not documented in SKILL.md), stop and investigate.
- Prefer running in an isolated environment (VM or throwaway container) if you plan to provide cookies from a logged-in account. That limits risk if something unexpected is present.
- Ask the publisher or maintainer to: (1) declare required runtimes (Node, Python) and any other prerequisites in metadata, (2) remove any embedded session tokens from the repo, and (3) confirm whether any of the provided cookie strings are placeholders.
If you want, I can extract the precise lines that read or print cookies and network URLs (so you can audit them), or produce a safer minimal command sequence that uses only the fallback source.
功能分析
Type: OpenClaw Skill
Name: zhihu-fetcher
Version: 1.1.0
The zhihu-fetcher bundle is a well-structured tool for scraping and archiving Zhihu hot lists and search results. It features a robust three-level authentication fallback mechanism (Browser Profile, File Cookie, and Fallback Source) and includes comprehensive scripts for SQLite database management (scripts/db.py), data integration (scripts/save_to_db.py), and HTML report generation (scripts/generate_html.py). While the bundle handles sensitive session cookies and includes research scripts for analyzing anti-crawl mechanisms (snippets/browser-research.js), these behaviors are transparently documented and strictly aligned with the stated purpose of data collection. No evidence of data exfiltration to unauthorized third parties, malicious execution, or prompt injection was found.
能力评估
Purpose & Capability
The skill's stated purpose (fetch Zhihu hot lists with three-level auth) aligns with the scripts provided (Node.js snippets for fetching and Python for DB operations). However the package metadata declares no required binaries or env vars, yet runtime instructions and code clearly require Node.js and Python 3. That mismatch (failing to declare required runtimes) is an incoherence that could mislead users. Otherwise the requested files and configuration (cookies, fallback sources) are consistent with the scraping purpose.
Instruction Scope
SKILL.md instructs users how to login, copy cookies into config, run Node/Python scripts, and optionally run a browser console snippet. The browser-research.js intentionally collects document.cookie plus userAgent, timezone, platform and some window keys — data that is sensitive (session cookies). Collecting these in the browser console is relevant to anti-crawl research, but it also gathers session tokens and environment details which are sensitive and could be exfiltrated if modified. The instructions do not direct data to unknown external endpoints beyond Zhihu and configured fallback URLs (e.g., GitHub raw).
Install Mechanism
There is no external installer or remote download; the skill is instruction+source-file based. That limits supply-chain risk from arbitrary downloads. The scripts will execute locally (Node and Python) and create files under the repo's data/ directory. No third-party install URLs or archives are used.
Credentials
The skill expects users to supply Zhihu session cookies (file-based cookie fallback) which is proportionate to the stated goal of authenticated scraping. However the repository's config/fallback-sources.json already contains pre-filled cookie-like values (e.g., zhihu_session, _xsrf, d_c0). Embedding session tokens or real cookie values in a distributed repo is inappropriate and risky: those values could be reused by others or indicate the author accidentally committed secrets. The skill does not request unrelated credentials or environment variables, which is good, but the included cookie data is a red flag.
Persistence & Privilege
The skill does not request global 'always' installation and defaults allow model invocation (normal). It writes to its own data/zhihu.db and generated HTML files, which is expected for this functionality. It does not attempt to modify other skills or system-wide agent config.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install zhihu-fetcher - 安装完成后,直接呼叫该 Skill 的名称或使用
/zhihu-fetcher触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.1.0
Add bilingual Chinese-English documentation, update with database persistence and HTML visualization features
v1.0.0
zhihu-fetcher 1.0.0
- Initial release: Minimalist Zhihu data fetcher supporting three-level authentication fallback (Browser Profile → File Cookie → Fallback Source) for reliable data retrieval.
- Provides convenient scripts for fetching Zhihu hot lists, search results, and testing.
- Includes flexible configuration for cookies, rate limiting, and authentication priority.
- Supports fallback to alternative data sources if authentication fails.
- Simple file structure and clear usage instructions for quick setup.
- Tested with browser, file cookie, and fallback modes.
元数据
常见问题
知乎数据获取 | Zhihu Data Fetcher 是什么?
知乎数据获取 - 极简设计,支持三级认证降级(Browser Profile → File Cookie → Fallback),确保数据可靠获取 Zhihu Data Fetcher - Minimalist design with three-level auth fallback (Browser Prof... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 665 次。
如何安装 知乎数据获取 | Zhihu Data Fetcher?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install zhihu-fetcher」即可一键安装,无需额外配置。
知乎数据获取 | Zhihu Data Fetcher 是免费的吗?
是的,知乎数据获取 | Zhihu Data Fetcher 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
知乎数据获取 | Zhihu Data Fetcher 支持哪些平台?
知乎数据获取 | Zhihu Data Fetcher 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 知乎数据获取 | Zhihu Data Fetcher?
由 noah(@noah-1106)开发并维护,当前版本 v1.1.0。
推荐 Skills