← Back to Skills Marketplace
小红书笔记抓取与选题助手
by
2813223285
· GitHub ↗
· v1.0.1
· MIT-0
115
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install xiaohongshu-note-fetcher
Description
抓取并整理小红书笔记公开页面信息(标题、正文摘要、作者、发布时间、互动数据、标签、封面图等)为结构化 JSON 或 Markdown。用于“根据笔记链接提取内容”“批量收集笔记基础信息”“生成笔记摘要素材”等场景;当用户提供小红书笔记 URL、URL 列表或需要导出机器可读结果时触发。
Usage Guidance
This package appears coherent for scraping and structuring public Xiaohongshu notes, but pay attention to secrets and external endpoints before running: 1) Do not paste full browser cookie strings or session tokens into untrusted environments — those are equivalent to logging in as you. 2) TikHub API calls will send your TikHub token to api.tikhub.io (expected); only use tokens you control and trust the third party. 3) The generic API script (search_notes_generic.py) will call whatever base_url you provide with whatever token/header you give it — avoid pointing it at unknown hosts to prevent accidental secret exfiltration. 4) generate_wow_pack encodes content into a mermaid.ink URL — generating/sharing that URL will upload the encoded content to a third-party service when the URL is resolved. 5) If you run the Playwright scraper, installing Playwright will download browser binaries; run it in a sandbox or environment where you can vet the network activity. 6) Ensure your usage complies with Xiaohongshu’s terms, robots.txt, and privacy rules. If you need to proceed: run locally in an isolated environment, avoid reusing high-privilege session cookies, and inspect outputs before sharing.
Capability Analysis
Type: OpenClaw Skill
Name: xiaohongshu-note-fetcher
Version: 1.0.1
The skill bundle is a comprehensive toolkit for scraping and analyzing Xiaohongshu (Red) note data, utilizing both direct browser automation (Playwright) and third-party APIs (TikHub). The scripts (e.g., fetch_xiaohongshu_notes.py, search_notes_tikhub.py) are well-structured, provide clear documentation on handling user-provided credentials like Cookies and API tokens, and lack any indicators of malicious intent such as data exfiltration to unauthorized endpoints or hidden command execution. The bundle also includes legitimate utility features like generating mindmap URLs via Mermaid.ink (generate_wow_pack.py) and interactive data filtering.
Capability Tags
Capability Assessment
Purpose & Capability
The name/description (抓取小红书笔记并结构化输出) aligns with the included files: multiple fetchers (HTTP and Playwright), TikHub API callers, result processors, and export tools. The code and SKILL.md consistently implement search-by-keyword, single/batch URL fetch, and output generation.
Instruction Scope
Runtime instructions are narrowly scoped to fetching pages, parsing metadata, and exporting results. They do request user-provided cookie strings/files and API tokens (TikHub or generic endpoints), and they produce optional screenshots/HTML. These are relevant to the stated task but are sensitive inputs (browser cookies/session tokens). The generic adapter (search_notes_generic.py) can call any base_url you supply — that is powerful and can be misused if pointed at a malicious endpoint.
Install Mechanism
There is no formal install spec (instruction-only skill), which is low-risk. The repo contains package.json/package-lock and instructs installing playwright when using the browser-mode scraper — installing Playwright downloads browser binaries (normal for this task but more heavyweight). All downloads referenced are standard registries (npm) and known endpoints (no obscure download URLs in SKILL.md).
Credentials
The skill does not declare required env vars and does not require unrelated credentials. It does legitimately require API tokens or cookie files provided by the user to access TikHub or to fetch authenticated pages. Because the generic search script accepts arbitrary base_url + token, a misconfigured usage could leak a token to an unintended endpoint — the files and instructions make this possible but it is user-driven rather than hidden.
Persistence & Privilege
The skill is not always-enabled and does not request elevated persistence. It does not modify other skills or system-wide configs. It runs as invoked and writes output files in the working directory (expected behavior).
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install xiaohongshu-note-fetcher - After installation, invoke the skill by name or use
/xiaohongshu-note-fetcher - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
- 新增极简用法演示,支持一行命令调用 TikHub API 并输出标准结构化文件。
- 补充 Makefile 用法说明,支持 API Key 文件读取和推荐最佳长期用法。
- 无破坏性变更,旧参数规则和常用脚本保持兼容。
- 增加 PUBLISHING_NOTES.txt 文件,丰富发布或维护说明。
v1.0.0
xiaohongshu-note-fetcher 1.0.0 初始版本发布:
- 支持抓取和整理小红书笔记公开页面信息为结构化 JSON 或 Markdown,包括标题、摘要、作者、发布时间、互动数据、标签、封面图等。
- 支持单条和批量链接输入,并可通过 Cookie 获得更完整信息。
- 提供多种抓取与导出方式,包括原生脚本、TikHub/通用 API、Playwright 浏览器自动化等。
- 附带批量处理、互动筛选、导图生成、交互式筛选工具等多个实用脚本。
- 详尽参数说明、常见问题处理与合规指引,支持多场景采集和内容整理需求。
Metadata
Frequently Asked Questions
What is 小红书笔记抓取与选题助手?
抓取并整理小红书笔记公开页面信息(标题、正文摘要、作者、发布时间、互动数据、标签、封面图等)为结构化 JSON 或 Markdown。用于“根据笔记链接提取内容”“批量收集笔记基础信息”“生成笔记摘要素材”等场景;当用户提供小红书笔记 URL、URL 列表或需要导出机器可读结果时触发。 It is an AI Agent Skill for Claude Code / OpenClaw, with 115 downloads so far.
How do I install 小红书笔记抓取与选题助手?
Run "/install xiaohongshu-note-fetcher" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is 小红书笔记抓取与选题助手 free?
Yes, 小红书笔记抓取与选题助手 is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does 小红书笔记抓取与选题助手 support?
小红书笔记抓取与选题助手 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created 小红书笔记抓取与选题助手?
It is built and maintained by 2813223285 (@2813223285); the current version is v1.0.1.
More Skills