← Back to Skills Marketplace
csak47mu

抖音文案提取

by csak47mu · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
109
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install douyin-text-grab
Description
从抖音链接提取完整文案并生成Word文件。触发条件:用户发送抖音链接并要求提取文案/文本/内容,或说'抓文案''提取内容''做成文件'。支持视频和图文笔记。不做总结不改写,原文案完整提取。
README (SKILL.md)

抖音文案提取技能

触发条件

用户发送抖音链接(https://v.douyin.com/xxx 或分享口令)+ 提取/抓取/文案/内容/文件 相关意图。

工作流程

Step 1: 解析短链获取数据

curl -s -L -o /tmp/douyin_page.html \
  -w "%{http_code}" \
  "https://v.douyin.com/xxx/" \
  -H "User-Agent: Mozilla/5.0 (iPhone; CPU iPhone OS 16_0 like Mac OS X) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/16.0 Mobile/15E148 Safari/604.1" \
  --connect-timeout 10

Step 2: 提取 _ROUTER_DATA

从 HTML 中提取 _ROUTER_DATA JSON,关键字段:

  • item_list[0].desc完整文案(核心,一个字不改)
  • item_list[0].author.nickname — 作者昵称
  • item_list[0].statistics — 点赞/评论/转发/收藏
  • item_list[0].create_time — 发布时间戳

Step 3: 生成 Word 文件

使用 scripts/create_docx.py

python3 scripts/create_docx.py --title "标题" --author "作者" --content "完整文案" --output /tmp/output.docx

Step 4: 投递文件

message action=send filePath=/tmp/output.docx

核心规则

  1. 原文案完整提取,不总结不改写
  2. 先交文件再说话 — 不问"要不要做"
  3. 按原文结构排版:识别"第X组""复制这句"等结构化内容
  4. 文件名包含作者名便于识别
Usage Guidance
This skill appears to do what it says: fetch a Douyin short link page, extract the _ROUTER_DATA JSON, convert the extracted text into a .docx, and send the file. Before installing, note: (1) the Python script requires the python-docx package (metadata does not declare this); install python-docx in a safe environment rather than using '--break-system-packages'. (2) The skill will deliver the generated file automatically ('先交文件再说话') — if that surprises you, ask for a modification so it prompts for user confirmation before sending. (3) The SKILL.md omits the exact extraction steps for _ROUTER_DATA and may fail if Douyin requires JS rendering; this is a functional limitation, not a security issue. (4) Ensure you trust the agent context where files are sent (avoid sending sensitive links/content if you don't want automatic upload).
Capability Analysis
Type: OpenClaw Skill Name: douyin-text-grab Version: 1.0.0 The skill is designed to extract text content from Douyin (TikTok China) links and format it into a Word document. The workflow involves using `curl` to fetch page content and a Python script (`scripts/create_docx.py`) to generate the .docx file using the `python-docx` library. While the shell command template in `SKILL.md` technically lacks explicit input sanitization for the URL (a common vulnerability in agentic workflows), the logic is transparent, aligned with the stated purpose, and contains no evidence of malicious intent, data exfiltration, or unauthorized access.
Capability Assessment
Purpose & Capability
Name/description (extract Douyin text and produce Word) matches the included instructions and script: curl to fetch the page, extract _ROUTER_DATA JSON, use scripts/create_docx.py to generate a .docx. No unrelated credentials, binaries, or configuration are requested.
Instruction Scope
Instructions stay within the stated task (download Douyin short link HTML, extract JSON, produce and send a Word file). Two caveats: (1) SKILL.md uses a placeholder short-link and does not provide the exact extraction command/regex for _ROUTER_DATA (functional gap, not necessarily malicious); (2) the core rule '先交文件再说话' (always deliver the file first, without asking) is a privacy/UX concern because the agent will send the file automatically.
Install Mechanism
No install spec (instruction-only) and no external downloads — low risk. However, the script depends on python-docx but the skill metadata does not declare this dependency; the script prints a pip install suggestion (including the unusual '--break-system-packages' flag) which is an operational detail users should be aware of.
Credentials
The skill requests no environment variables, credentials, or config paths. The script only reads inputs passed on the command line and writes a .docx to the filesystem (/tmp or specified path). No secrets are requested or used.
Persistence & Privilege
always is false and the skill does not request persistent or elevated privileges nor modify other skills or system-wide configs. It writes only the output .docx to a specified path and sends it via the agent's message action.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install douyin-text-grab
  3. After installation, invoke the skill by name or use /douyin-text-grab
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
首版:抖音链接→完整文案提取→Word文件生成交付
Metadata
Slug douyin-text-grab
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is 抖音文案提取?

从抖音链接提取完整文案并生成Word文件。触发条件:用户发送抖音链接并要求提取文案/文本/内容,或说'抓文案''提取内容''做成文件'。支持视频和图文笔记。不做总结不改写,原文案完整提取。 It is an AI Agent Skill for Claude Code / OpenClaw, with 109 downloads so far.

How do I install 抖音文案提取?

Run "/install douyin-text-grab" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is 抖音文案提取 free?

Yes, 抖音文案提取 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does 抖音文案提取 support?

抖音文案提取 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created 抖音文案提取?

It is built and maintained by csak47mu (@csak47mu); the current version is v1.0.0.

💬 Comments