
Douyin Caption Extraction (抖音文案提取)

by csak47mu · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

Downloads: 109

Install in OpenClaw

/install douyin-text-grab

Description

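Extracts the complete caption text from a Douyin link and generates a Word file. Trigger: the user sends a Douyin link and asks to extract the copy, text, or content, or says '抓文案' (grab the copy), '提取内容' (extract the content), or '做成文件' (make it a file). Supports videos and image-text notes. No summarizing and no rewriting: the original caption is extracted in full.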

README (SKILL.md)

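Douyin Caption Extraction Skill

Trigger

The user sends a Douyin link (https://v.douyin.com/xxx or a share code) together with an intent related to extracting or grabbing the copy, content, or a file.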
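
Share messages usually wrap the link in other text, so the link has to be isolated before it can be fetched. The sketch below shows one way to do that. The function name `find_short_link` and the pattern for the link token (letters, digits, `_`, `-`) are assumptions for illustration, not part of the skill.

```python
import re
from typing import Optional

# Assumption: short links have the form https://v.douyin.com/<token>/ and the
# token consists of letters, digits, '_' or '-'.
SHORT_LINK_RE = re.compile(r"https://v\.douyin\.com/[A-Za-z0-9_-]+/?")


def find_short_link(message: str) -> Optional[str]:
    """Return the first Douyin short link found in a message, or None."""
    match = SHORT_LINK_RE.search(message)
    return match.group(0) if match else None
```

A message without a matching link yields `None`, which the agent can treat as "trigger condition not met".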

Workflow

Step 1: Resolve the short link and fetch the page

curl -s -L -o /tmp/douyin_page.html \
  -w "%{http_code}" \
  "https://v.douyin.com/xxx/" \
  -H "User-Agent: Mozilla/5.0 (iPhone; CPU iPhone OS 16_0 like Mac OS X) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/16.0 Mobile/15E148 Safari/604.1" \
  --connect-timeout 10

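Step 2: Extract _ROUTER_DATA

Extract the _ROUTER_DATA JSON from the HTML. Key fields:

item_list[0].desc: the full caption (the core field; do not change a single character)
item_list[0].author.nickname: the author's nickname
item_list[0].statistics: likes, comments, shares, and favorites
item_list[0].create_time: the publication timestamp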
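
The SKILL.md does not give the extraction command itself, so the following is only a minimal sketch under stated assumptions: that the page embeds the data as a JSON object literal following the text `_ROUTER_DATA`, and that `item_list` sits somewhere inside that object (its exact nesting path is not documented here, so the sketch searches for the key). The helper names are hypothetical.

```python
import json
from typing import Any, Optional


def extract_router_data(html: str) -> dict:
    """Parse the JSON object that follows the _ROUTER_DATA marker."""
    marker = html.find("_ROUTER_DATA")
    if marker == -1:
        raise ValueError("_ROUTER_DATA not found in page")
    start = html.find("{", marker)
    if start == -1:
        raise ValueError("no JSON object after _ROUTER_DATA")
    # raw_decode parses one JSON value and ignores whatever trails it
    # (for example the closing </script> tag).
    data, _end = json.JSONDecoder().raw_decode(html, start)
    return data


def find_key(node: Any, key: str) -> Optional[Any]:
    """Depth-first search for the first occurrence of `key`."""
    if isinstance(node, dict):
        if key in node:
            return node[key]
        children = list(node.values())
    elif isinstance(node, list):
        children = node
    else:
        return None
    for child in children:
        found = find_key(child, key)
        if found is not None:
            return found
    return None


def extract_post(html: str) -> dict:
    """Return the key fields of the first item, caption left untouched."""
    items = find_key(extract_router_data(html), "item_list")
    if not items:
        raise ValueError("item_list missing or empty")
    item = items[0]
    return {
        "desc": item["desc"],
        "nickname": item["author"]["nickname"],
        "statistics": item.get("statistics", {}),
        "create_time": item.get("create_time"),
    }
```

If the page only fills in this data after JavaScript runs, the marker will be absent and `extract_router_data` raises `ValueError` rather than returning partial text.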

Step 3: Generate the Word file

Use scripts/create_docx.py:

python3 scripts/create_docx.py --title "Title" --author "Author" --content "Full caption" --output /tmp/output.docx
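
Captions often contain quotes and line breaks, which are awkward to pass through a shell string. One possible approach, sketched below, is to call the script with an argument list so the caption reaches it byte for byte. The flags are the ones shown in the command above; the wrapper functions `build_docx_command` and `create_docx` are hypothetical.

```python
import subprocess
from typing import List


def build_docx_command(title: str, author: str, content: str, output: str) -> List[str]:
    """Build the argument list for scripts/create_docx.py."""
    return [
        "python3", "scripts/create_docx.py",
        "--title", title,
        "--author", author,
        "--content", content,  # passed as one argument, no shell quoting needed
        "--output", output,
    ]


def create_docx(title: str, author: str, content: str, output: str) -> None:
    """Run the script; raises CalledProcessError on a non-zero exit code."""
    subprocess.run(build_docx_command(title, author, content, output), check=True)
```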

Step 4: Deliver the file

message action=send filePath=/tmp/output.docx

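Core rules

Extract the original caption in full; no summarizing, no rewriting
Deliver the file first, then talk: do not ask "should I make it?"
Lay out the document following the original structure: recognize structured markers such as '第X组' (Group X) and '复制这句' (copy this sentence)
Include the author's name in the file name for easy identification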
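
The last rule can be illustrated with a small helper that builds a file name from the author's nickname and the title. This is a sketch only: the function `build_output_name`, the set of characters it replaces, and the 40-character title limit are assumptions, not behavior documented by the skill.

```python
import re


def build_output_name(author: str, title: str, max_len: int = 40) -> str:
    """Build a .docx file name that starts with the author's name."""

    def clean(text: str) -> str:
        # Replace path separators, reserved characters and whitespace runs
        # with a single underscore, then trim underscores at the ends.
        return re.sub(r'[\\/:*?"<>|\s]+', "_", text).strip("_")

    stem = f"{clean(author)}_{clean(title)[:max_len]}".strip("_")
    # Fall back to a fixed name if both parts were empty after cleaning.
    return (stem or "douyin") + ".docx"
```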

Usage Guidance

This skill appears to do what it says: fetch a Douyin short link page, extract the _ROUTER_DATA JSON, convert the extracted text into a .docx, and send the file. Before installing, note: (1) the Python script requires the python-docx package (the metadata does not declare this); install python-docx in a safe environment, such as a virtual environment, rather than using '--break-system-packages'. (2) The skill delivers the generated file automatically ('先交文件再说话', i.e. deliver the file first, then talk); if that surprises you, ask for a modification so that it prompts for user confirmation before sending. (3) The SKILL.md omits the exact extraction steps for _ROUTER_DATA, and extraction may fail if Douyin requires JS rendering; this is a functional limitation, not a security issue. (4) Make sure you trust the agent context where files are sent (avoid sending sensitive links or content if you don't want them uploaded automatically).

Capability Analysis

Type: OpenClaw Skill
Name: douyin-text-grab
Version: 1.0.0

The skill is designed to extract text content from Douyin (TikTok China) links and format it into a Word document. The workflow involves using `curl` to fetch page content and a Python script (`scripts/create_docx.py`) to generate the .docx file using the `python-docx` library. While the shell command template in `SKILL.md` technically lacks explicit input sanitization for the URL (a common vulnerability in agentic workflows), the logic is transparent, aligned with the stated purpose, and contains no evidence of malicious intent, data exfiltration, or unauthorized access.
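
The missing input sanitization could be addressed along the following lines. This is a hedged sketch rather than anything the skill ships: it checks that the URL is an https link on v.douyin.com with a plain token path, then builds the `curl` argument list (same flags as the SKILL.md command) so no shell ever interprets the URL. The function name `build_curl_argv` and the path pattern are assumptions.

```python
import re
from typing import List
from urllib.parse import urlsplit

USER_AGENT = (
    "Mozilla/5.0 (iPhone; CPU iPhone OS 16_0 like Mac OS X) "
    "AppleWebKit/605.1.15 (KHTML, like Gecko) Version/16.0 "
    "Mobile/15E148 Safari/604.1"
)

# Assumption: a short-link path is one token of letters, digits, '_' or '-'.
PATH_RE = re.compile(r"/[A-Za-z0-9_-]+/?")


def build_curl_argv(url: str, out_path: str = "/tmp/douyin_page.html") -> List[str]:
    """Validate a Douyin short link and return a curl argument list."""
    parts = urlsplit(url)
    if parts.scheme != "https" or parts.hostname != "v.douyin.com":
        raise ValueError("not an https v.douyin.com link")
    if not PATH_RE.fullmatch(parts.path):
        raise ValueError("unexpected characters in link path")
    # Rebuild the URL from validated parts; query and fragment are dropped.
    clean_url = f"https://v.douyin.com{parts.path}"
    return [
        "curl", "-s", "-L", "-o", out_path,
        "-w", "%{http_code}",
        clean_url,
        "-H", f"User-Agent: {USER_AGENT}",
        "--connect-timeout", "10",
    ]
```

The resulting list can be handed to `subprocess.run(argv, capture_output=True, text=True)`; because no shell is involved, characters such as `;` or `$(` in a user-supplied link are rejected by the check or treated as plain data.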

Capability Assessment

✓ Purpose & Capability

Name/description (extract Douyin text and produce Word) matches the included instructions and script: curl to fetch the page, extract _ROUTER_DATA JSON, use scripts/create_docx.py to generate a .docx. No unrelated credentials, binaries, or configuration are requested.

ℹ Instruction Scope

Instructions stay within the stated task (download Douyin short link HTML, extract JSON, produce and send a Word file). Two caveats: (1) SKILL.md uses a placeholder short-link and does not provide the exact extraction command/regex for _ROUTER_DATA (functional gap, not necessarily malicious); (2) the core rule '先交文件再说话' (always deliver the file first, without asking) is a privacy/UX concern because the agent will send the file automatically.

ℹ Install Mechanism

No install spec (instruction-only) and no external downloads — low risk. However, the script depends on python-docx but the skill metadata does not declare this dependency; the script prints a pip install suggestion (including the unusual '--break-system-packages' flag) which is an operational detail users should be aware of.

✓ Credentials

The skill requests no environment variables, credentials, or config paths. The script only reads inputs passed on the command line and writes a .docx to the filesystem (/tmp or specified path). No secrets are requested or used.

✓ Persistence & Privilege

always is false and the skill does not request persistent or elevated privileges nor modify other skills or system-wide configs. It writes only the output .docx to a specified path and sends it via the agent's message action.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install douyin-text-grab
After installation, invoke the skill by name or use /douyin-text-grab
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release: Douyin link → full caption extraction → Word file generation and delivery

Metadata

Slug douyin-text-grab

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is 抖音文案提取?

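抖音文案提取 extracts the complete caption text from a Douyin link and generates a Word file. It triggers when the user sends a Douyin link and asks to extract the copy, text, or content, or says '抓文案' (grab the copy), '提取内容' (extract the content), or '做成文件' (make it a file). It supports videos and image-text notes, and it extracts the original caption in full without summarizing or rewriting. It is an AI Agent Skill for Claude Code / OpenClaw, with 109 downloads so far.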

How do I install 抖音文案提取?

Run "/install douyin-text-grab" in the OpenClaw or Claude Code chat to install it in one step. Note that the bundled Python script also needs the python-docx package, which the skill metadata does not declare.

Is 抖音文案提取 free?

Yes, 抖音文案提取 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does 抖音文案提取 support?

抖音文案提取 is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created 抖音文案提取?

It is built and maintained by csak47mu (@csak47mu); the current version is v1.0.0.
