← 返回 Skills 市场
Deepseek Extract
作者
ZhangYuanzhuo
· GitHub ↗
· v1.0.0
· MIT-0
81
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install deepseek-extract
功能描述
Extract full conversation content from DeepSeek shared chat links. Use when: user provides a DeepSeek share URL (chat.deepseek.com/share/...), wants to extra...
使用说明 (SKILL.md)
\r \r
DeepSeek Extract\r
\r Extract full conversation content from DeepSeek shared chat links.\r \r
Prerequisites\r
\r Before using this skill, ensure the following are installed:\r \r
pip install playwright\r
playwright install chromium\r
```\r
\r
If `playwright` is not installed when the skill runs, inform the user and offer to run the install commands above.\r
\r
## Workflow\r
\r
### Step 1: Validate the URL\r
\r
1. Check if the provided URL matches `https://chat.deepseek.com/share/...`\r
2. If the URL does not match this pattern, inform the user:\r
- "This skill only supports DeepSeek share links (chat.deepseek.com/share/...)."\r
- If the URL is from another AI platform (ChatGPT, Claude, etc.), suggest the appropriate skill or manual copy.\r
3. If valid, proceed to Step 2.\r
\r
### Step 2: Run the extraction script\r
\r
Execute the extraction script from the skill's `scripts/` directory:\r
\r
```bash\r
python3 "\x3Cskill_dir>/scripts/extract_deepseek.py" "\x3Cshare_url>" --output "\x3Coutput_path>"\r
```\r
\r
**Parameters:**\r
- `\x3Cshare_url>`: The DeepSeek share URL (required)\r
- `--output`: Output file path (optional, defaults to `./deepseek_conversation.md`)\r
- `--format`: Output format — `markdown` (default) or `json`\r
- `--headed`: Run browser in headed mode for debugging (optional flag)\r
- `--timeout`: Page load timeout in milliseconds (optional, defaults to 30000)\r
\r
Replace `\x3Cskill_dir>` with the actual skill base directory path.\r
\r
**Error handling:**\r
- If `playwright` is not installed: inform the user and offer to run `pip install playwright && playwright install chromium`\r
- If `python3` is not found: try `python` instead\r
- If the script exits with a timeout error: retry with `--timeout 60000`\r
- If the script exits with an empty extraction: try with `--headed` flag to debug visually\r
\r
### Step 3: Verify and deliver\r
\r
1. Read the output file to verify content was extracted successfully.\r
2. If the output is empty or contains fewer than 2 messages:\r
- Try again with `--headed` flag for debugging\r
- The DeepSeek page may have anti-bot protection — inform the user\r
- Suggest: "DeepSeek may be blocking automated access. You can try manually copying the conversation."\r
3. If content was extracted successfully, present it to the user.\r
4. If the user wants a different format (Word, PDF, etc.), use other skills (docx, pdf) to convert.\r
\r
### Step 4: Cleanup\r
\r
Remove any temporary files created during extraction. Keep the output file unless the user specifies otherwise.\r
\r
## How It Works\r
\r
The extraction script uses Playwright (headless Chromium) to:\r
1. Navigate to the DeepSeek share URL\r
2. Wait for the SPA to fully render (JavaScript execution)\r
3. Try multiple CSS selector strategies to find message elements\r
4. Classify messages as user or assistant based on class names and data attributes\r
5. If primary selectors fail, use aggressive text-parsing fallback\r
6. Output the result as Markdown or JSON\r
\r
## Output Format\r
\r
### Markdown (default)\r
```markdown\r
# DeepSeek 对话记录\r
\r
> 来源: \x3Cshare_url>\r
\r
---\r
\r
## 用户\r
\x3Cmessage content>\r
\r
## DeepSeek\r
\x3Cresponse content>\r
\r
---\r
(repeat for each turn)\r
```\r
\r
### JSON\r
```json\r
{\r
"url": "\x3Cshare_url>",\r
"title": "\x3Cpage title>",\r
"messages": [\r
{"role": "user", "content": "..."},\r
{"role": "assistant", "content": "..."}\r
]\r
}\r
```\r
\r
## Troubleshooting\r
\r
| Issue | Cause | Solution |\r
|-------|-------|----------|\r
| No messages extracted | Anti-bot / CAPTCHA | Try `--headed` flag; copy manually |\r
| Timeout error | Slow network | Retry with `--timeout 60000` |\r
| Incomplete content | Lazy loading | Script auto-scrolls; try `--headed` to verify |\r
| `playwright` not found | Not installed | Run `pip install playwright && playwright install chromium` |\r
| `python3` not found | Windows environment | Try `python` instead of `python3` |\r
安全使用建议
This skill appears to do what it says: locally render a DeepSeek share URL with Playwright and extract messages. Before installing or running it, note: (1) you will likely need to run pip install playwright and playwright install chromium, which downloads browser binaries into your environment; (2) the script opens and executes JavaScript on the target page (the provided DeepSeek URL) to render content — only provide share URLs you trust or are authorized to extract; (3) the skill does not request credentials or exfiltrate data to third parties, but installing Python packages always carries the usual supply-chain risk, so install packages from trusted sources and inspect code if you have concerns; (4) the URL validation is strict and may reject some valid share URLs with extra query strings or atypical formats. If you want greater assurance, review the full script contents locally before running and run Playwright installs in an isolated environment (virtualenv or container).
功能分析
Type: OpenClaw Skill
Name: deepseek-extract
Version: 1.0.0
The skill is a legitimate utility designed to extract conversation history from DeepSeek share links using Playwright for browser automation. The Python script (scripts/extract_deepseek.py) implements robust scraping logic with multiple selector strategies and fallback mechanisms, and its behavior strictly aligns with the stated purpose in SKILL.md and README.md without any evidence of data exfiltration, malicious execution, or prompt injection.
能力标签
能力评估
Purpose & Capability
Name/description match the included script and SKILL.md: the skill uses Playwright + headless Chromium to render DeepSeek share pages and extract messages. Required binary is just python3, which is reasonable for a Python extraction script. Nothing requested (no credentials, no unrelated binaries) appears out of scope for the stated task.
Instruction Scope
SKILL.md restricts usage to chat.deepseek.com/share/... URLs and instructs running the included script to produce Markdown/JSON output. The runtime instructions ask only to run the local script, check for Playwright, read the output file, and optionally retry with headed mode; they do not direct the agent to read unrelated local files, other credentials, or post data to third-party endpoints. The extraction code navigates only to the provided URL (after validating it) and parses page content.
Install Mechanism
This is an instruction-only skill with a bundled Python script (no automated install spec). It requires the user to install Playwright and its Chromium browsers (pip install playwright; playwright install chromium). Installing Playwright will download browser binaries from the internet; this is expected for a browser-based scraper but is a worthwhile operational note for users (it modifies local environment and adds large browser binaries).
Credentials
The skill requires no environment variables, no credentials, and no config paths. The code does not attempt to access system secrets or other services beyond loading the provided DeepSeek URL.
Persistence & Privilege
The skill is not always-on and does not request persistent privileges. It does not modify other skills' configs or system-wide agent settings. The agent may run it autonomously (default), which is normal and not excessive here.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install deepseek-extract - 安装完成后,直接呼叫该 Skill 的名称或使用
/deepseek-extract触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
v1.0.0: DeepSeek share link conversation extractor with Playwright, multiple selector strategies, Markdown/JSON output, error resilience
元数据
常见问题
Deepseek Extract 是什么?
Extract full conversation content from DeepSeek shared chat links. Use when: user provides a DeepSeek share URL (chat.deepseek.com/share/...), wants to extra... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 81 次。
如何安装 Deepseek Extract?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install deepseek-extract」即可一键安装,无需额外配置。
Deepseek Extract 是免费的吗?
是的,Deepseek Extract 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Deepseek Extract 支持哪些平台?
Deepseek Extract 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Deepseek Extract?
由 ZhangYuanzhuo(@zz0116)开发并维护,当前版本 v1.0.0。
推荐 Skills