Deepseek Extract
/install deepseek-extract
\r \r
DeepSeek Extract\r
\r Extract full conversation content from DeepSeek shared chat links.\r \r
Prerequisites\r
\r Before using this skill, ensure the following are installed:\r \r
pip install playwright\r
playwright install chromium\r
```\r
\r
If `playwright` is not installed when the skill runs, inform the user and offer to run the install commands above.\r
\r
## Workflow\r
\r
### Step 1: Validate the URL\r
\r
1. Check if the provided URL matches `https://chat.deepseek.com/share/...`\r
2. If the URL does not match this pattern, inform the user:\r
- "This skill only supports DeepSeek share links (chat.deepseek.com/share/...)."\r
- If the URL is from another AI platform (ChatGPT, Claude, etc.), suggest the appropriate skill or manual copy.\r
3. If valid, proceed to Step 2.\r
\r
### Step 2: Run the extraction script\r
\r
Execute the extraction script from the skill's `scripts/` directory:\r
\r
```bash\r
python3 "\x3Cskill_dir>/scripts/extract_deepseek.py" "\x3Cshare_url>" --output "\x3Coutput_path>"\r
```\r
\r
**Parameters:**\r
- `\x3Cshare_url>`: The DeepSeek share URL (required)\r
- `--output`: Output file path (optional, defaults to `./deepseek_conversation.md`)\r
- `--format`: Output format — `markdown` (default) or `json`\r
- `--headed`: Run browser in headed mode for debugging (optional flag)\r
- `--timeout`: Page load timeout in milliseconds (optional, defaults to 30000)\r
\r
Replace `\x3Cskill_dir>` with the actual skill base directory path.\r
\r
**Error handling:**\r
- If `playwright` is not installed: inform the user and offer to run `pip install playwright && playwright install chromium`\r
- If `python3` is not found: try `python` instead\r
- If the script exits with a timeout error: retry with `--timeout 60000`\r
- If the script exits with an empty extraction: try with `--headed` flag to debug visually\r
\r
### Step 3: Verify and deliver\r
\r
1. Read the output file to verify content was extracted successfully.\r
2. If the output is empty or contains fewer than 2 messages:\r
- Try again with `--headed` flag for debugging\r
- The DeepSeek page may have anti-bot protection — inform the user\r
- Suggest: "DeepSeek may be blocking automated access. You can try manually copying the conversation."\r
3. If content was extracted successfully, present it to the user.\r
4. If the user wants a different format (Word, PDF, etc.), use other skills (docx, pdf) to convert.\r
\r
### Step 4: Cleanup\r
\r
Remove any temporary files created during extraction. Keep the output file unless the user specifies otherwise.\r
\r
## How It Works\r
\r
The extraction script uses Playwright (headless Chromium) to:\r
1. Navigate to the DeepSeek share URL\r
2. Wait for the SPA to fully render (JavaScript execution)\r
3. Try multiple CSS selector strategies to find message elements\r
4. Classify messages as user or assistant based on class names and data attributes\r
5. If primary selectors fail, use aggressive text-parsing fallback\r
6. Output the result as Markdown or JSON\r
\r
## Output Format\r
\r
### Markdown (default)\r
```markdown\r
# DeepSeek 对话记录\r
\r
> 来源: \x3Cshare_url>\r
\r
---\r
\r
## 用户\r
\x3Cmessage content>\r
\r
## DeepSeek\r
\x3Cresponse content>\r
\r
---\r
(repeat for each turn)\r
```\r
\r
### JSON\r
```json\r
{\r
"url": "\x3Cshare_url>",\r
"title": "\x3Cpage title>",\r
"messages": [\r
{"role": "user", "content": "..."},\r
{"role": "assistant", "content": "..."}\r
]\r
}\r
```\r
\r
## Troubleshooting\r
\r
| Issue | Cause | Solution |\r
|-------|-------|----------|\r
| No messages extracted | Anti-bot / CAPTCHA | Try `--headed` flag; copy manually |\r
| Timeout error | Slow network | Retry with `--timeout 60000` |\r
| Incomplete content | Lazy loading | Script auto-scrolls; try `--headed` to verify |\r
| `playwright` not found | Not installed | Run `pip install playwright && playwright install chromium` |\r
| `python3` not found | Windows environment | Try `python` instead of `python3` |\r
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install deepseek-extract - After installation, invoke the skill by name or use
/deepseek-extract - Provide required inputs per the skill's parameter spec and get structured output
What is Deepseek Extract?
Extract full conversation content from DeepSeek shared chat links. Use when: user provides a DeepSeek share URL (chat.deepseek.com/share/...), wants to extra... It is an AI Agent Skill for Claude Code / OpenClaw, with 81 downloads so far.
How do I install Deepseek Extract?
Run "/install deepseek-extract" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Deepseek Extract free?
Yes, Deepseek Extract is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Deepseek Extract support?
Deepseek Extract is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Deepseek Extract?
It is built and maintained by ZhangYuanzhuo (@zz0116); the current version is v1.0.0.