← Back to Skills Marketplace
qinkaizhou

url2pdf-mk

by QKZ · GitHub ↗ · v1.1.1 · MIT-0
cross-platform ✓ Security Clean
157
Downloads
0
Stars
0
Active Installs
10
Versions
Install in OpenClaw
/install url2pdf-mk
Description
EN: Webpage to PDF + Markdown converter. Converts any URL (especially WeChat articles) into offline-readable PDF and Markdown files with complete images, pre...
Usage Guidance
This skill appears to do what it says, but it controls a real browser and by default reuses your Chrome/Chromium profile. That means if you run it in default (non-isolated) mode the script can access cookies, logged-in sessions, and other profile data. The SKILL.md already warns about this; before using the skill: (1) prefer --isolated mode if you don't want the tool to access your login sessions, (2) run it on non-sensitive accounts or inside a VM/container if you need extra isolation, (3) check the xlsx input file is from a trusted source (the script will read it and access the listed URLs), (4) be aware the tool opens a local CDP proxy port and writes temp files (cdp-proxy-<user>), and (5) review/verify the included scripts and Python dependencies before running. If you want to avoid any possibility of the agent autonomously launching the browser against your profile, disable model/skill invocation or run the tool manually with explicit command-line flags.
Capability Assessment
Purpose & Capability
Name/description (webpage → PDF/Markdown, WeChat articles) match the provided scripts: HTTP scraping, browser (CDP) scraping, batch/xlsx handling, Markdown/PDF generation. Requiring a Chrome/Chromium installation and Python libraries is appropriate for the described functionality.
Instruction Scope
SKILL.md and scripts instruct launching/controlling a local browser via CDP, reading an input xlsx, creating date folders on the desktop, and writing output files and temporary proxy state files. The skill explicitly documents that browser mode reuses the user's Chrome profile (access to cookies/login sessions) and offers an --isolated flag. These actions are within the scope of the stated purpose but have privacy implications the user should be aware of.
Install Mechanism
No remote download/install spec is present; this is an instruction/code bundle with local Python scripts. No external binary downloads or strange URLs are used. Dependencies are standard Python packages (requests, openpyxl, websockets, etc.) and the browser binary (Chrome/Chromium).
Credentials
The skill declares no required environment variables or credentials. However, by default it may reuse the user's Chrome profile directory (platform-specific profile paths) and will read/write temp state files (cdp-proxy-<user>) and output to Desktop—access that is proportionate to scraping but can expose cookies, session tokens, and local profile data. The SKILL.md warns about this and offers an isolated mode.
Persistence & Privilege
always:false and no special platform-wide modifications are requested. The CDP proxy may spawn a local helper process and write proxy state files in the temp directory while running, but nothing indicates it persists beyond normal runtime files or modifies other skills/configs.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install url2pdf-mk
  3. After installation, invoke the skill by name or use /url2pdf-mk
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.1.1
- Adds a prominent security note in the description about browser mode reusing your Chrome profile, which can access cookies and sessions. - Warns that the tool may open a local CDP proxy and store temporary state files (cdp-proxy-<user>) in your temp directory. - Recommends using `--isolated` mode for public content or running in a VM/sandbox to protect sensitive data. - No code or feature changes; this update improves user security awareness.
v1.1.0
No code or functional changes; SKILL.md now includes a full English/Chinese bilingual description and workflow. - The description has been rewritten in both English and Chinese for broader accessibility. - Documentation sections, features, workflow, environment requirements, and usage instructions are now bilingual. - No changes to scripts, logic, or skill behavior—documentation update only.
v1.0.9
### v1.0.9(2026-04-14)— 文档增强 - 新增「概述」章节,简要介绍核心功能定位 - 新增「工作机制」章节,含技术架构流程图、详细流程说明、智能路由、CDP 说明 - 增加「适配环境」章节,列出操作系统、浏览器及 Python 环境支持 - 新增「基础组件依赖」章节,详细说明依赖包、系统要求及快速环境检查方法 - 优化文档结构,整合环境信息、统一表格格式,提高可读性
v1.0.8
### v1.0.8(2026-04-13)— 安全增强 - 新增“重要安全警告”说明,包括 Cookie、临时文件、CDP 端口安全、运行权限与环境建议 - main.py 增加临时文件清理日志,清理成功/失败均有提示
v1.0.6
- 移除 _meta.json 文件,清理元数据文件夹 - 其余功能和用法保持不变
v1.0.5
url2pdf-mk v1.0.5 — 安全修复版本 - 移除 `batch_http.py` 中的硬编码 xlsx 默认路径,需显式传入 xlsx 文件路径 - 所有脚本:pip 依赖改为运行前**预装检查 + 报错提示**,不再运行时自动安装,提高安全性 - `batch_http.py` 新增系统目录路径安全校验 - 文档(SKILL.md)调整安全建议,`--isolated` 推荐为公开内容默认模式
v1.0.3
Two changes were made based on ClawHub’s security scan: SKILL.md — added a “Security Notice” section explaining that reuse_profile=True reuses the user’s real Chrome profile and cookies, recommending batch_http.py as a cookie-free alternative. It also documents the local CDP Proxy on port 9223, the automatic pip installs via Tsinghua mirror, and provides the recommended pre-install command. batch_scrape.py — hardened sys.path: removed the sys.path.insert(1, ORIG_CDP_SCRIPTS) line that added the entire QClaw browser-cdp directory to the Python path. All required modules (cdp_proxy, cdp_client, browser_actions) already exist locally in the skill’s scripts/ directory, so no external directory is needed. Functional behavior is unchanged; the attack surface is reduced.
v1.0.2
url2pdf-mk v1.0.2 - Introduced a new unified entry point: `main.py` for smart routing and simplified usage. - Added modular scripts: `_encoding_fix.py`, `browser_actions.py`, `browser_launcher.py`, `cdp_client.py`, `cdp_proxy.py`, and `page_snapshot.py` for improved code structure and maintainability. - Removed deprecated `example.py` script. - Now automatically selects between single and batch processing based on URL count. - Streamlined the user experience for both single and batch webpage to PDF/Markdown workflows.
v1.0.1
- Added batch processing scripts: `batch_scrape.py` (browser, PDF+Markdown) and `batch_http.py` (HTTP-only, Markdown). - Now supports xlsx-based bulk webpage/article conversion for streamlined offline archiving. - Existing single-page script (`scrape.py`) unchanged for individual use. - Updated documentation to reflect new batch capabilities and usage instructions.
v1.0.0
Initial release of url2pdf-mk. - 支持将任意微信文章或网页一键导出为排版完整的 PDF 和 Markdown 文件,附带所有正文图片。 - 自动按日期归档至桌面文件夹,文件名包含文章发布日期和标题。 - 专为微信文章优化,正确处理图片、懒加载、时间戳及公众号信息。 - 具备跨平台兼容性(Windows/macOS/Linux),自动适配中文字体。 - 降级兼容:无 Chrome 时可仅导出无图片文本 PDF。 - 已通过安全审查,无高风险操作,保证桌面文件写入安全。
Metadata
Slug url2pdf-mk
Version 1.1.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 10
Frequently Asked Questions

What is url2pdf-mk?

EN: Webpage to PDF + Markdown converter. Converts any URL (especially WeChat articles) into offline-readable PDF and Markdown files with complete images, pre... It is an AI Agent Skill for Claude Code / OpenClaw, with 157 downloads so far.

How do I install url2pdf-mk?

Run "/install url2pdf-mk" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is url2pdf-mk free?

Yes, url2pdf-mk is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does url2pdf-mk support?

url2pdf-mk is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created url2pdf-mk?

It is built and maintained by QKZ (@qinkaizhou); the current version is v1.1.1.

💬 Comments