
wos-crawler

Name: wos-crawler
Author: grizzlyccc

By GrizzlyCcc · GitHub ↗ · v1.1.2 · MIT-0

cross-platform ⚠ suspicious

Total downloads: 180

Current installs: 1

Versions: 3

Install in OpenClaw

/install wos-crawler

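Description

An all-in-one WOS literature tool: Web of Science search crawling plus batch PDF download, all within a single web interface. It crawls the WOS literature list with Selenium, exports the results to Excel, and then automatically invokes a multi-channel PDF downloader (Sci-Hub/CORE/S2 OA/OpenAlex/Unpaywall/Publisher...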

Safe usage advice

Do not run or grant credentials to this skill as-is. The SKILL.md expects local scripts (scripts/web_ui.py), Python packages, browser automation, and persistent cookie storage, but the package contains no code or install instructions — it appears incomplete or mismatched. Before installing or running anything, ask the publisher for: (1) the source repository or packaged code; (2) exact install steps and a manifest of files the skill will write (where cookies and PDFs are stored); (3) a list of external endpoints/domains the skill contacts (e.g., Sci‑Hub, publishers); (4) whether any credentials are uploaded or transmitted externally. Be especially cautious about giving your WOS institutional credentials to unknown software and about automated access to Sci‑Hub or publisher sites — those can have legal and policy implications. If the author cannot provide verifiable code or a trustworthy source repo, treat the skill as unsafe to run.
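Item (3) above can be partially checked by hand once source code is available. The following is a minimal sketch, not part of the skill: it lists the hostnames that appear in hard-coded URLs under a directory. The function name `referenced_hosts` and the URL pattern are illustrative assumptions, and the scan cannot see endpoints that are built at runtime.

```python
import re
from pathlib import Path
from urllib.parse import urlparse

# Rough pattern for hard-coded http(s) URLs; an assumption, not a full URL grammar.
URL_PATTERN = re.compile(r"https?://[^\s\"'<>)\]]+")


def referenced_hosts(skill_dir):
    """Return the sorted hostnames found in URLs inside files under skill_dir."""
    hosts = set()
    for path in Path(skill_dir).rglob("*"):
        if not path.is_file():
            continue
        try:
            text = path.read_text(encoding="utf-8", errors="ignore")
        except OSError:
            continue  # unreadable file: skip rather than abort the audit
        for url in URL_PATTERN.findall(text):
            try:
                host = urlparse(url).hostname
            except ValueError:
                continue  # malformed URL fragment
            if host:
                hosts.add(host.lower())
    return sorted(hosts)
```

An empty result does not show that a skill is offline; it only means no literal URLs were found in the files that were scanned.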

Functional analysis

Type: OpenClaw Skill
Name: wos-crawler
Version: 1.1.2

The skill bundle describes a literature crawling tool that relies on high-risk capabilities: automated browser control via Selenium, a local web server (Flask) running on port 5678, and cookie persistence for session management. These features are consistent with the stated purpose of crawling Web of Science and downloading PDFs, but the combination of credential handling and a local network service presents a significant attack surface. The actual implementation code (e.g., `scripts/web_ui.py`) is not included in the bundle, so the logic cannot be fully verified and unauthorized data exfiltration cannot be ruled out.
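Because the review mentions a local server on port 5678, it can be useful to confirm whether anything is listening there before and after trying the skill. This is a minimal sketch using only the standard library; the helper name `is_port_listening` is hypothetical, and the check says nothing about which process owns the port.

```python
import socket


def is_port_listening(port, host="127.0.0.1", timeout=0.5):
    """Return True if a TCP connection to host:port succeeds."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(timeout)
        # connect_ex returns 0 on success and an errno value otherwise.
        return sock.connect_ex((host, port)) == 0


if __name__ == "__main__":
    # 5678 is the port named in the review above.
    print("port 5678 listening:", is_port_listening(5678))
```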

Capability assessment

⚠ Purpose & Capability

The declared purpose (WOS crawling + multi-source PDF download) is plausible for the described tools (Selenium, Sci‑Hub, Unpaywall, etc.), but the skill manifest declares no code, no install, no required binaries, and no config paths while the SKILL.md explicitly instructs running python {SKILL_DIR}/scripts/web_ui.py and using Edge + Selenium. That mismatch (instructions requiring scripts and runtime components that are not present or declared) is internally inconsistent.
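The mismatch described here can be reproduced mechanically. The sketch below assumes that SKILL.md references bundled files in the `{SKILL_DIR}/relative/path` form quoted above; the function name and the regular expression are illustrative and not part of OpenClaw.

```python
import re
from pathlib import Path

# Matches paths written as {SKILL_DIR}/relative/path (assumed convention).
SKILL_PATH_PATTERN = re.compile(r"\{SKILL_DIR\}/([\w./-]+)")


def missing_referenced_files(skill_dir, skill_md_name="SKILL.md"):
    """Return paths referenced in SKILL.md that do not exist in the bundle."""
    root = Path(skill_dir)
    text = (root / skill_md_name).read_text(encoding="utf-8")
    # Strip a trailing period picked up from sentence punctuation.
    referenced = sorted({match.rstrip(".") for match in SKILL_PATH_PATTERN.findall(text)})
    return [rel for rel in referenced if not (root / rel).is_file()]
```

For a bundle like the one reviewed, where SKILL.md points at `scripts/web_ui.py` but no such file is shipped, the function would return that path.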

⚠ Instruction Scope

SKILL.md tells the agent/user to run a local web UI that will: open/automate a browser to log into Web of Science, persist cookies, export Excel, and use multiple external download channels (including Sci‑Hub). Those actions involve reading/writing local files, storing cookies, and making network requests to third parties; yet the package provides no implementation and gives broad autonomy to perform credentialed access and file I/O without specifying where data or cookies are stored. Instructions therefore grant broad scope without corresponding code or safeguards.

ℹ Install Mechanism

There is no install specification (instruction-only), which is low risk in itself — but SKILL.md assumes existence of scripts and Python dependencies (selenium, flask, etc.) and instructs running a local script. The absence of any shipped code or install steps makes the skill unusable as packaged and raises questions about where the referenced scripts come from.

⚠ Credentials

The SKILL.md requires a WOS institution account and an Edge browser and implies persistent cookies and local storage of downloaded PDFs, yet requires.env, primary credential, and config path fields are all empty. Sensitive items (WOS login session/cookies) are indirectly required but not declared or scoped, which is disproportionate and opaque.

⚠ Persistence & Privilege

The skill states it will persist cookies and perform breakpoint-resume of downloads and write PDFs to an output directory, indicating it will create and maintain local state. However, the package does not declare where that state is stored nor provide code. While always:false and autonomous invocation are normal, undocumented persistence of cookies and credential-bearing sessions increases risk and should be clarified.
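Since the package does not say where cookies or PDFs are written, one way to find out is to snapshot a directory before and after a run and compare the two. This is a minimal sketch with hypothetical helper names (`snapshot`, `diff_snapshots`); it only observes the directory tree it is pointed at, so files written elsewhere would go unnoticed.

```python
import hashlib
from pathlib import Path


def snapshot(root):
    """Map each file's path (relative to root) to the SHA-256 of its contents."""
    root = Path(root)
    result = {}
    for path in root.rglob("*"):
        if path.is_file():
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            result[path.relative_to(root).as_posix()] = digest
    return result


def diff_snapshots(before, after):
    """Report files created, modified, or deleted between two snapshots."""
    common = set(before) & set(after)
    return {
        "created": sorted(set(after) - set(before)),
        "modified": sorted(p for p in common if before[p] != after[p]),
        "deleted": sorted(set(before) - set(after)),
    }
```

Taking the first snapshot in a disposable working directory, running the tool there, and then diffing gives a rough manifest of the local state it creates.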

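How to use

Make sure OpenClaw is installed (locally or via Docker)
Enter the install command in the chat box: /install wos-crawler
Once installation finishes, call the Skill by name or trigger it with /wos-crawler
Provide the required inputs as described in the Skill's parameter documentation to receive structured output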

Version history

v1.1.2

- Major update: removed the main crawling script, disabling core skill functionality.
- File removed: scripts/wos_crawler.py
- The skill cannot perform WOS literature crawling without this file.

v1.0.1

wos-crawler 1.0.1
- No file changes detected in this version.
- Functionality and documentation remain unchanged.

v1.0.0

- Initial release of wos-crawler, a Web of Science literature crawling tool with an enhanced tkinter GUI.
- Supports automatic WOS advanced query construction from natural language, and bulk fetching of titles, authors, journals, DOIs, citation counts, abstracts, etc.
- Enables filtering by journal (preset + custom), author/title/DOI precise search, literature type, and year range.
- Provides automatic paging, result deduplication, Excel export, and cookie persistence for login state.
- Features live query preview, journal management (add/remove/clear), and error auto-screenshot.
- Uses Selenium + Edge; requires only selenium, pandas, openpyxl (Selenium 4.6+ for driver management).
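The skill's own query builder is not available for inspection, so the following is only a sketch of what building an advanced query from structured filters and deduplicating results might look like. It does not attempt the natural-language step. The field tags used (TS, SO, AU, TI, DO, PY) reflect the Web of Science advanced search syntax as commonly documented and are an assumption here, as are the function names and the record keys `doi` and `title`.

```python
def build_wos_query(topic=None, journals=(), author=None, title=None,
                    doi=None, year_from=None, year_to=None):
    """Combine optional filters into one advanced-search string joined by AND."""
    clauses = []
    if topic:
        clauses.append(f"TS=({topic})")
    if journals:
        joined = " OR ".join(f'"{name}"' for name in journals)
        clauses.append(f"SO=({joined})")
    if author:
        clauses.append(f"AU=({author})")
    if title:
        clauses.append(f'TI=("{title}")')
    if doi:
        clauses.append(f"DO=({doi})")
    if year_from is not None or year_to is not None:
        # A single bound is treated as a one-year filter.
        start = year_from if year_from is not None else year_to
        end = year_to if year_to is not None else year_from
        clauses.append(f"PY=({start})" if start == end else f"PY=({start}-{end})")
    if not clauses:
        raise ValueError("at least one search criterion is required")
    return " AND ".join(clauses)


def deduplicate(records):
    """Keep the first record per DOI, falling back to a normalized title."""
    seen = set()
    unique = []
    for record in records:
        doi = (record.get("doi") or "").strip().lower()
        title = " ".join((record.get("title") or "").lower().split())
        key = ("doi", doi) if doi else ("title", title)
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique
```

Keying on the DOI first avoids merging distinct papers that happen to share a title; records without a DOI fall back to a case-insensitive, whitespace-normalized title match.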

Metadata

Slug wos-crawler

Version 1.1.2

License MIT-0

Total installs 1

Current installs 1

Versions released 3

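FAQ

What is wos-crawler?

An all-in-one WOS literature tool: Web of Science search crawling plus batch PDF download, all within a single web interface. It crawls the WOS literature list with Selenium, exports the results to Excel, and then automatically invokes a multi-channel PDF downloader (Sci-Hub/CORE/S2 OA/OpenAlex/Unpaywall/Publisher... It is an AI Agent Skill plugin for Claude Code / OpenClaw and has 180 total downloads to date.

How do I install wos-crawler?

Run the command "/install wos-crawler" in the OpenClaw or Claude Code chat box for one-step installation; no additional configuration is required.

Is wos-crawler free?

Yes. wos-crawler is completely free and released under the MIT-0 license, so it can be downloaded, installed, and used freely.

Which platforms does wos-crawler support?

wos-crawler is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed wos-crawler?

It is developed and maintained by GrizzlyCcc (@grizzlyccc). The current version is v1.1.2.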