← 返回 Skills 市场

Deep Research Pro v5.0.1

Name: Deep Research Pro v5.0.1
Author: xueylee-dotcom

作者 xueylee-dotcom · GitHub ↗ · v5.0.1 · MIT-0

cross-platform ⚠ suspicious

224

总下载

当前安装

版本数

在 OpenClaw 中安装

/install deep-research-v50

功能描述

Performs deep research using a three-stage process: data extraction, thematic insight briefs with contradiction analysis, and narrative-driven strategic repo...

使用说明 (SKILL.md)

Skill: Deep Research Pro (v5.0 - 洞察引擎)

版本：5.0.1 描述：真深度研究技能 - 三阶段合成 + 失败即停止

核心原则

深度不是"写得多"，而是"每一行数据都可溯源"。

🔴 v5.0.1 强制规则（新增）

规则1：提取失败必须明确报错

# 如果提取的数据不满足最低要求，输出：
{
  "error": "提取失败",
  "reason": "样本量缺失 / 主要结果缺失 / 原文引用不足30字",
  "suggestion": "跳过此来源或人工复核"
}

规则2：质量评分必须校验内容

质量评分逻辑：
- 有样本量 + 主要结果 + 原文引用 ≥ 30字 → 8.0-9.0
- 有2项 → 7.0-7.5
- 有1项或全是"见原文" → 标记为"待验证"，不评分

规则3：报告必须区分"已验证"和"待验证"

## 已验证结论（基于核心论文）

### 结论1：LSTM在ICU场景预测准确率达0.87
- 来源：card-002 (PMC11110807)
- 证据：样本量1,250, 95%CI 0.82-0.91, p\x3C0.001
- 原文引用："The LSTM model achieved..." (Results, p.5)

## 待验证线索（基于元数据）

### 线索1：远程医疗或可节省成本
- 来源：card-001 (PubMed摘要)
- 状态：⚠️ 需人工访问原文验证

执行流程

Step 1: 检索与提取

# 使用PubMed API获取结构化数据
python3 scripts/extract-pmc.py \x3Cpmid>

# 如果返回error，跳过该来源
# 如果数据不全，标记为"待验证"

Step 2: 卡片生成（强制校验）

---
source_id: card-xxx
status: verified | pending | failed
quality_score: 8.5 | N/A
---

## 1. 核心数据提取

| 指标 | 数值 | 验证状态 |
|------|------|----------|
| 样本量 | 9,080 | ✅ 已提取 |
| 主要结果 | 未提取 | ⚠️ 待验证 |
| 原文引用 | "..."(30字+) | ✅ 已提取 |

## 2. 质量说明

- 数据完整度：2/3
- 建议：访问原文验证主要结果

Step 3: 报告生成（明确区分）

禁止：混合使用"已验证"和"待验证"数据

要求：

已验证结论：单独章节
待验证线索：单独章节 + 警告标识

质量门禁

卡片数量：≥5个有完整数据的
溯源验证：100%通过
明确区分：已验证 vs 待验证

v5.0.1 vs v5.0 对比

维度	v5.0	v5.0.1
提取失败	静默填充"见原文"	明确报错
质量评分	虚高(8.5分但空洞)	必须校验内容
报告生成	混合使用数据	明确区分已验证/待验证

Skill版本：5.0.1 | 最后更新：2026-03-19

安全使用建议

What to check before installing or running: - Input handling: The PDF extractor will download any URL you pass to it. Treat this as a network-capable program: do not pass untrusted or internal network URLs (SSRF risk). Run the scripts in a sandboxed environment with network egress controls if possible. - Documentation mismatch: SKILL.md shows usage with '<pmid>'/PubMed, but scripts.extract-pmc.py builds a PMC URL (expects a PMC id). Confirm which identifier to pass and test on benign known PMCID values first. - Hard-coded paths: synthesize.sh calls the check script using a /root/.openclaw/... absolute path. That may fail or, if the environment mirrors that path, read files from unexpected locations. Inspect and (if needed) modify the script to use relative paths inside the research directory before running. - Review network targets: extract-pmc.py requests NCBI/PMC (expected) and extract-from-pdf.py fetches arbitrary URLs (expected for PDFs). Ensure your runtime allows only the external hosts you trust; consider disabling outbound network or restricting DNS if you are uncertain. - Sanity-check outputs: the scripts write temporary JSON to /tmp and generate reports. Review those outputs and run check-sourcing.sh manually to confirm it only accesses expected 'sources' files. - Run first in an isolated environment: because the skill performs network I/O and file reads/writes, test it inside a container or VM with limited network access and without sensitive files mounted. If you need higher assurance, ask the skill author to: (1) fix the pmid/pmcid documentation mismatch, (2) avoid hard-coded absolute paths, and (3) add explicit input validation/whitelisting for PDF URLs.

功能分析

Type: OpenClaw Skill Name: deep-research-v50 Version: 5.0.1 The skill bundle is a well-structured research tool designed to extract and synthesize academic and industry data. It includes scripts for downloading and parsing PDFs (extract-from-pdf.py), scraping PubMed Central (extract-pmc.py), and performing quality assessments (quality-score.py). While the PDF extraction script performs network requests, it includes basic safety checks (e.g., enforcing the 'http' scheme), and the bash scripts use restrictive regex to prevent path traversal. The synthesize.sh script contains hardcoded demo content for a report, but this appears to be a placeholder or reference implementation for the agent rather than a malicious deception. No evidence of data exfiltration, backdoors, or harmful prompt injection was found.

能力评估

ℹ Purpose & Capability

Name/description (deep research + extraction/quality gating) aligns with the included scripts and templates. However SKILL.md and the run examples reference 'PubMed API' and a '<pmid>' argument while scripts.extract-pmc.py constructs a PMC URL (expects a PMC id) — a documentation/expectation mismatch. All required capabilities (PDF/PMC extraction, quality scoring, synthesis) are present and consistent with the stated purpose; no unrelated credentials or binaries are requested.

⚠ Instruction Scope

Runtime instructions call local scripts that (a) fetch remote resources (extract-from-pdf.py downloads arbitrary PDF URLs provided by the caller) and (b) read and grep report/source files. The scripts write extracted JSON to /tmp and create reports under supplied research directories. The agent instructions do not ask for unrelated secrets, but the ability to download arbitrary URLs can be abused (SSRF or fetching internal endpoints) if untrusted inputs are used. Also SKILL.md example usage (<pmid>) may mislead users about correct inputs (pmid vs pmcid).

✓ Install Mechanism

This is instruction-plus-script (no install spec). No remote installers, package downloads, or archive extraction are performed by the skill itself. The code uses standard Python libraries and requests/pdf parsers if installed. Lowest install risk from package distribution perspective.

✓ Credentials

Skill requests no environment variables, no credentials, and no config paths. The scripts only perform network requests to source sites (NCBI/any provided PDF URLs). No unrelated service tokens are requested. This is proportionate to the stated research purpose.

ℹ Persistence & Privilege

The skill is not always:true and does not request persistent privileges. One implementation detail: synthesize.sh invokes the check script via an absolute path (/root/.openclaw/.../scripts/check-sourcing.sh). That is odd but not inherently privileged — it may fail on some hosts or indicate assumptions about runtime layout. No code attempts to modify other skills or system-wide config.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install deep-research-v50
安装完成后，直接呼叫该 Skill 的名称或使用 /deep-research-v50 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v5.0.1

Deep Research Pro v5.0.1 — 强化数据校验和透明度 - 新增强制错误报告机制，数据提取不合格时必须报错并给出跳过建议 - 质量评分逻辑升级，只为数据完整卡片打分，其余标记为“待验证”不计分 - 报告输出须明确分离“已验证结论”与“待验证线索”，避免混用 - 集成新的结构化提取脚本 extract-pmc.py，支持从PubMed自动抓取 - 降低质量门槛，要求有≥5个完整数据卡片，通过全部溯源验证

v5.0.0

Deep Research Pro v5.0.0 introduces a major upgrade, shifting from simple data listing to insight-driven narrative reporting. - Added a three-phase synthesis pipeline: Data → Insight (Theme Briefs) → Report. - New requirement: Cluster research cards into 3–5 theme briefs with contradiction analysis before final report generation. - Reports must now be structured as narrative arguments, not just lists, and provide clear business significance and action steps. - Enhanced quality criteria: ≥15 cards, ≥50% full-text, 100% sourcing validation, ≥3 contradiction analyses, and mandatory strategic insights per theme. - Updated toolset: Added theme synthesis script, new prompt templates for every stage, and strengthened verification workflows.

元数据

Slug deep-research-v50

版本 5.0.1

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 2

常见问题