← 返回 Skills 市场
kernix0421

Ai Research Scraper

作者 Kernix0421 · GitHub ↗ · v1.8.14
cross-platform ⚠ suspicious
2357
总下载
1
收藏
7
当前安装
23
版本数
在 OpenClaw 中安装
/install ai-research-scraper
功能描述
用于抓取AI领域最新研究信息的技能,重点关注AI产品发展。从知名AI网站获取信息,提供简洁概括和链接,限制数据量以便快速阅读。
使用说明 (SKILL.md)

AI Research Scraper

概述

这个技能专门用于从知名AI领域网站抓取最新研究信息,重点关注AI产品发展方面。它会限制抓取的数据量和token使用,提供简洁的信息概括,并包含原始网页链接,方便用户进一步查阅详细内容。

使用场景

  • 当您需要快速了解AI领域最新产品动态时
  • 当您想跟踪特定AI产品的发展信息时
  • 当您需要获取AI技术发展的最新研究成果摘要时

支持的网站

当前支持的AI领域知名网站包括:

  • TechCrunch AI板块
  • VentureBeat AI板块
  • MIT Technology Review AI板块
  • Google AI Blog
  • Microsoft AI Blog
  • NVIDIA Blog

功能特点

  • 数据量控制:限制摘要长度,确保信息简洁
  • 重点突出:聚焦AI产品发展信息
  • 链接完整:每个摘要都包含原始网页链接
  • 定时更新:支持定期抓取最新信息
  • 可配置性:支持添加或修改目标网站
  • 网络优化:增加超时时间和重试机制,提高抓取成功率
  • 缓存机制:添加1小时缓存,避免重复抓取相同内容
  • 稳定源:优化网站列表,使用更稳定的AI领域网站源
  • 备用搜索:支持使用tavily-search技能替代,避免网络超时问题
  • 简化处理:暂时移除翻译功能,避免API错误和网络超时问题

使用方法

快速开始

使用默认配置抓取AI产品发展相关信息:

python3 /root/.openclaw/workspace/skills/ai-research-scraper/scripts/scraper.py

配置选项

自定义网站列表

编辑 references/websites.txt 文件,添加或删除目标网站。每行格式:

网站名称|网站URL|RSS/Feed URL(可选)

控制摘要长度

python3 /root/.openclaw/workspace/skills/ai-research-scraper/scripts/scraper.py --max-tokens 500

指定时间范围

python3 /root/.openclaw/workspace/skills/ai-research-scraper/scripts/scraper.py --days 7

指定主题重点

python3 /root/.openclaw/workspace/skills/ai-research-scraper/scripts/scraper.py --topic product-development

脚本和资源

Scripts

  • scripts/scraper.py: 主要的网页抓取和信息提取脚本
  • scripts/example.py: 示例脚本(可删除或修改)

References

  • references/websites.txt: 包含要抓取的网站列表
  • references/api_reference.md: API参考文档(待完善)

Assets

  • assets/: 资源文件夹(可用于存放模板或其他静态资源)
安全使用建议
This skill is internally inconsistent and should be treated with caution. Before installing or running it: - Inspect the tavily-search skill referenced at /root/.openclaw/workspace/skills/tavily-search/scripts/search.mjs (the scraper invokes that Node script). Ensure you trust that code and that Node is the expected runtime. - Do not provide API keys or other credentials until the skill explicitly documents and declares them in metadata. The repo mentions Tavily and various translation APIs but the metadata lists no required env vars. - Be aware some scripts reference missing functions (e.g., translate_text) or rely on files that may not exist—the package may be broken or incomplete. - Run the scripts in a sandbox (non-root container) first and review network calls (outbound endpoints) to confirm they contact expected services (official blogs, known translation endpoints, or tavily api), not unknown domains. - If you need this functionality, ask the maintainer for an explicit dependency list (node, required skills), a manifest of required environment variables, and a fix for the missing/unused functions before granting any credentials or running the skill in production.
功能分析
Type: OpenClaw Skill Name: ai-research-scraper Version: 1.8.14 The skill is designed to scrape AI research information, primarily by invoking the local 'tavily-search' OpenClaw skill via `subprocess.run` in `scripts/scraper.py`. While there are several functional bugs and inconsistencies (e.g., `scraper.py` does not read `references/websites.txt` as advertised, and some test scripts attempt to import a non-existent `translate_text` function), these do not indicate malicious intent or significant security vulnerabilities. All external network calls are for legitimate translation or search APIs, with API keys correctly shown as placeholders. There is no evidence of data exfiltration, malicious execution, persistence, or prompt injection against the agent.
能力评估
Purpose & Capability
SKILL.md says it scrapes AI sites and provides concise summaries. The code does implement scraping and summarization, but it also depends on an external 'tavily-search' skill (invoking /root/.openclaw/workspace/skills/tavily-search/scripts/search.mjs via subprocess) and references multiple translation API clients/tests. The skill's registry metadata declares no required env vars or binaries, yet the code implies node must be present and external API keys may be needed. Those dependencies are not declared and are disproportionate to the simple description.
Instruction Scope
SKILL.md runtime instruction is to run scripts/scraper.py. The scripts do call subprocess to run another skill's Node script and make network calls to translation/search endpoints. Some scripts (e.g., simple_scraper.py) import translate_text from scraper.py, but scraper.py does not define translate_text—indicating broken or inconsistent instructions. The instructions do not mention the tavily-search dependency, Node requirement, or required API keys.
Install Mechanism
There is no install spec (instruction-only), so nothing will be automatically downloaded or installed by the registry. However the package includes runnable Python scripts that assume a runtime (node) and presence of another skill's files; running them will execute local code and spawn external processes. Lack of an install step reduces supply-chain risk but does not eliminate execution risk when the scripts are run.
Credentials
The skill declares no required environment variables or credentials, yet references translation APIs and Tavily Search in docs and tests that expect API keys (e.g., TAVILY_API_KEY, Google/Microsoft/Baidu/Youdao credentials). Tests and api_reference.md show explicit API-key usage. This mismatch (code/doc expecting secrets but metadata declaring none) is a red flag: running tests or enabling translation/search fallbacks could prompt you to add sensitive credentials without the skill declaring why.
Persistence & Privilege
The skill does not request always:true and is user-invocable only. It does not modify other skills or system-wide settings in the provided code. However it does spawn subprocesses that execute another skill's script from the workspace, which effectively grants it the ability to run code located in other skill directories—this increases runtime blast radius even though the skill itself does not persistently install or escalate privileges.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install ai-research-scraper
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /ai-research-scraper 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.8.14
No user-facing changes in this version. - Internal update: only _meta.json file changed; documentation, features, and usage unchanged.
v1.8.13
Version 1.8.13 of ai-research-scraper - No file changes detected in this release. - All features, configuration options, and documentation remain unchanged.
v1.8.12
- No changes detected in this version; functionality and documentation remain the same as the previous release.
v1.8.11
ai-research-scraper v1.8.11 - No file or documentation changes detected in this version. - Functionality and skill documentation remain unchanged from the previous release.
v1.8.10
Version 1.8.10 - No file changes detected in this release. - Functionality, configuration, and documentation remain unchanged from the previous version.
v1.8.9
Version 1.8.9 - No file changes detected in this release. - Functionality and documentation remain unchanged from the previous version.
v1.8.8
Version 1.8.8 of ai-research-scraper - No file changes detected in this release. - All documentation and features remain unchanged.
v1.8.7
Version 1.8.7 of ai-research-scraper - No changes were detected in this version. - Existing features and documentation remain the same.
v1.8.6
- No user-facing changes in this release. - Internal update: metadata file (_meta.json) modified. - No changes to usage or functionality.
v1.8.5
Version 1.8.5 - Documentation updated; no changes to the skill logic. - SKILL.md revised for clarity and detail on features and usage.
v1.8.4
- No file changes detected in this version. - Documentation remains unchanged from the previous release. - No new features, fixes, or updates introduced.
v1.8.3
Version 1.8.3 of ai-research-scraper - No file changes detected in this release. - No updates or modifications to SKILL.md or associated scripts/resources.
v1.8.2
Version 1.8.2 - Documentation updated: SKILL.md was revised for clarity and completeness. - No functional or code changes; only documentation was modified.
v1.8.1
- Initial metadata file (_meta.json) added for the skill. - No changes to logic or functionality. This update is for metadata purposes only.
v1.8.0
删除翻译功能,避免网络连接不稳定导致的超时问题
v1.7.0
- Translation features have been temporarily removed to avoid API errors and network timeout issues. - Added test scripts for translation modules: test_googletrans.py and test_microsoft_translator.py. - Documentation updated to reflect the removal of translation and simplify related usage instructions.
v1.6.0
改进翻译功能,添加OpenAI API翻译支持
v1.5.0
添加英文摘要自动翻译为中文的功能
v1.4.0
优化输出格式,只显示标题、摘要和来源链接
v1.3.0
添加使用tavily-search技能替代的功能,避免网络超时问题
元数据
Slug ai-research-scraper
版本 1.8.14
许可证
累计安装 7
当前安装数 7
历史版本数 23
常见问题

Ai Research Scraper 是什么?

用于抓取AI领域最新研究信息的技能,重点关注AI产品发展。从知名AI网站获取信息,提供简洁概括和链接,限制数据量以便快速阅读。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 2357 次。

如何安装 Ai Research Scraper?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ai-research-scraper」即可一键安装,无需额外配置。

Ai Research Scraper 是免费的吗?

是的,Ai Research Scraper 完全免费(开源免费),可自由下载、安装和使用。

Ai Research Scraper 支持哪些平台?

Ai Research Scraper 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Ai Research Scraper?

由 Kernix0421(@kernix0421)开发并维护,当前版本 v1.8.14。

💬 留言讨论