← 返回 Skills 市场
Apify Substack Scraper
作者
Marcin Dudek
· GitHub ↗
· v1.0.0
299
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install apify-substack-scraper
功能描述
Scrape Substack newsletters and articles. Use when user asks to search Substack, find newsletter posts, extract Substack content, or monitor Substack publica...
使用说明 (SKILL.md)
Substack Scraper
Scrape Substack newsletters using an Apify Actor via the REST API.
Actor ID
BULaGFURBV7WG3K81
Prerequisites
APIFY_TOKENenvironment variable must be setcurlandjqmust be available
Workflow
Step 1: Confirm parameters with user
Ask what they want to scrape. Supported input fields:
urls(array of strings) - Substack publication URLs to scrapemaxArticles(integer) - max articles per publicationincludeContent(boolean) - include full article text
Step 2: Run the Actor
RESULT=$(curl -s -X POST "https://api.apify.com/v2/acts/BULaGFURBV7WG3K81/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
-H "Content-Type: application/json" \
-d '{"urls": ["https://example.substack.com"], "maxArticles": 20}')
echo "$RESULT" | jq '.'
Step 3: Poll and fetch (if async)
RUN_ID=$(curl -s -X POST "https://api.apify.com/v2/acts/BULaGFURBV7WG3K81/runs?token=$APIFY_TOKEN" \
-H "Content-Type: application/json" \
-d '{"urls": ["https://example.substack.com"], "maxArticles": 100}' | jq -r '.data.id')
curl -s "https://api.apify.com/v2/actor-runs/$RUN_ID?token=$APIFY_TOKEN" | jq -r '.data.status'
curl -s "https://api.apify.com/v2/actor-runs/$RUN_ID/dataset/items?token=$APIFY_TOKEN" | jq '.'
Step 4: Present results
Summarize articles: titles, authors, dates, engagement. Offer JSON/CSV export.
Error Handling
- If APIFY_TOKEN not set:
export APIFY_TOKEN=your_token - If run FAILS: check log endpoint
安全使用建议
This skill appears coherent for invoking an Apify actor to scrape Substack, but consider the following before installing: 1) APIFY_TOKEN is required—treat it like a secret, ensure it has minimal permissions, and rotate if you suspect exposure. 2) The examples put the token in the URL query string which can leak to shell history or logs; prefer sending the token in an Authorization header if you run similar commands locally. 3) The skill will submit user-provided URLs to a third-party actor (actor ID shown). Verify you trust the actor/owner on Apify or inspect the actor's source on Apify before sending sensitive or internal URLs. 4) Be mindful of legal/terms-of-service and privacy considerations when scraping content. If you want stronger guarantees, ask the skill author for an option to use Authorization headers and to restrict/validate input URLs to Substack domains.
功能分析
Type: OpenClaw Skill
Name: apify-substack-scraper
Version: 1.0.0
The skill provides instructions for an AI agent to scrape Substack newsletters using the Apify platform. It utilizes standard curl commands to interact with the official Apify API (api.apify.com) and requires a user-provided APIFY_TOKEN. The logic is transparent, aligns with the stated purpose in SKILL.md, and contains no evidence of malicious behavior or data exfiltration.
能力评估
Purpose & Capability
Name/description require scraping Substack via Apify and the skill declares APIFY_TOKEN plus curl and jq. Those requirements logically match the described behavior (invoking an Apify actor via REST).
Instruction Scope
Instructions are narrowly focused on calling Apify REST endpoints and presenting dataset items. They do not ask the agent to read local files or other credentials. Two operational notes: (1) examples place the APIFY_TOKEN in the query string which can expose the token in shell history, process lists, and logs — using an Authorization header would be preferable; (2) the instructions do not validate or restrict user-supplied URLs (they accept arbitrary URLs), so users could accidentally ask the actor to fetch non-Substack or internal endpoints — this is a behavioral/usage risk rather than an incoherence.
Install Mechanism
Instruction-only skill with no install spec or external downloads. This is the lowest-risk install model and matches the declared metadata.
Credentials
Only APIFY_TOKEN is required and it is declared as the primary credential; that is proportionate given the skill invokes Apify's API. As noted above, embedding the token in the URL is less safe than using an Authorization header and you should ensure the token has limited scope and is rotated if compromised.
Persistence & Privilege
always is false and the skill does not request persistent or elevated privileges, nor does it modify other skills or system config. Autonomous invocation is permitted (platform default) but not combined with other concerning privileges.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install apify-substack-scraper - 安装完成后,直接呼叫该 Skill 的名称或使用
/apify-substack-scraper触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release - scrape Substack newsletters
元数据
常见问题
Apify Substack Scraper 是什么?
Scrape Substack newsletters and articles. Use when user asks to search Substack, find newsletter posts, extract Substack content, or monitor Substack publica... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 299 次。
如何安装 Apify Substack Scraper?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install apify-substack-scraper」即可一键安装,无需额外配置。
Apify Substack Scraper 是免费的吗?
是的,Apify Substack Scraper 完全免费(开源免费),可自由下载、安装和使用。
Apify Substack Scraper 支持哪些平台?
Apify Substack Scraper 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Apify Substack Scraper?
由 Marcin Dudek(@marcindudekdev)开发并维护,当前版本 v1.0.0。
推荐 Skills