← 返回 Skills 市场

Apify Substack Scraper

Name: Apify Substack Scraper
Author: marcindudekdev

作者 Marcin Dudek · GitHub ↗ · v1.0.0

cross-platform ✓ 安全检测通过

299

总下载

当前安装

版本数

在 OpenClaw 中安装

/install apify-substack-scraper

功能描述

Scrape Substack newsletters and articles. Use when user asks to search Substack, find newsletter posts, extract Substack content, or monitor Substack publica...

使用说明 (SKILL.md)

Substack Scraper

Scrape Substack newsletters using an Apify Actor via the REST API.

Actor ID

BULaGFURBV7WG3K81

Prerequisites

APIFY_TOKEN environment variable must be set
curl and jq must be available

Workflow

Step 1: Confirm parameters with user

Ask what they want to scrape. Supported input fields:

urls (array of strings) - Substack publication URLs to scrape
maxArticles (integer) - max articles per publication
includeContent (boolean) - include full article text

Step 2: Run the Actor

RESULT=$(curl -s -X POST "https://api.apify.com/v2/acts/BULaGFURBV7WG3K81/run-sync-get-dataset-items?token=$APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls": ["https://example.substack.com"], "maxArticles": 20}')
echo "$RESULT" | jq '.'

Step 3: Poll and fetch (if async)

RUN_ID=$(curl -s -X POST "https://api.apify.com/v2/acts/BULaGFURBV7WG3K81/runs?token=$APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"urls": ["https://example.substack.com"], "maxArticles": 100}' | jq -r '.data.id')
curl -s "https://api.apify.com/v2/actor-runs/$RUN_ID?token=$APIFY_TOKEN" | jq -r '.data.status'
curl -s "https://api.apify.com/v2/actor-runs/$RUN_ID/dataset/items?token=$APIFY_TOKEN" | jq '.'

Step 4: Present results

Summarize articles: titles, authors, dates, engagement. Offer JSON/CSV export.

Error Handling

If APIFY_TOKEN not set: export APIFY_TOKEN=your_token
If run FAILS: check log endpoint

安全使用建议

This skill appears coherent for invoking an Apify actor to scrape Substack, but consider the following before installing: 1) APIFY_TOKEN is required—treat it like a secret, ensure it has minimal permissions, and rotate if you suspect exposure. 2) The examples put the token in the URL query string which can leak to shell history or logs; prefer sending the token in an Authorization header if you run similar commands locally. 3) The skill will submit user-provided URLs to a third-party actor (actor ID shown). Verify you trust the actor/owner on Apify or inspect the actor's source on Apify before sending sensitive or internal URLs. 4) Be mindful of legal/terms-of-service and privacy considerations when scraping content. If you want stronger guarantees, ask the skill author for an option to use Authorization headers and to restrict/validate input URLs to Substack domains.

功能分析

Type: OpenClaw Skill Name: apify-substack-scraper Version: 1.0.0 The skill provides instructions for an AI agent to scrape Substack newsletters using the Apify platform. It utilizes standard curl commands to interact with the official Apify API (api.apify.com) and requires a user-provided APIFY_TOKEN. The logic is transparent, aligns with the stated purpose in SKILL.md, and contains no evidence of malicious behavior or data exfiltration.

能力评估

✓ Purpose & Capability

Name/description require scraping Substack via Apify and the skill declares APIFY_TOKEN plus curl and jq. Those requirements logically match the described behavior (invoking an Apify actor via REST).

ℹ Instruction Scope

Instructions are narrowly focused on calling Apify REST endpoints and presenting dataset items. They do not ask the agent to read local files or other credentials. Two operational notes: (1) examples place the APIFY_TOKEN in the query string which can expose the token in shell history, process lists, and logs — using an Authorization header would be preferable; (2) the instructions do not validate or restrict user-supplied URLs (they accept arbitrary URLs), so users could accidentally ask the actor to fetch non-Substack or internal endpoints — this is a behavioral/usage risk rather than an incoherence.

✓ Install Mechanism

Instruction-only skill with no install spec or external downloads. This is the lowest-risk install model and matches the declared metadata.

✓ Credentials

Only APIFY_TOKEN is required and it is declared as the primary credential; that is proportionate given the skill invokes Apify's API. As noted above, embedding the token in the URL is less safe than using an Authorization header and you should ensure the token has limited scope and is rotated if compromised.

✓ Persistence & Privilege

always is false and the skill does not request persistent or elevated privileges, nor does it modify other skills or system config. Autonomous invocation is permitted (platform default) but not combined with other concerning privileges.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install apify-substack-scraper
安装完成后，直接呼叫该 Skill 的名称或使用 /apify-substack-scraper 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

Initial release - scrape Substack newsletters

元数据

Slug apify-substack-scraper

版本 1.0.0

许可证 —

累计安装 1

当前安装数 1

历史版本数 1

常见问题