protoss70

Apify Ultimate Scraper

Author: protoss70 · GitHub · v1.0.1
cross-platform ✓ Security check passed
Total downloads: 1547 · Favorites: 6 · Current installs: 9 · Versions: 2
Install in OpenClaw
/install apify-ultimate-scraper
Description
Universal AI-powered web scraper for any platform. Scrape data from Instagram, Facebook, TikTok, YouTube, Google Maps, Google Search, Google Trends, Booking.com, and TripAdvisor. Use for lead generation, brand monitoring, competitor analysis, influencer discovery, trend research, content analytics, audience analysis, or any data extraction task.
Usage Instructions (SKILL.md)

Universal Web Scraper

AI-driven data extraction from 55+ Actors across all major platforms. This skill automatically selects the best Actor for your task.

Prerequisites

  • APIFY_TOKEN configured in OpenClaw settings
  • Node.js 20.6+
  • mcpc CLI (auto-installed via skill metadata)

Input Sanitization Rules

Before substituting any value into a bash command:

  • ACTOR_ID: Must be either a technical name (owner/actor-name: alphanumeric, hyphens, dots, one slash) or a raw ID (exactly 17 alphanumeric characters, e.g., oeiQgfg5fsmIJB7Cn). Reject values containing shell metacharacters (; | & $ ` ( ) { } < > ! \).
  • SEARCH_KEYWORDS: Plain text words only. Reject shell metacharacters.
  • JSON_INPUT: Must be valid JSON. Must not contain single quotes (use escaped double quotes). Validate structure before use.
  • Output filenames: Must match YYYY-MM-DD_descriptive-name.{csv,json}. No path separators (/, ..), no spaces, no metacharacters.
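
The rules above can be sketched as allowlist checks. This is a minimal illustration, not the validation code shipped with the skill: the function names are hypothetical, and two details are assumptions (that "plain text words" means letters, digits, and single spaces, and that descriptive-name uses only alphanumerics and hyphens). Because each check is an allowlist, shell metacharacters are rejected implicitly.

```javascript
// Hypothetical helpers mirroring the sanitization rules; names are illustrative.

// owner/actor-name: alphanumerics, hyphens, dots, exactly one slash.
const TECHNICAL_NAME = /^[A-Za-z0-9.-]+\/[A-Za-z0-9.-]+$/;
// Raw ID: exactly 17 alphanumeric characters.
const RAW_ID = /^[A-Za-z0-9]{17}$/;

function isValidActorId(value) {
  return TECHNICAL_NAME.test(value) || RAW_ID.test(value);
}

// Assumption: keywords are letters/digits separated by single spaces.
function isValidSearchKeywords(value) {
  return /^[\p{L}\p{N}]+( [\p{L}\p{N}]+)*$/u.test(value);
}

function isValidJsonInput(value) {
  // Single quotes would terminate the '...' quoting used in the commands.
  if (value.includes("'")) return false;
  try {
    JSON.parse(value);
    return true;
  } catch {
    return false;
  }
}

// Assumption: descriptive-name contains only alphanumerics and hyphens.
function isValidOutputFilename(value) {
  return /^\d{4}-\d{2}-\d{2}_[A-Za-z0-9-]+\.(csv|json)$/.test(value);
}
```

A caller would run the relevant check before substituting a value into any command and refuse to proceed when it returns false.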

Workflow

Copy this checklist and track progress:

Task Progress:
- [ ] Step 1: Understand user goal and select Actor
- [ ] Step 2: Fetch Actor schema via mcpc
- [ ] Step 3: Ask user preferences (format, filename)
- [ ] Step 4: Run the scraper script
- [ ] Step 5: Summarize results and offer follow-ups

Step 1: Understand User Goal and Select Actor

First, understand what the user wants to achieve. Then select the best Actor from the options below.

Instagram Actors (12)

Actor ID | Best For
apify/instagram-profile-scraper | Profile data, follower counts, bio info
apify/instagram-post-scraper | Individual post details, engagement metrics
apify/instagram-comment-scraper | Comment extraction, sentiment analysis
apify/instagram-hashtag-scraper | Hashtag content, trending topics
apify/instagram-hashtag-stats | Hashtag performance metrics
apify/instagram-reel-scraper | Reels content and metrics
apify/instagram-search-scraper | Search users, places, hashtags
apify/instagram-tagged-scraper | Posts tagged with specific accounts
apify/instagram-followers-count-scraper | Follower count tracking
apify/instagram-scraper | Comprehensive Instagram data
apify/instagram-api-scraper | API-based Instagram access
apify/export-instagram-comments-posts | Bulk comment/post export

Facebook Actors (14)

Actor ID | Best For
apify/facebook-pages-scraper | Page data, metrics, contact info
apify/facebook-page-contact-information | Emails, phones, addresses from pages
apify/facebook-posts-scraper | Post content and engagement
apify/facebook-comments-scraper | Comment extraction
apify/facebook-likes-scraper | Reaction analysis
apify/facebook-reviews-scraper | Page reviews
apify/facebook-groups-scraper | Group content and members
apify/facebook-events-scraper | Event data
apify/facebook-ads-scraper | Ad creative and targeting
apify/facebook-search-scraper | Search results
apify/facebook-reels-scraper | Reels content
apify/facebook-photos-scraper | Photo extraction
apify/facebook-marketplace-scraper | Marketplace listings
apify/facebook-followers-following-scraper | Follower/following lists

TikTok Actors (14)

Actor ID | Best For
clockworks/tiktok-scraper | Comprehensive TikTok data
clockworks/free-tiktok-scraper | Free TikTok extraction
clockworks/tiktok-profile-scraper | Profile data
clockworks/tiktok-video-scraper | Video details and metrics
clockworks/tiktok-comments-scraper | Comment extraction
clockworks/tiktok-followers-scraper | Follower lists
clockworks/tiktok-user-search-scraper | Find users by keywords
clockworks/tiktok-hashtag-scraper | Hashtag content
clockworks/tiktok-sound-scraper | Trending sounds
clockworks/tiktok-ads-scraper | Ad content
clockworks/tiktok-discover-scraper | Discover page content
clockworks/tiktok-explore-scraper | Explore content
clockworks/tiktok-trends-scraper | Trending content
clockworks/tiktok-live-scraper | Live stream data

YouTube Actors (5)

Actor ID | Best For
streamers/youtube-scraper | Video data and metrics
streamers/youtube-channel-scraper | Channel information
streamers/youtube-comments-scraper | Comment extraction
streamers/youtube-shorts-scraper | Shorts content
streamers/youtube-video-scraper-by-hashtag | Videos by hashtag

Google Maps Actors (4)

Actor ID | Best For
compass/crawler-google-places | Business listings, ratings, contact info
compass/google-maps-extractor | Detailed business data
compass/Google-Maps-Reviews-Scraper | Review extraction
poidata/google-maps-email-extractor | Email discovery from listings

Other Actors (6)

Actor ID | Best For
apify/google-search-scraper | Google search results
apify/google-trends-scraper | Google Trends data
voyager/booking-scraper | Booking.com hotel data
voyager/booking-reviews-scraper | Booking.com reviews
maxcopell/tripadvisor-reviews | TripAdvisor reviews
vdrmota/contact-info-scraper | Contact enrichment from URLs

Actor Selection by Use Case

Use Case | Primary Actors
Lead Generation | compass/crawler-google-places, poidata/google-maps-email-extractor, vdrmota/contact-info-scraper
Influencer Discovery | apify/instagram-profile-scraper, clockworks/tiktok-profile-scraper, streamers/youtube-channel-scraper
Brand Monitoring | apify/instagram-tagged-scraper, apify/instagram-hashtag-scraper, compass/Google-Maps-Reviews-Scraper
Competitor Analysis | apify/facebook-pages-scraper, apify/facebook-ads-scraper, apify/instagram-profile-scraper
Content Analytics | apify/instagram-post-scraper, clockworks/tiktok-scraper, streamers/youtube-scraper
Trend Research | apify/google-trends-scraper, clockworks/tiktok-trends-scraper, apify/instagram-hashtag-stats
Review Analysis | compass/Google-Maps-Reviews-Scraper, voyager/booking-reviews-scraper, maxcopell/tripadvisor-reviews
Audience Analysis | apify/instagram-followers-count-scraper, clockworks/tiktok-followers-scraper, apify/facebook-followers-following-scraper
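
For illustration, the use-case table can be held as a lookup structure. The Actor IDs are copied from the table; the constant and function names are hypothetical and not part of the skill.

```javascript
// Illustrative lookup built from the use-case table; names are hypothetical.
const ACTORS_BY_USE_CASE = new Map([
  ["lead generation", ["compass/crawler-google-places", "poidata/google-maps-email-extractor", "vdrmota/contact-info-scraper"]],
  ["influencer discovery", ["apify/instagram-profile-scraper", "clockworks/tiktok-profile-scraper", "streamers/youtube-channel-scraper"]],
  ["brand monitoring", ["apify/instagram-tagged-scraper", "apify/instagram-hashtag-scraper", "compass/Google-Maps-Reviews-Scraper"]],
  ["competitor analysis", ["apify/facebook-pages-scraper", "apify/facebook-ads-scraper", "apify/instagram-profile-scraper"]],
  ["content analytics", ["apify/instagram-post-scraper", "clockworks/tiktok-scraper", "streamers/youtube-scraper"]],
  ["trend research", ["apify/google-trends-scraper", "clockworks/tiktok-trends-scraper", "apify/instagram-hashtag-stats"]],
  ["review analysis", ["compass/Google-Maps-Reviews-Scraper", "voyager/booking-reviews-scraper", "maxcopell/tripadvisor-reviews"]],
  ["audience analysis", ["apify/instagram-followers-count-scraper", "clockworks/tiktok-followers-scraper", "apify/facebook-followers-following-scraper"]],
]);

// Returns the primary Actors for a use case, or an empty list when the
// use case is not in the table (the Apify Store search is the fallback).
function primaryActorsFor(useCase) {
  return ACTORS_BY_USE_CASE.get(useCase.trim().toLowerCase()) ?? [];
}
```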

Multi-Actor Workflows

For complex tasks, chain multiple Actors:

Workflow | Step 1 | Step 2
Lead enrichment | compass/crawler-google-places | vdrmota/contact-info-scraper
Influencer vetting | apify/instagram-profile-scraper | apify/instagram-comment-scraper
Competitor deep-dive | apify/facebook-pages-scraper | apify/facebook-posts-scraper
Local business analysis | compass/crawler-google-places | compass/Google-Maps-Reviews-Scraper
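
Chaining means turning the Step 1 output into the Step 2 input. The sketch below does this for the lead enrichment row under two assumptions that should be verified against each Actor's schema (see Step 2): that Step 1 items carry a website field, and that the Step 2 Actor accepts a startUrls array of { url } objects. The function name is hypothetical.

```javascript
// Hypothetical glue between the two Actors of the lead enrichment workflow.
// Assumptions: place items expose `website`; Step 2 accepts `startUrls`.
function buildEnrichmentInput(places) {
  const seen = new Set();
  const startUrls = [];
  for (const place of places) {
    const url = place.website;
    // Skip listings without a website and de-duplicate repeated URLs.
    if (typeof url !== "string" || url === "" || seen.has(url)) continue;
    seen.add(url);
    startUrls.push({ url });
  }
  return { startUrls };
}
```

The returned object would be serialized with JSON.stringify and passed as JSON_INPUT for the second run.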

Can't Find a Suitable Actor?

If none of the Actors above match the user's request, search the Apify Store directly:

mcpc --json mcp.apify.com --header "Authorization: Bearer $APIFY_TOKEN" tools-call search-actors keywords:="SEARCH_KEYWORDS" limit:=10 offset:=0 category:="" | jq -r '.content[0].text'

Replace SEARCH_KEYWORDS with 1-3 simple terms (e.g., "LinkedIn profiles", "Amazon products", "Twitter").
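
If the search is launched from Node rather than typed into a shell, the same command can be expressed as an argument array so that no shell ever interprets the keywords. The sketch below only rebuilds the arguments shown in the command above; the function names are hypothetical, and the result shape read by extractText is taken from the jq filter (.content[0].text).

```javascript
// Hypothetical helper: the search command above as an argv array.
// Each element is one argument, so quoting is not needed.
function buildSearchArgs(keywords, token, limit = 10) {
  return [
    "--json",
    "mcp.apify.com",
    "--header",
    `Authorization: Bearer ${token}`,
    "tools-call",
    "search-actors",
    `keywords:=${keywords}`,
    `limit:=${limit}`,
    "offset:=0",
    "category:=",
  ];
}

// Equivalent of the jq filter '.content[0].text' on the parsed JSON output.
function extractText(result) {
  return result?.content?.[0]?.text ?? null;
}
```

The array can be handed to child_process.execFile("mcpc", args, callback), which does not spawn a shell by default. The keywords should still pass the sanitization rules first.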

Step 2: Fetch Actor Schema

Fetch the Actor's input schema and details dynamically using mcpc:

mcpc --json mcp.apify.com --header "Authorization: Bearer $APIFY_TOKEN" tools-call fetch-actor-details actor:="ACTOR_ID" | jq -r ".content"

Replace ACTOR_ID with the selected Actor (e.g., compass/crawler-google-places).

This returns:

  • Actor description and README
  • Required and optional input parameters
  • Output fields (if available)
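
Once the details are fetched, it helps to separate required from optional parameters before building JSON_INPUT. The sketch assumes the input schema has already been extracted from the response and follows the JSON-Schema-like shape with properties and required keys; the exact response layout of fetch-actor-details is not specified here, and the function name is hypothetical.

```javascript
// Hypothetical helper: split schema parameters into required and optional.
// Assumption: `schema` looks like { properties: {...}, required: [...] }.
function splitInputParameters(schema) {
  const names = Object.keys(schema.properties ?? {});
  const required = new Set(schema.required ?? []);
  return {
    required: names.filter((name) => required.has(name)),
    optional: names.filter((name) => !required.has(name)),
  };
}
```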

Step 3: Ask User Preferences

Before running, ask:

  1. Output format:
    • Quick answer - Display top few results in chat (no file saved)
    • CSV - Full export with all fields
    • JSON - Full export in JSON format
  2. Number of results: choose a limit that fits the nature of the use case

Step 4: Run the Script

Quick answer (display in chat, no file):

node {baseDir}/reference/scripts/run_actor.js \
  --actor 'ACTOR_ID' \
  --input 'JSON_INPUT'

CSV:

node {baseDir}/reference/scripts/run_actor.js \
  --actor 'ACTOR_ID' \
  --input 'JSON_INPUT' \
  --output 'YYYY-MM-DD_OUTPUT_FILE.csv' \
  --format csv

JSON:

node {baseDir}/reference/scripts/run_actor.js \
  --actor 'ACTOR_ID' \
  --input 'JSON_INPUT' \
  --output 'YYYY-MM-DD_OUTPUT_FILE.json' \
  --format json
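
The three variants differ only in whether --output and --format are present. A minimal sketch of assembling the arguments, using only the flags shown above, follows; the function names and the options object are hypothetical, and the date is taken in UTC.

```javascript
// Hypothetical helper: build a YYYY-MM-DD_name.ext filename (UTC date).
function datedFilename(name, format, date = new Date()) {
  const day = date.toISOString().slice(0, 10);
  return `${day}_${name}.${format}`;
}

// Hypothetical helper: argv for `node run_actor.js ...` per output format.
// Any format other than "csv" or "json" is treated as the quick answer.
function buildRunArgs({ baseDir, actorId, jsonInput, format, outputName }) {
  const args = [
    `${baseDir}/reference/scripts/run_actor.js`,
    "--actor", actorId,
    "--input", jsonInput,
  ];
  if (format === "csv" || format === "json") {
    args.push("--output", outputName, "--format", format);
  }
  return args;
}
```

Passing the array to child_process.execFile("node", args, callback) avoids shell quoting entirely. The values should still satisfy the sanitization rules.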

Step 5: Summarize Results and Offer Follow-ups

After completion, report:

  • Number of results found
  • File location and name
  • Key fields available
  • Suggested follow-up workflows based on results:
If User Got | Suggest Next
Business listings | Enrich with vdrmota/contact-info-scraper or get reviews
Influencer profiles | Analyze engagement with comment scrapers
Competitor pages | Deep-dive with post/ad scrapers
Trend data | Validate with platform-specific hashtag scrapers
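
As an illustration of the first three report items, the sketch below derives the result count and the available fields from a JSON export that has already been parsed into an array of objects. The function name and the return shape are hypothetical.

```javascript
// Hypothetical helper: summarize parsed result items for the final report.
function summarizeResults(items, outputFile = null) {
  const fields = new Set();
  for (const item of items) {
    for (const key of Object.keys(item)) fields.add(key);
  }
  return {
    count: items.length,        // number of results found
    file: outputFile,           // null for quick answers (no file saved)
    fields: [...fields].sort(), // union of keys across all items
  };
}
```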

Security & Data Privacy

This skill instructs the agent to select an Apify Actor, fetch its schema (via mcpc), and run scrapers. The included script communicates only with api.apify.com and writes outputs to files under the current working directory; it does not access unrelated system files or other environment variables.

Apify Actors only scrape publicly available data and do not collect private or personally identifiable information beyond what is openly accessible on the target platforms. For additional security assurance, you can check an Actor's permission level by querying https://api.apify.com/v2/acts/:actorId — an Actor with LIMITED_PERMISSIONS operates in a restricted sandbox, while FULL_PERMISSIONS indicates broader system access. For full details, see Apify's General Terms and Conditions.
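
A minimal sketch of the permission check, under stated assumptions: the Apify API addresses a technical name as owner~actor-name in the URL path, and the name of the response field that holds the permission level is not given above, so the helper simply searches the serialized response for the two constants. All function names are hypothetical.

```javascript
// Hypothetical helper: build the details URL for an Actor.
// Assumption: technical names are written owner~actor-name in API paths.
function actorDetailsUrl(actorId) {
  return `https://api.apify.com/v2/acts/${actorId.replace("/", "~")}`;
}

// The field carrying the permission level is not specified in this document,
// so scan the whole serialized body for the two known constants.
function detectPermissionLevel(responseBody) {
  const text = JSON.stringify(responseBody);
  if (text.includes("LIMITED_PERMISSIONS")) return "LIMITED_PERMISSIONS";
  if (text.includes("FULL_PERMISSIONS")) return "FULL_PERMISSIONS";
  return null;
}

// Fetches the Actor details and reports the permission level, if present.
async function checkActorPermissions(actorId, token, fetchImpl = fetch) {
  const response = await fetchImpl(actorDetailsUrl(actorId), {
    headers: { Authorization: `Bearer ${token}` },
  });
  if (!response.ok) {
    throw new Error(`Apify API returned HTTP ${response.status}`);
  }
  return detectPermissionLevel(await response.json());
}
```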

Error Handling

  • APIFY_TOKEN not found - Ask user to configure APIFY_TOKEN in OpenClaw settings
  • mcpc not found - Run npm install -g @apify/mcpc
  • Actor not found - Check Actor ID spelling
  • Run FAILED - Ask user to check Apify console link in error output
  • Timeout - Reduce input size or increase --timeout
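
The error cases can be kept as a small pattern table. The patterns below are assumptions based on the error labels in this section, not on the script's actual output, so they may need adjusting to the real messages; the names are hypothetical.

```javascript
// Hypothetical mapping from error text to the suggested remedy.
// Patterns mirror the labels in this section, not verified script output.
const ERROR_ADVICE = [
  [/APIFY_TOKEN not found/i, "Ask the user to configure APIFY_TOKEN in OpenClaw settings."],
  [/mcpc not found/i, "Run: npm install -g @apify/mcpc"],
  [/Actor not found/i, "Check the Actor ID spelling."],
  [/Run FAILED/i, "Ask the user to check the Apify console link in the error output."],
  [/Timeout/i, "Reduce the input size or increase --timeout."],
];

// Returns the first matching remedy, or null for unrecognized errors.
function adviseOnError(message) {
  for (const [pattern, advice] of ERROR_ADVICE) {
    if (pattern.test(message)) return advice;
  }
  return null;
}
```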

Security Recommendations
This skill appears to do what it says: it runs Apify Actors and saves the results. Before installing, consider: 1) APIFY_TOKEN is powerful—use a token scoped/minimized for this purpose or a dedicated Apify account to contain risk; 2) the skill will call api.apify.com and write files to the agent's working directory (it validates output paths); 3) it installs an npm CLI (@apify/mcpc) — review your npm policies; 4) the script sets a User-Agent that identifies usage to Apify (analytics). If you are comfortable granting an Apify token and installing the CLI, the skill is coherent and expected. If you cannot or will not provide a token, do not install.
Functional Analysis
Type: OpenClaw Skill Name: apify-ultimate-scraper Version: 1.0.1 The skill is designed for web scraping via Apify Actors and includes robust security measures. The `SKILL.md` explicitly instructs the AI agent to sanitize all user-provided inputs (ACTOR_ID, SEARCH_KEYWORDS, JSON_INPUT, Output filenames) to prevent shell injection and path traversal. The `run_actor.js` script further enforces these protections with strong internal validation, notably `validateOutputPath` which strictly confines file writes to the current working directory, and `JSON.parse` for input validation. All network communication is restricted to `api.apify.com`, and there is no evidence of unauthorized data exfiltration, persistence mechanisms, or other malicious intent. The use of shell commands and file system access is directly aligned with the skill's stated purpose and is adequately secured.
Capability Assessment
Purpose & Capability
Name/description (universal Apify scraper) match the declared requirements: APIFY_TOKEN (Apify API access), node and mcpc (Apify CLI) are expected and necessary to run Actors.
Instruction Scope
SKILL.md instructs selecting an Actor, fetching schemas via mcpc, validating inputs, starting runs through Apify API, polling status, and downloading datasets. The included script (run_actor.js) implements these actions and does not reference unrelated files, endpoints, or extra environment variables.
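
The polling step can be sketched as follows. This is not the code in run_actor.js: it is an illustration that assumes the Apify API's GET /v2/actor-runs/{runId} endpoint, a response wrapped in a data object with a status field, and the four terminal statuses listed in the code. The function names, interval, and poll limit are hypothetical.

```javascript
// Statuses after which a run will not change any further (assumed list).
const TERMINAL_STATUSES = new Set(["SUCCEEDED", "FAILED", "TIMED-OUT", "ABORTED"]);

function isTerminalStatus(status) {
  return TERMINAL_STATUSES.has(status);
}

// Hypothetical polling loop: resolves with the run object once it finishes.
async function waitForRun(runId, token, options = {}) {
  const { fetchImpl = fetch, intervalMs = 5000, maxPolls = 120 } = options;
  const url = `https://api.apify.com/v2/actor-runs/${runId}`;
  for (let attempt = 0; attempt < maxPolls; attempt++) {
    const response = await fetchImpl(url, {
      headers: { Authorization: `Bearer ${token}` },
    });
    if (!response.ok) {
      throw new Error(`Apify API returned HTTP ${response.status}`);
    }
    const { data } = await response.json();
    if (isTerminalStatus(data.status)) return data;
    // Wait before asking again so the API is not hammered.
    await new Promise((resolve) => setTimeout(resolve, intervalMs));
  }
  throw new Error(`Run ${runId} did not finish after ${maxPolls} polls`);
}
```

A caller would treat any terminal status other than SUCCEEDED as a failure and only then download the dataset items.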
Install Mechanism
Install uses a single npm package (@apify/mcpc) to provide the mcpc CLI. This is a standard, traceable install method for a Node-based CLI; it will add the mcpc binary as declared.
Credentials
Only APIFY_TOKEN is required and is appropriate for controlling and reading Apify Actor runs and datasets. Note: that token grants access to your Apify account (start runs, read datasets, etc.), so it is powerful — ensure you provide a token with appropriate scope and trust.
Persistence & Privilege
Skill is not always-enabled and does not request system-wide persistent privileges. It can be invoked autonomously (default), which is typical for skills; there is no indication it alters other skills or system settings.
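How to Use
  1. Make sure OpenClaw is installed (local or Docker deployment)
  2. Enter the install command in the chat box: /install apify-ultimate-scraper
  3. Once installed, invoke the Skill by name or trigger it with /apify-ultimate-scraper
  4. Provide the inputs described in the Skill's parameter documentation to get structured output
Version History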
v1.0.1
- Updated CLI dependency from `mcporter` to the new `mcpc` tool for all workflows.
- Changed the required Node.js package from `mcporter` to `@apify/mcpc`.
- All task and workflow commands now reference `mcpc` instead of `mcporter`.
- No changes to scraping logic or available Actors; functionality remains the same.
v1.0.0
Initial public release: universal AI-powered web scraping skill with 55+ Actors.
- Scrape data from Instagram, Facebook, TikTok, YouTube, Google Maps, Google Search, Google Trends, Booking.com, and TripAdvisor.
- Automatically selects the best Apify Actor for any data extraction use case.
- Supports lead generation, influencer discovery, brand monitoring, competitor analysis, review analysis, trend research, and more.
- Provides a step-by-step workflow, strict input validation, and clear multi-actor workflow examples.
- Easily extensible: falls back to searching the Apify Store for additional Actors if needed.
Metadata
Slug: apify-ultimate-scraper
Version: 1.0.1
License: not listed
Total installs: 9
Current installs: 9
Versions: 2
FAQ

What is Apify Ultimate Scraper?

Universal AI-powered web scraper for any platform. Scrape data from Instagram, Facebook, TikTok, YouTube, Google Maps, Google Search, Google Trends, Booking.com, and TripAdvisor. Use for lead generation, brand monitoring, competitor analysis, influencer discovery, trend research, content analytics, audience analysis, or any data extraction task. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 1547 total downloads to date.

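How do I install Apify Ultimate Scraper?

Run the command "/install apify-ultimate-scraper" in the OpenClaw or Claude Code chat box to install it in one step. Before first use, APIFY_TOKEN must be configured in OpenClaw settings (see Prerequisites).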

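Is Apify Ultimate Scraper free?

Yes. Apify Ultimate Scraper is completely free (free and open source) to download, install, and use.

Which platforms does Apify Ultimate Scraper support?

Apify Ultimate Scraper is cross-platform: it runs in any environment where OpenClaw / Claude Code is deployed.

Who developed Apify Ultimate Scraper?

It is developed and maintained by protoss70 (@protoss70); the current version is v1.0.1.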
