← Back to Skills Marketplace
Twitter Scraper
by
ArulmozhiV
· GitHub ↗
· v0.1.2
1604
Downloads
1
Stars
11
Active Installs
3
Versions
Install in OpenClaw
/install x-twitter-scraper
Description
Scrapes public Twitter/X profiles and recent tweets using browser automation with anti-detection and optional profile discovery via Google or DuckDuckGo.
Usage Guidance
This SKILL.md describes a heavy-weight Playwright/Python scraper but the package contains no code, no install instructions, and declares no dependencies or credentials — that's inconsistent. Before installing or using this skill, ask the publisher for: (1) the source repository or packaged installer, (2) a clear install spec that declares required binaries (python3, Playwright, chromium), (3) an explicit list of environment variables/credentials it will use (Google API key, proxy credentials) and where/how they are stored, and (4) a reproducible CLI or binary that implements the advertised commands. Do not supply API keys or proxy credentials until you can verify the source code and installation steps. Also consider legal and policy implications of scraping X/Twitter and of using residential proxy providers; ensure you have permission and compliance controls. If the publisher cannot provide source or an installable artifact, treat this skill as unusable and potentially unsafe.
Capability Analysis
Type: OpenClaw Skill
Name: x-twitter-scraper
Version: 0.1.2
The OpenClaw AgentSkills skill bundle for 'x-twitter-scraper' is classified as benign. The `SKILL.md` clearly outlines a legitimate purpose: scraping public Twitter/X profiles using Python and Playwright. It details necessary capabilities such as network access (to Twitter/X, Google Custom Search API, and residential proxies), local file system interaction for data storage, and reading environment variables for API and proxy credentials. While these capabilities involve sensitive operations, there is no evidence of malicious intent, prompt injection attempts, data exfiltration, or unauthorized actions within the provided documentation. The affiliate links for proxy providers are a commercial aspect, not a security concern.
Capability Assessment
Purpose & Capability
The descriptive purpose is a browser-automated scraper built with Playwright (Python) and chromium, but the registry-level requirements list no binaries, no install steps, and there are no code files. The skill claims CLI commands (discover/scrape) and persistent data directories; however, nothing in the package would actually provide those binaries or scripts. This is internally inconsistent — a production scraper would legitimately need Python, Playwright, and a browser binary, yet those are not declared or provided.
Instruction Scope
The SKILL.md instructs the agent to run CLI commands, read/edit config/scraper_config.json, create and write queue/output/thumbnails directories, download thumbnails, use residential proxies, optionally call Google Custom Search API, and 'auto-dismiss' login overlays. Those operations require local binaries, network access, credentials, and file system write privileges. The instructions give broad runtime behaviors (anti-detection, fingerprint spoofing, proxy use) but the skill package provides no code or declared binaries to perform them — the instructions therefore grant broad authority without the supporting artifacts.
Install Mechanism
There is no install specification (instruction-only), which is lowest-risk in isolation, but problematic here because the SKILL.md itself documents a non-trivial runtime stack (python3, chromium, Playwright, proxy configuration). The absence of an install step or source repository means an agent following the instructions might attempt to pull or execute third-party code ad hoc — the mismatch increases the chance of unclear or unsafe runtime behavior.
Credentials
The skill references optional Google API credentials and residential proxy providers (e.g., BrightData) and expects saving of local files, but the declared required env vars/credentials are none. That omission is a red flag: the runtime clearly needs API keys and potentially proxy credentials, yet the skill does not declare them. Asking users to provide such secrets without clear declaration or handling details is disproportionate and risky.
Persistence & Privilege
The skill does not request 'always: true' and is user-invocable; autonomous invocation is allowed (platform default) but not a separate privilege here. The skill does expect to write to local data directories (data/queue, data/output, thumbnails), which is normal for a scraper but should be noted by the user.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install x-twitter-scraper - After installation, invoke the skill by name or use
/x-twitter-scraper - Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.2
**Added ScrapeClaw suite info and updated version metadata.**
- Added a reference and link to the "ScrapeClaw" suite at the top of the documentation.
- Updated YAML version from 1.0.0 to 1.0.2.
- No code/feature changes; documentation only.
v0.1.1
**x-twitter-scraper v0.1.1 Changelog**
- Major SKILL.md rewrite for clarity, brevity, and improved structure.
- Added YAML skill metadata section with tags and requirements.
- Expanded and clarified configuration example and proxy provider info.
- Improved feature list, troubleshooting, and usage documentation.
- All technical details and configuration examples updated for easier onboarding.
v0.1.0
Initial release: fast, configurable two-phase scraping for public Twitter/X profiles (no login required).
- Discover profiles via Google Custom Search API or DuckDuckGo, filtered by location and category
- Scrape public profile data and recent tweets using Playwright with robust anti-detection techniques
- Flexible CLI: discover, scrape, batch modes, output in JSON and CSV, checkpoint/resume support
- Downloads profile pics and tweet media to structured local directories
- Skips suspended, private, or low-follower accounts (configurable thresholds)
- Handles Twitter UI changes and login overlays for reliable data extraction
Metadata
Frequently Asked Questions
What is Twitter Scraper?
Scrapes public Twitter/X profiles and recent tweets using browser automation with anti-detection and optional profile discovery via Google or DuckDuckGo. It is an AI Agent Skill for Claude Code / OpenClaw, with 1604 downloads so far.
How do I install Twitter Scraper?
Run "/install x-twitter-scraper" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Twitter Scraper free?
Yes, Twitter Scraper is completely free (open-source). You can download, install and use it at no cost.
Which platforms does Twitter Scraper support?
Twitter Scraper is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Twitter Scraper?
It is built and maintained by ArulmozhiV (@arulmozhiv); the current version is v0.1.2.
More Skills