← Back to Skills Marketplace
929
Downloads
0
Stars
3
Active Installs
2
Versions
Install in OpenClaw
/install gh-extract
Description
Extract content from a GitHub url.
README (SKILL.md)
GitHub Extract
Extract content from a GitHub url.
Use this skill when the user types /gh-extract or asks to extract/download/summarize a GitHub url.
What it does
- Accepts an GitHub url, could be repo/tree/blob.
- Convert the url to github raw url.
- Extract file content from the raw url or save to a temp path.
Requirements
uvwget
Usage
# print file content to stdout
uv run --script ${baseDir}/gh_extract.py \x3Curl>
# save file to a temp path, with a proper filename
uv run --script ${baseDir}/gh_extract.py \x3Curl> --save
Notes
- only works for public repo.
- url can be repo/tree/blob
- for repo/tree, will try to get
README.mdorSKILL.mdorREADME.txt
Usage Guidance
This skill appears to do what it says — fetch content from public GitHub URLs — but check these points before installing: (1) SKILL.md and the script call 'wget' though the registry metadata only lists 'uv'; ensure 'wget' is available on the host or adjust expectations. (2) The script declares Python dependencies (furl, requests, loguru, urllib3) but there is no install specification — confirm how your runtime will provide or install them. (3) The tool will download arbitrary user-supplied URLs (via requests/wget) and write them to a temporary directory — only use it with trusted GitHub URLs. (4) No secrets are requested and the code only talks to GitHub/raw.githubusercontent.com, but if you run this in a shared environment be aware it will perform outbound HTTP requests and write files to /tmp. If any of these inconsistencies are unacceptable, ask the author to: add 'wget' to required binaries, document dependency installation in the registry, or provide an explicit install spec so you can review what will be installed.
Capability Analysis
Type: OpenClaw Skill
Name: gh-extract
Version: 0.0.2
The skill is suspicious due to the direct execution of `wget` with a user-provided URL in `gh_extract.py`. The `SKILL.md` instructs the AI agent to pass user input directly as the `<url>` argument to the Python script, which then uses `subprocess.run(['wget', ..., url])`. While `subprocess.run` with a list mitigates direct shell injection, it exposes the `wget` command to potential vulnerabilities if a crafted URL can exploit `wget` itself (e.g., arbitrary file writes or SSRF), representing a lack of robust input sanitization for external command execution.
Capability Assessment
Purpose & Capability
The skill's name/description match the code and instructions: it converts GitHub URLs to raw.githubusercontent.com and fetches/ saves file contents. However there is a minor inconsistency: registry metadata lists only 'uv' as a required binary, while SKILL.md and the code also require 'wget'. The script also lists Python dependencies in its header (furl, requests, loguru, urllib3) but the registry has no install spec — this is plausible if the execution environment (uv) handles them, but it is a mismatch between declared and used requirements.
Instruction Scope
SKILL.md instructs the agent to run the provided script with a single GitHub URL and optionally save to a temp file. The script limits its actions to converting URLs, trying common README/SKILL files for repo/tree URLs, fetching via HTTP, or using wget to save to a temp dir. It does not reference unrelated system files, additional env vars, or external endpoints beyond GitHub/raw.githubusercontent.com. It will download arbitrary user-supplied URLs (expected behavior), so user-provided URLs must be trusted.
Install Mechanism
There is no install spec (instruction-only), which is low-risk. The script header declares Python dependencies (PyPI packages) that will need to be present or installed by the runtime; how those get installed depends on the uv runtime but is not documented here. No remote install URLs or archive extraction are used.
Credentials
The skill requests no environment variables or credentials. It performs network requests to GitHub/raw.githubusercontent.com and spawns wget to save files into a temp directory. No access to other credentials, config paths, or unrelated services is requested.
Persistence & Privilege
always:false and no requests to modify other skills or global agent configuration. The skill creates temporary files when saving downloads, which is normal for this functionality and limited in scope.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install gh-extract - After installation, invoke the skill by name or use
/gh-extract - Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.0.2
- Renamed main script from `gh-extract.py` to `gh_extract.py`.
- Now prints file content to stdout by default; use `--save` to save the file to a temp path.
- Added requirement for `wget` alongside `uv`.
- Improved handling for repo/tree URLs: automatically extracts `README.md`, `SKILL.md`, or `README.txt` if present.
- Clarified usage and updated documentation to match new script behavior.
v0.0.1
Initial release: Extracts and downloads content from GitHub URLs.
- Supports GitHub repo, tree, and file (blob) URLs.
- Converts standard URLs to their raw content equivalent.
- Downloads the raw file and saves it to a temporary directory.
- Prints the path to the downloaded file.
- Requires the `uv` tool installed on your system.
- Works with public GitHub repositories only.
Metadata
Frequently Asked Questions
What is GitHub Extract?
Extract content from a GitHub url. It is an AI Agent Skill for Claude Code / OpenClaw, with 929 downloads so far.
How do I install GitHub Extract?
Run "/install gh-extract" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is GitHub Extract free?
Yes, GitHub Extract is completely free (open-source). You can download, install and use it at no cost.
Which platforms does GitHub Extract support?
GitHub Extract is cross-platform and runs anywhere OpenClaw / Claude Code is available (darwin, linux, win32).
Who created GitHub Extract?
It is built and maintained by guoqiao (@guoqiao); the current version is v0.0.2.
More Skills