← Back to Skills Marketplace
jdchi

llms-txt-sniffer

by JDChi · GitHub ↗ · v1.3.1 · MIT-0
cross-platform ✓ Security Clean
117
Downloads
0
Stars
0
Active Installs
5
Versions
Install in OpenClaw
/install llms-txt-sniffer
Description
Locate and utilize AI-friendly documentation index files (llms.txt, llms-full.txt) or sitemap.xml. Use when encountering documentation URLs (containing /docs...
README (SKILL.md)

llms-txt-sniffer: The Smart Document Radar

This skill streamlines documentation ingestion by locating the most AI-optimized version of a site's content.

🧠 Why llms.txt?

It provides a high-density, Markdown-based index designed for LLMs to map entire sites instantly and save tokens.

🚀 Discovery Strategy (Two-Stage)

Stage 1: Quick Jump Probes (Instructional)

  1. URL + /llms.txt: Probe {input_url}/llms.txt using curl -I.
  2. Domain Root: Probe https://{domain}/llms.txt using curl -I.

Stage 2: Advanced Sniffing (Tool-based)

If Stage 1 fails, run the companion sniffer script located in this skill's directory: python3 sniffer.py $ARGUMENTS

📜 Behavioral Rules

  • User-Initiated Only: Only invoke this skill when the user explicitly provides a documentation URL. Do not autonomously scan domains.
  • Switch to High-Speed Mode: Once an index is found, prioritize its links over manual scraping.
  • Index Summary: Always present a brief structure overview.
  • Fallback: Use sitemap.xml parser results if llms.txt is missing.
Usage Guidance
This skill appears coherent and limited: it runs HTTP probes and a small Python script to find llms.txt or sitemap.xml and returns a short preview. Before installing, consider: (1) only use it against public documentation URLs you control or are authorized to scan (the script does DNS lookups and fetches content and could reveal page text); (2) the script returns up to 2000 characters of file content — avoid running it on URLs that might expose secrets; (3) the SKILL.md asks for user-initiated use only, but the platform permits autonomous invocation by default — ensure you only enable autonomous runs if you trust it; (4) source/homepage is unknown — if you need stronger assurances, review or run the sniffer.py code in a sandboxed environment yourself before use.
Capability Analysis
Type: OpenClaw Skill Name: llms-txt-sniffer Version: 1.3.1 The skill bundle is designed to locate AI-friendly documentation index files (llms.txt and sitemap.xml) to optimize documentation ingestion. The Python script `sniffer.py` includes proactive security measures, such as an SSRF protection mechanism (`is_internal_host`) to prevent the agent from probing internal network resources. The instructions in `SKILL.md` are transparent, align with the tool's stated purpose, and include behavioral constraints to ensure the tool is only used upon explicit user request.
Capability Assessment
Purpose & Capability
Name, description, declared binaries (python3, curl), and included script align with the stated goal of locating llms.txt / sitemap.xml on documentation sites. No unrelated credentials, config paths, or binaries are requested.
Instruction Scope
SKILL.md restricts use to user-initiated document URLs and describes probing with curl -I then running sniffer.py. The script performs HTTP GETs (via urllib) and returns a content preview (up to 2000 chars). It does not read local files or environment variables. Note: the skill performs network requests and DNS resolution; this is expected but the agent should follow the 'user-initiated only' guideline when invoking it.
Install Mechanism
Instruction-only with a bundled small Python script. No install spec or external downloads; nothing is written to disk beyond the existing code file. Low install risk.
Credentials
No environment variables, credentials, or config paths are requested. The required tools (python3, curl) are appropriate for the task.
Persistence & Privilege
always is false and the skill does not request elevated or persistent platform privileges. It does not modify other skills or system-wide settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install llms-txt-sniffer
  3. After installation, invoke the skill by name or use /llms-txt-sniffer
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.3.1
Security: add SSRF protection (block internal IPs), fix metadata (remove requests), clarify allowed-tools, add User-Initiated-Only rule
v1.2.9
New format with argument-hint and allowed-tools
v1.3.0
Streamline description, simplified content
v1.2.2
Fix YAML syntax, add allowed-tools for Claude Code
v1.2.1
v1.2.1: 移除requests依赖,新增触发场景说明
Metadata
Slug llms-txt-sniffer
Version 1.3.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 5
Frequently Asked Questions

What is llms-txt-sniffer?

Locate and utilize AI-friendly documentation index files (llms.txt, llms-full.txt) or sitemap.xml. Use when encountering documentation URLs (containing /docs... It is an AI Agent Skill for Claude Code / OpenClaw, with 117 downloads so far.

How do I install llms-txt-sniffer?

Run "/install llms-txt-sniffer" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is llms-txt-sniffer free?

Yes, llms-txt-sniffer is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does llms-txt-sniffer support?

llms-txt-sniffer is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created llms-txt-sniffer?

It is built and maintained by JDChi (@jdchi); the current version is v1.3.1.

💬 Comments