LiteBrowse

by agitalent · GitHub ↗ · v0.1.1 · MIT-0
cross-platform · ✓ Security Clean · 140 downloads · 0 stars · 0 active installs · 2 versions
Install in OpenClaw
/install litebrowse
Description
Extracts and summarizes the most relevant webpage passages for focused, low-token research without loading or summarizing the full page.
README (SKILL.md)

LiteBrowse Skill

Purpose

LiteBrowse is an OpenClaw skill for low-token webpage research.

Use it when:

  • the user wants facts from a specific webpage
  • the page is long or cluttered
  • token cost matters
  • you need the most relevant passages first instead of full-page dumps

Core Rule

Do not load or summarize the full page first.

Always run the local extractor before reasoning on webpage content:

python3 ./scripts/web_relevance_extract.py "<url-or-html-file>" "<query>"

The extractor returns only the most relevant blocks under a fixed character budget. Use that compact output as the default context for answering.
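
To make the idea of "most relevant blocks under a fixed character budget" concrete, here is a minimal sketch of one way such a selection could work. It is not the bundled script's code: the term-overlap scoring, the `tokenize` helper, and the `select_blocks` function are assumptions made for illustration, and only the `top_k` and `max_chars` names mirror the extractor's flags.

```python
import re


def tokenize(text):
    """Lowercase alphanumeric tokens; a deliberately simple stand-in tokenizer."""
    return re.findall(r"[a-z0-9]+", text.lower())


def select_blocks(blocks, query, top_k=5, max_chars=2400):
    """Return up to top_k blocks ranked by query-term overlap, within max_chars in total.

    Hypothetical scoring: fraction of a block's tokens that appear in the query.
    """
    query_terms = set(tokenize(query))
    scored = []
    for index, block in enumerate(blocks):
        tokens = tokenize(block)
        hits = sum(1 for token in tokens if token in query_terms)
        if hits:
            # Normalise by length so long blocks do not win on size alone.
            scored.append((hits / len(tokens), index, block))
    # Highest score first; ties keep original page order.
    scored.sort(key=lambda item: (-item[0], item[1]))

    selected, used = [], 0
    for _score, _index, block in scored[:top_k]:
        if used + len(block) > max_chars:
            continue  # skip any block that would exceed the character budget
        selected.append(block)
        used += len(block)
    return selected
```

Blocks with no query terms are dropped entirely, which is what keeps navigation text and boilerplate out of the context.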

Required Workflow

  1. Restate the information target as a short query string.
  2. Run:
    python3 ./scripts/web_relevance_extract.py "<source>" "<query>" --top-k 5 --max-chars 2400 --format json
  3. Read only the returned blocks.
  4. Answer from those blocks if they are sufficient.
  5. Only if recall is clearly insufficient, rerun with one controlled expansion:
    • increase --top-k
    • or increase --max-chars
    • or narrow / refine the query
  6. Do not jump to raw-page scraping unless the extractor failed.
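
The workflow above can be sketched as a small Python wrapper, assuming the extractor is invoked exactly as in step 2. The helper names (`build_command`, `expand_once`, `run_extractor`) and the expansion step sizes are hypothetical; only the script path and the `--top-k`, `--max-chars`, and `--format` flags come from this README.

```python
import subprocess

EXTRACTOR = "./scripts/web_relevance_extract.py"  # path as given in this README


def build_command(source, query, top_k=5, max_chars=2400, fmt="json"):
    """Assemble the step-2 command as an argument list, so no shell quoting is needed."""
    return [
        "python3", EXTRACTOR, source, query,
        "--top-k", str(top_k),
        "--max-chars", str(max_chars),
        "--format", fmt,
    ]


def expand_once(top_k=5, max_chars=2400, knob="top_k"):
    """Step 5: widen exactly one knob per rerun. The step sizes are illustrative guesses."""
    if knob == "top_k":
        return top_k + 2, max_chars
    if knob == "max_chars":
        return top_k, max_chars + 1200
    raise ValueError(f"unknown knob: {knob!r}")


def run_extractor(source, query, **options):
    """Run the extractor and return its stdout; raises if the process exits non-zero."""
    completed = subprocess.run(
        build_command(source, query, **options),
        capture_output=True, text=True, check=True,
    )
    return completed.stdout
```

Passing the query as a single list element keeps multi-word queries intact without manual quoting.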

Budget Discipline

  • Prefer --max-chars 1200 to 2400 for narrow fact lookup.
  • Keep --top-k between 3 and 6 unless the user explicitly asks for breadth.
  • Narrow the query instead of widening the token budget when possible.
  • If the first run already contains the answer, stop there.
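
As a rough illustration, the ranges above could be checked before each run. The `check_budget` function and its parameters are hypothetical and not part of the skill; the numeric ranges are the ones listed in this section.

```python
def check_budget(top_k, max_chars, narrow_lookup=True, breadth_requested=False):
    """Return warnings where the settings stray from this section's guidance."""
    warnings = []
    # --top-k should stay within 3..6 unless the user explicitly asked for breadth.
    if not breadth_requested and not 3 <= top_k <= 6:
        warnings.append(f"--top-k {top_k} is outside the suggested 3-6 range")
    # For narrow fact lookup, 1200..2400 characters is the preferred budget.
    if narrow_lookup and not 1200 <= max_chars <= 2400:
        warnings.append(f"--max-chars {max_chars} is outside the suggested 1200-2400 range")
    return warnings
```

An empty list means the run stays within the suggested budget; a non-empty list is a prompt to narrow the query first.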

Output Discipline

When answering:

  • cite which returned block supports the answer
  • say when the extractor output is incomplete or ambiguous
  • distinguish extracted text from your inference
  • do not claim the full page was reviewed unless it actually was

Examples

Find pricing details from a long page:

python3 ./scripts/web_relevance_extract.py "https://example.com/pricing" "pricing tiers api limits enterprise" --max-chars 1600 --top-k 4 --format text

Find job requirements from a careers page:

python3 ./scripts/web_relevance_extract.py "https://example.com/jobs/ml-engineer" "requirements python llm retrieval location" --max-chars 1800 --top-k 5 --format json

Use a saved HTML file:

python3 ./scripts/web_relevance_extract.py "/tmp/page.html" "refund policy cancellation deadline" --max-chars 1200

Failure Handling

If the page cannot be fetched or parsed:

  • report the fetch or parse failure directly
  • ask for a local HTML copy if network access is blocked
  • do not fabricate an answer from URL guesses
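
A minimal sketch of reporting a failure directly rather than guessing, assuming the extractor signals fetch or parse errors with a non-zero exit status and a message on stderr. That behavior is an assumption, not something this README states, and `run_or_report` is a hypothetical helper.

```python
import subprocess


def run_or_report(command, timeout=60):
    """Run a command and return (ok, text): stdout on success, a failure report otherwise."""
    try:
        completed = subprocess.run(
            command, capture_output=True, text=True, timeout=timeout
        )
    except (OSError, subprocess.TimeoutExpired) as exc:
        # The process could not start or did not finish in time.
        return False, f"extractor could not be run: {exc}"
    if completed.returncode != 0:
        detail = completed.stderr.strip() or "no error output"
        return False, f"extractor failed (exit {completed.returncode}): {detail}"
    return True, completed.stdout
```

When `ok` is false, the report text can be passed to the user as is, together with a request for a local HTML copy if the network is blocked.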
Usage Guidance
This skill appears coherent: it includes a readable Python extractor that fetches a page (or reads a local HTML file), parses it, and returns high-relevance blocks. Before installing or enabling it, consider:
  1. Review the script (already included) and confirm you trust it.
  2. If you run agents in environments with access to internal services, be aware that the extractor will fetch arbitrary URLs you or the agent provide; this can be used to access internal endpoints if network access isn't restricted.
  3. If you prefer tighter control, run the extractor in a network-restricted sandbox or feed it local HTML snapshots instead of live URLs.
If those considerations are acceptable, the skill is consistent with its stated purpose.
Capability Analysis
Type: OpenClaw Skill
Name: litebrowse
Version: 0.1.1
The LiteBrowse skill is a utility for efficient webpage information extraction designed to minimize token usage. The core logic in `scripts/web_relevance_extract.py` uses standard Python libraries to fetch remote URLs or read local HTML files, parsing them into ranked text blocks based on a user-provided query. There is no evidence of data exfiltration, malicious execution, or prompt injection; the local file access is consistent with the stated purpose of processing saved HTML documents.
Capability Assessment
Purpose & Capability
The name, description, SKILL.md, and included script all describe the same functionality: extracting and ranking relevant text blocks from a page. The skill requests no credentials, no unusual binaries, and no config paths — everything is proportionate to a web-extraction helper.
Instruction Scope
Instructions are specific: run the bundled Python extractor with a URL or local HTML file and use only the returned blocks. This scope matches the stated purpose. One operational note: the script will perform network fetches for any HTTP(S) URL provided, and will read local files if given — so when executed in an environment where the agent has network access it can reach arbitrary hosts (including internal endpoints). That behavior is expected for a fetcher but carries the usual network/SSRF risk depending on your runtime environment.
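One way to reduce the SSRF exposure described above is a pre-flight check on URLs before they reach the extractor. The sketch below is an assumption about how a caller might guard the skill, not part of the bundled script, and `is_internal_url` is a hypothetical name. It only catches literal IP addresses and `localhost`; it does not resolve DNS names, so network-level restriction is still the stronger control.

```python
import ipaddress
from urllib.parse import urlsplit


def is_internal_url(url):
    """True if an http(s) URL points at localhost or a literal non-public IP address."""
    parts = urlsplit(url)
    if parts.scheme not in ("http", "https") or parts.hostname is None:
        raise ValueError("expected an http(s) URL with a host")
    host = parts.hostname
    if host == "localhost" or host.endswith(".localhost"):
        return True
    try:
        address = ipaddress.ip_address(host)
    except ValueError:
        # A DNS name: this sketch does not resolve it, so it is not flagged.
        return False
    return (
        address.is_private
        or address.is_loopback
        or address.is_link_local
        or address.is_reserved
    )
```

Local HTML file paths are not URLs and should bypass this check rather than be passed to it.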
Install Mechanism
No install spec — instruction-only with one included Python script. The script uses only Python standard libraries (urllib, html.parser, etc.), so there is no package download or extraction risk.
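To illustrate why the standard library is enough for this kind of task, here is a minimal sketch that splits HTML into text blocks using only `html.parser`. It is not the bundled script's implementation; the `BlockCollector` class, the `html_to_blocks` function, and the choice of block-level tags are assumptions made for illustration.

```python
from html.parser import HTMLParser

BLOCK_TAGS = {"p", "li", "h1", "h2", "h3", "td", "blockquote"}  # illustrative choice
SKIP_TAGS = {"script", "style"}


class BlockCollector(HTMLParser):
    """Collect visible text, starting a new block at each block-level tag."""

    def __init__(self):
        super().__init__()
        self.blocks = []
        self._parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1
        elif tag in BLOCK_TAGS:
            self._flush()

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self._skip_depth:
            self._skip_depth -= 1
        elif tag in BLOCK_TAGS:
            self._flush()

    def handle_data(self, data):
        if not self._skip_depth:  # ignore script and style contents
            self._parts.append(data)

    def _flush(self):
        # Join the pieces, then collapse runs of whitespace.
        text = " ".join("".join(self._parts).split())
        if text:
            self.blocks.append(text)
        self._parts = []


def html_to_blocks(html):
    """Return the page's visible text as a list of blocks."""
    parser = BlockCollector()
    parser.feed(html)
    parser.close()
    parser._flush()  # keep any trailing text outside a block tag
    return parser.blocks
```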
Credentials
The skill declares no environment variables, no credentials, and no config paths. The script does read either a network URL or a local file path provided at runtime, which is appropriate for its purpose.
Persistence & Privilege
`always` is set to false, and the skill does not request persistent or system-level privileges or modify other skills. Autonomous invocation is allowed by default but is not combined with broad credentials or suspicious behavior here.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install litebrowse
  3. After installation, invoke the skill by name or use /litebrowse
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.1
Bundle extractor script inside published skill
v0.1.0
Initial release
Metadata
Slug litebrowse
Version 0.1.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 2
Frequently Asked Questions

What is LiteBrowse?

LiteBrowse extracts and summarizes the most relevant webpage passages for focused, low-token research without loading or summarizing the full page. It is an AI agent skill for Claude Code / OpenClaw, with 140 downloads so far.

How do I install LiteBrowse?

Run "/install litebrowse" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is LiteBrowse free?

Yes, LiteBrowse is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does LiteBrowse support?

LiteBrowse is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created LiteBrowse?

It is built and maintained by agitalent (@agitalent); the current version is v0.1.1.
