← Back to Skills Marketplace
bill492

Browser Read

by bill492 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
228
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install browser-read
Description
Extract readable content from browser pages as markdown. Use when web_fetch fails (bot protection, auth-required pages, Twitter/X, LinkedIn) and you already...
README (SKILL.md)

browser-read

Extract readable text from an already-open browser page and return markdown, suitable for pages where web_fetch is blocked or missing auth context.

When to use

  • web_fetch returned an error or empty content.
  • Page requires authentication/cookies/session state available only in the browser.
  • You need text extraction from Twitter/X or LinkedIn timelines/articles where screenshot/OCR was previously used.

When NOT to use

  • web_fetch already returns good markdown/text (faster and cheaper).
  • Purely static pages where normal fetch is sufficient.

Steps

  1. Navigate to the URL with browser navigate.
  2. Read extraction script from ~/clawd/skills/browser-read/extract.js.
  3. Run browser act with kind=evaluate and pass the script contents as fn.
  4. Script returns {title, content, excerpt, byline, siteName, length} where content is markdown.
  5. If extraction fails or returns empty content, script falls back to document.body.innerText.

Example (tool calls)

{
  "action": "navigate",
  "targetId": "...", 
  "url": "https://example.com"
}
{
  "action": "act",
  "targetId": "...",
  "kind": "evaluate",
  "fn": "(() => { ... return {title, content, excerpt, byline, siteName, length}; })()"
}

Notes

  • extract.js is a self-contained IIFE so it can be passed directly as the fn value to browser act.
  • Keep in mind this is a lightweight extractor; it intentionally strips script/style/nav/header/footer/aside/cookie/ad elements before conversion.
Usage Guidance
This skill appears coherent and low-risk: it runs a JS extractor inside the browser page and returns markdown, and it does not request secrets or make network calls. Before installing, confirm that (1) the agent's file-access is limited to the skill directory (the SKILL.md references ~/clawd/skills/browser-read/extract.js), (2) you are comfortable allowing the agent to evaluate scripts in pages you open (the script can read page content, metadata, and URLs), and (3) you avoid running it on highly sensitive pages (banking, 2FA flows) or pages where extracting content violates terms. If you want extra assurance, review extract.js yourself and test it on a non-sensitive page first.
Capability Analysis
Type: OpenClaw Skill Name: browser-read Version: 1.0.0 The browser-read skill is a utility designed to extract readable markdown content from web pages currently open in a browser, particularly useful for authenticated or bot-protected sites like LinkedIn or X (Twitter). The core logic in extract.js performs standard DOM cleaning and transformation into markdown without any evidence of data exfiltration to third parties, credential theft, or obfuscated malicious code.
Capability Assessment
Purpose & Capability
Name/description (extract readable content from an open browser page) match the included JS and SKILL.md. No unrelated binaries or credentials are requested.
Instruction Scope
Instructions tell the agent to read the extractor from ~/clawd/skills/browser-read/extract.js and run it in the browser via browser act evaluate. Reading the skill's own file and executing it in the page context is expected for this purpose; verify the runtime will only read the skill path (not arbitrary host files) and that browser evaluate is appropriately sandboxed.
Install Mechanism
No install spec — instruction-only with one included JS file. Nothing is downloaded from external URLs and no archives are extracted.
Credentials
No environment variables, credentials, or config paths are required. The extract script only reads DOM content, metadata, and document.location (expected for content extraction).
Persistence & Privilege
Skill is not always-enabled and is user-invocable. It does not request elevated persistence or modify other skills or system-wide settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install browser-read
  3. After installation, invoke the skill by name or use /browser-read
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of browser-read skill for extracting readable content from open browser pages as markdown. - Designed to handle cases where web_fetch fails due to bot protection, authentication requirements, or dynamic content (e.g., Twitter/X, LinkedIn). - Provides markdown output with metadata: title, excerpt, byline, site name, and content length. - Falls back to extracting plain text from document body if primary extraction fails. - Not intended for use when web_fetch already works or on static pages.
Metadata
Slug browser-read
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Browser Read?

Extract readable content from browser pages as markdown. Use when web_fetch fails (bot protection, auth-required pages, Twitter/X, LinkedIn) and you already... It is an AI Agent Skill for Claude Code / OpenClaw, with 228 downloads so far.

How do I install Browser Read?

Run "/install browser-read" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Browser Read free?

Yes, Browser Read is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Browser Read support?

Browser Read is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Browser Read?

It is built and maintained by bill492 (@bill492); the current version is v1.0.0.

💬 Comments