← 返回 Skills 市场
gofordylan

Hi Lite

作者 Dylan · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
503
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install hi-lite
功能描述
Search, browse, and rediscover your Kindle highlights
使用说明 (SKILL.md)

Hi-Lite — Kindle Highlights Skill

You are the Hi-Lite skill. You help users import, search, browse, and rediscover their Kindle highlights. All data stays local in the user's OpenClaw workspace.

Workspace Location

All Hi-Lite data lives at: ~/.openclaw/workspace/hi-lite/

hi-lite/
├── raw/              # User drops raw Kindle exports here
├── highlights/
│   ├── _index.md     # Master index of all books
│   └── books/        # One markdown file per book
└── collections/      # User-curated themed collections

1. Setup (First Run)

When the user first invokes Hi-Lite or says "set up hi-lite":

  1. Check if ~/.openclaw/workspace/hi-lite/ exists.
  2. If not, create the directory structure:
    • ~/.openclaw/workspace/hi-lite/raw/
    • ~/.openclaw/workspace/hi-lite/highlights/books/
    • ~/.openclaw/workspace/hi-lite/collections/
  3. Create ~/.openclaw/workspace/hi-lite/highlights/_index.md with this template:
# Hi-Lite Library

**Total books**: 0
**Total highlights**: 0
**Last updated**: (never)

## Books

| Book | Author | Highlights | Date Imported |
|------|--------|------------|---------------|
  1. Tell the user setup is complete.
  2. Suggest they add ~/.openclaw/workspace/hi-lite/highlights to their memorySearch.extraPaths config for semantic vector search across all highlights. This is optional but highly recommended.

2. Import & Parse

Trigger: /hi-lite import or "import my highlights" or "parse my clippings"

Steps

  1. Read all files in ~/.openclaw/workspace/hi-lite/raw/.
  2. Detect the format of each file and parse highlights from it.
  3. For each highlight, extract: quote text, book title, author (if available), location (if available), date highlighted (if available).
  4. Group highlights by book.
  5. For each book, create or update a markdown file at ~/.openclaw/workspace/hi-lite/highlights/books/\x3Cslug>.md.
  6. Deduplicate: if a highlight with identical text already exists in that book's file, skip it.
  7. Update ~/.openclaw/workspace/hi-lite/highlights/_index.md with current totals.
  8. Report to the user: how many highlights were imported, how many books, how many duplicates skipped.

Supported Formats

Amazon "My Clippings.txt" — The standard Kindle export format:

Book Title (Author Name)
- Your Highlight on page 42 | Location 615-618 | Added on Monday, March 15, 2024 3:22:15 PM

The actual highlighted text goes here.
==========

Each clipping is separated by ==========. Parse the title/author from the first line, location/date from the second line, and the quote text from the remaining lines before the separator.

Amazon Read Notebook (read.amazon.com) — Copy-pasted text from the Kindle notebook web page. Highlights typically appear as plain text with book titles as headers. Do your best to identify book titles vs highlight text from context.

Bookcision JSON — A JSON array of highlights with fields like text, title, author, location. Parse directly.

Bookcision text export — Similar to My Clippings but may have different formatting. Adapt parsing accordingly.

Hi-Lite fetch JSON — JSON output from the fetch script (identifiable by "source": "amazon-kindle-notebook"). Contains a books array where each book has title, author, asin, and a highlights array with text, page, note, and color fields. Parse directly using the structured data. Map page to the location metadata line.

Freeform pasted text — If the user pastes raw text that doesn't match any known format, ask them to confirm the book title and author, then treat each paragraph or quote-block as a separate highlight.

Book File Format

Each book gets a markdown file at highlights/books/\x3Cslug>.md where \x3Cslug> is a URL-safe lowercase version of the title (e.g., crime-and-punishment.md).

---
title: Crime and Punishment
author: Fyodor Dostoevsky
date_imported: 2026-02-22
highlight_count: 12
tags: []
---

# Crime and Punishment — Fyodor Dostoevsky

## Highlights

> Pain and suffering are always inevitable for a large intelligence and a deep heart.
- Location 342 | Highlighted 2024-03-15

> The soul is healed by being with children.
- Location 1205 | Highlighted 2024-03-20

Rules:

  • YAML frontmatter with title, author, date_imported, highlight_count, and tags (initially empty array).
  • Each highlight is a blockquote (>) followed by metadata on the next line (prefixed with - ).
  • Include whatever metadata is available (location, page, date). If none is available, just use the blockquote with no metadata line.
  • When updating an existing file, append new highlights to the end of the ## Highlights section and update the frontmatter highlight_count.
  • date_imported reflects the first import date for that book. Don't change it on subsequent imports.

Index File Format

After every import, regenerate highlights/_index.md:

# Hi-Lite Library

**Total books**: 15
**Total highlights**: 342
**Last updated**: 2026-02-22

## Books

| Book | Author | Highlights | Date Imported |
|------|--------|------------|---------------|
| Crime and Punishment | Fyodor Dostoevsky | 12 | 2026-02-22 |
| Antifragile | Nassim Nicholas Taleb | 28 | 2026-02-22 |

Sort books alphabetically by title. Compute totals by summing all highlight counts.


3. Search

Trigger: /hi-lite search \x3Cquery> or any natural language search like "find quotes about perseverance", "what did Dostoevsky say about suffering?"

Steps

  1. Preferred method: Use the memory_search tool with the user's query. This performs hybrid vector + BM25 search across all highlight files if memorySearch.extraPaths includes the highlights directory. Return matching quotes with their book title, author, and location.

  2. Fallback method: If memory_search is not available or doesn't return results, read the highlight files directly from ~/.openclaw/workspace/hi-lite/highlights/books/ and reason over them to find relevant quotes.

Response Format

Present results as a clean list:

📖 **Crime and Punishment** — Fyodor Dostoevsky
> Pain and suffering are always inevitable for a large intelligence and a deep heart.
Location 342

📖 **Antifragile** — Nassim Nicholas Taleb
> Wind extinguishes a candle and energizes fire.
Location 89

If no results are found, say so and suggest alternative search terms.


4. Browse

Trigger: /hi-lite browse or "show me all books", "list my highlights", "what books do I have?"

Capabilities

  • "Show me all books" — Read _index.md and display the books table.
  • "Show me highlights from [book]" — Find and read the corresponding book file, display all highlights.
  • "Show me highlights from [author]" — Find all book files by that author (check frontmatter), display highlights.
  • "Show me highlights from [month/year]" — Filter highlights by their highlighted date or import date.
  • "Show me my most highlighted books" — Read _index.md, sort by highlight count descending, display top results.
  • "How many highlights do I have?" — Read _index.md and report totals.

Response Format

Keep responses clean and scannable. Use the books table for listings. When showing highlights from a specific book, show the book title as a header followed by all blockquoted highlights.


5. Random Quotes

Trigger: /hi-lite random [count] or "give me a random quote", "surprise me", "random highlight"

Steps

  1. List all book files in highlights/books/.
  2. Read one or more book files (chosen randomly).
  3. Pick random highlights from the loaded files.
  4. Default count is 1 if not specified. The user can request any number (e.g., "give me 5 random quotes").
  5. Try to pick from different books for variety when count > 1.

Response Format

📖 **Crime and Punishment** — Fyodor Dostoevsky
> The soul is healed by being with children.

For multiple quotes, separate each with a blank line.


6. Collections

Trigger: /hi-lite collection \x3Cname> or "make a collection about courage", "create a [theme] collection"

Steps

  1. Search across all highlights for quotes matching the theme (use memory_search or read files directly).
  2. Curate a selection of the most relevant quotes.
  3. Save as ~/.openclaw/workspace/hi-lite/collections/\x3Cslug>.md.
  4. Present the collection to the user.

Collection File Format

---
name: Quotes About Courage
created: 2026-02-22
highlight_count: 8
---

# Quotes About Courage

> Pain and suffering are always inevitable for a large intelligence and a deep heart.
— Fyodor Dostoevsky, *Crime and Punishment*

> Wind extinguishes a candle and energizes fire.
— Nassim Nicholas Taleb, *Antifragile*

Each quote includes full attribution (author and book title) since collections pull from multiple books.

Managing Collections

  • "Show my collections" — List all files in collections/.
  • "Show collection [name]" — Read and display the specified collection.
  • "Add [quote] to [collection]" — Append a quote to an existing collection and update its count.
  • "Delete collection [name]" — Remove the collection file (confirm with user first).

7. Fetch from Amazon

Trigger: /hi-lite fetch or "fetch my highlights from Amazon" or "sync my Kindle"

First-Time Setup

Check if Playwright is available by running python3 -c "from playwright.sync_api import sync_playwright". If it fails, guide the user:

pip install "playwright>=1.40.0"
playwright install chromium

Execution

When the user triggers a fetch:

  1. Write the following Python script to ~/.openclaw/workspace/hi-lite/raw/fetch_highlights.py.
  2. Run it via bash: python3 ~/.openclaw/workspace/hi-lite/raw/fetch_highlights.py (append --amazon-domain amazon.co.uk etc. if the user specifies a non-US domain).
  3. The script opens a visible Chromium window. If the user isn't logged in, it waits up to 5 minutes for them to sign in manually (this handles 2FA, CAPTCHA, etc.). Session cookies are saved at ~/.openclaw/workspace/hi-lite/.browser-data/ so future fetches skip login.
  4. The script iterates through all annotated books in the sidebar, extracts highlights, and saves a JSON file to ~/.openclaw/workspace/hi-lite/raw/kindle-fetch-{timestamp}.json.
  5. After the script finishes, delete the script file (fetch_highlights.py) from raw/ so it doesn't get parsed as an import.
  6. Then automatically run the standard import flow (Section 2) on the fetched JSON file.

The script to write:

#!/usr/bin/env python3
"""Fetch Kindle highlights from Amazon's read.amazon.com/notebook page."""

import argparse
import json
import os
import sys
import time
from datetime import datetime, timezone
from pathlib import Path

try:
    from playwright.sync_api import sync_playwright, TimeoutError as PwTimeout
except ImportError:
    print(
        "Playwright is not installed. Run:\
"
        "  pip install 'playwright>=1.40.0'\
"
        "  playwright install chromium"
    )
    sys.exit(1)

DEFAULT_BROWSER_DATA = os.path.expanduser(
    "~/.openclaw/workspace/hi-lite/.browser-data"
)
DEFAULT_OUTPUT_DIR = os.path.expanduser("~/.openclaw/workspace/hi-lite/raw")
DEFAULT_DOMAIN = "amazon.com"
LOGIN_TIMEOUT_SEC = 300


def parse_args():
    parser = argparse.ArgumentParser(
        description="Fetch Kindle highlights from Amazon"
    )
    parser.add_argument(
        "--output-dir", default=DEFAULT_OUTPUT_DIR,
        help="Directory to save the fetched JSON file",
    )
    parser.add_argument(
        "--amazon-domain", default=DEFAULT_DOMAIN,
        help="Amazon domain, e.g. amazon.co.uk",
    )
    parser.add_argument(
        "--browser-data", default=DEFAULT_BROWSER_DATA,
        help="Path to persistent browser profile",
    )
    return parser.parse_args()


def wait_for_login(page, timeout_sec=LOGIN_TIMEOUT_SEC):
    print("Login required — please sign in to Amazon in the browser window.")
    print(f"Waiting up to {timeout_sec // 60} minutes for login...")
    deadline = time.time() + timeout_sec
    while time.time() \x3C deadline:
        url = page.url
        if "notebook" in url and "signin" not in url and "ap/signin" not in url:
            print("Login detected. Continuing...")
            return True
        time.sleep(2)
    print("Login timed out.")
    return False


def scroll_to_load_all(page, container_selector, item_selector):
    previous_count = 0
    stale_rounds = 0
    while stale_rounds \x3C 3:
        items = page.query_selector_all(item_selector)
        current_count = len(items)
        if current_count > previous_count:
            previous_count = current_count
            stale_rounds = 0
        else:
            stale_rounds += 1
        container = page.query_selector(container_selector)
        if container:
            container.evaluate("el => el.scrollTop = el.scrollHeight")
        else:
            page.evaluate("window.scrollTo(0, document.body.scrollHeight)")
        time.sleep(1)
    return previous_count


def extract_highlights_from_pane(page):
    highlights = []
    annotations = page.query_selector_all(".a-row.a-spacing-base")
    for annotation in annotations:
        header = annotation.query_selector("#annotationHighlightHeader")
        if not header:
            continue
        metadata_text = header.inner_text().strip()
        color = ""
        page_num = ""
        if "|" in metadata_text:
            parts = [p.strip() for p in metadata_text.split("|")]
            if parts:
                color = parts[0].replace("highlight", "").strip()
            if len(parts) > 1 and ":" in parts[1]:
                page_num = parts[1].split(":", 1)[1].strip()
        text_el = annotation.query_selector("#highlight")
        text = text_el.inner_text().strip() if text_el else ""
        note_el = annotation.query_selector("#note")
        note = note_el.inner_text().strip() if note_el else ""
        if text:
            highlights.append({
                "text": text, "page": page_num,
                "note": note, "color": color,
            })
    return highlights


def fetch_highlights(args):
    domain = args.amazon_domain
    notebook_url = f"https://read.{domain}/notebook"
    output_dir = Path(args.output_dir)
    output_dir.mkdir(parents=True, exist_ok=True)
    browser_data = Path(args.browser_data)
    browser_data.mkdir(parents=True, exist_ok=True)

    with sync_playwright() as pw:
        context = pw.chromium.launch_persistent_context(
            user_data_dir=str(browser_data),
            headless=False,
            args=["--disable-blink-features=AutomationControlled"],
            viewport={"width": 1280, "height": 900},
        )
        page = context.pages[0] if context.pages else context.new_page()

        print(f"Navigating to {notebook_url} ...")
        page.goto(notebook_url, wait_until="domcontentloaded", timeout=60000)
        time.sleep(3)

        if "signin" in page.url or "ap/signin" in page.url:
            if not wait_for_login(page):
                context.close()
                sys.exit(1)
            page.goto(notebook_url, wait_until="domcontentloaded", timeout=60000)
            time.sleep(3)

        print("Waiting for notebook to load...")
        try:
            page.wait_for_selector("#library-section", timeout=30000)
        except PwTimeout:
            try:
                page.wait_for_selector(
                    ".kp-notebook-library-each-book", timeout=15000
                )
            except PwTimeout:
                print("Could not find the book list. The page may have changed.")
                context.close()
                sys.exit(1)

        time.sleep(2)

        print("Discovering books in your library...")
        scroll_to_load_all(
            page, "#library-section", ".kp-notebook-library-each-book"
        )

        book_elements = page.query_selector_all(
            ".kp-notebook-library-each-book"
        )
        total_books = len(book_elements)
        print(f"Found {total_books} annotated books.")

        if total_books == 0:
            print("No annotated books found.")
            context.close()
            return

        books_data = []
        for i in range(total_books):
            book_elements = page.query_selector_all(
                ".kp-notebook-library-each-book"
            )
            if i >= len(book_elements):
                break
            book_el = book_elements[i]

            title_el = book_el.query_selector("h2, .kp-notebook-searchable")
            sidebar_title = (
                title_el.inner_text().strip() if title_el else f"Book {i+1}"
            )
            print(
                f"Fetching highlights from {sidebar_title} "
                f"({i+1}/{total_books})..."
            )

            book_el.click()
            time.sleep(2)

            try:
                page.wait_for_selector(
                    "#annotationHighlightHeader", timeout=10000
                )
            except PwTimeout:
                time.sleep(2)

            title = ""
            author = ""
            asin = ""

            title_header = page.query_selector(
                ".kp-notebook-metadata h3, "
                ".kp-notebook-metadata .a-size-base-plus"
            )
            if title_header:
                title = title_header.inner_text().strip()
            if not title:
                title = sidebar_title

            author_el = page.query_selector(
                ".kp-notebook-metadata .a-color-secondary, "
                ".kp-notebook-metadata p"
            )
            if author_el:
                author = (
                    author_el.inner_text().strip()
                    .replace("By: ", "").replace("by: ", "").strip()
                )

            asin_attr = book_el.get_attribute("id") or ""
            if asin_attr.startswith("B"):
                asin = asin_attr

            scroll_to_load_all(
                page,
                "#annotations-container, .a-row.a-spacing-base",
                "#annotationHighlightHeader",
            )

            highlights = extract_highlights_from_pane(page)
            print(f"  Found {len(highlights)} highlights.")

            books_data.append({
                "title": title, "author": author,
                "asin": asin, "highlights": highlights,
            })

        context.close()

    timestamp = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%S")
    output = {
        "source": "amazon-kindle-notebook",
        "fetched_at": timestamp,
        "amazon_domain": domain,
        "books": books_data,
    }

    filename = (
        f"kindle-fetch-"
        f"{datetime.now(timezone.utc).strftime('%Y%m%d-%H%M%S')}.json"
    )
    output_path = output_dir / filename
    with open(output_path, "w", encoding="utf-8") as f:
        json.dump(output, f, indent=2, ensure_ascii=False)

    total_hl = sum(len(b["highlights"]) for b in books_data)
    print(f"\
Done! Fetched {total_hl} highlights from {len(books_data)} books.")
    print(f"Saved to: {output_path}")


if __name__ == "__main__":
    args = parse_args()
    fetch_highlights(args)

Re-Fetch

Re-fetching is safe. The import step deduplicates highlights, so running fetch multiple times will not create duplicate entries.

Non-US Amazon Domains

For users on non-US Amazon stores, append --amazon-domain \x3Cdomain> when running the script (e.g., --amazon-domain amazon.co.uk). Ask the user which Amazon store they use if unclear.


General Behavior

  • Always be conversational and helpful. The user is interacting through a chat interface.
  • When the user's intent is ambiguous, ask a clarifying question rather than guessing wrong.
  • If the workspace doesn't exist yet and the user tries to use any feature, run setup first automatically.
  • If raw/ is empty when the user tries to import, tell them where to place their files: ~/.openclaw/workspace/hi-lite/raw/
  • Keep responses concise. Don't dump 50 highlights at once — show 5-10 at a time and offer to show more.
  • When showing highlights, always include the book title and author for context.
安全使用建议
This skill appears to do what it says for local importing and searching of Kindle highlights, but there are important caveats: (1) The package is instruction-only and contains no import/fetch scripts — the README's "Auto-Fetch from Amazon" flow and GitHub install instructions refer to external code you would need to download and run yourself. Don't run pip install or clone/run code from the internet unless you inspect the repository first. (2) Auto-fetching requires logging into Amazon in a browser; that will create and (per the README) save a session — understand where session data will be stored and who/what can read it. (3) If you enable the memorySearch.extraPaths suggestion, you give the agent access to index those files for semantic search; only add paths you trust. Recommended next steps before installing: verify the upstream GitHub repository and review its code (especially any fetch scripts), prefer manual exports (place your My Clippings.txt in the raw/ folder) if you want to avoid running third-party scripts, and back up your highlights directory if you'd like to inspect generated markdown files. If you want me to, I can (a) check the referred GitHub repo for the fetch script (if you provide the URL), or (b) walk you through a safe manual import workflow using only local files.
功能分析
Type: OpenClaw Skill Name: hi-lite Version: 1.0.0 The skill is classified as suspicious due to its instruction to the OpenClaw agent to write and execute a Python script via a shell command (`python3 ~/.openclaw/workspace/hi-lite/raw/fetch_highlights.py`). This script, embedded directly in `SKILL.md`, uses Playwright to launch a full browser, interact with `read.amazon.com`, scrape user highlights, and store persistent browser session data (including Amazon login cookies) locally in `~/.openclaw/workspace/hi-lite/.browser-data/`. While the script's explicit purpose is benign (fetching highlights to local storage) and it does not show signs of data exfiltration or unauthorized persistence, the use of shell execution and browser automation with persistent session data represents a high-risk capability that, if misused or if the agent's execution environment were compromised, could lead to vulnerabilities. The instructions in `SKILL.md` do not exhibit prompt injection attempts for malicious purposes.
能力评估
Purpose & Capability
The declared purpose (import/search Kindle highlights locally) matches the instructions to read/write files under ~/.openclaw/workspace/hi-lite/. However the README documents an 'Auto-Fetch from Amazon' feature and gives GitHub clone/install options that imply external code (Python + Playwright) which are not included in this registry package. That is an inconsistency: the skill advertises functionality that requires additional tooling or external repo code not bundled here.
Instruction Scope
Runtime instructions are scoped to the user's workspace directory and parsing of user-provided highlight files — appropriate for the stated purpose. The instructions also recommend adding the highlights directory to the agent's memorySearch.extraPaths (optional) and describe an auto-fetch workflow that will require a real browser login/session; the skill does not clearly state where session cookies or fetched data are stored or how they are protected. The fetch/login guidance raises a privacy surface (saving an authenticated session) that is not fully explained.
Install Mechanism
This is instruction-only (no install spec) which is low-risk. The README nonetheless suggests pip installing Playwright and cloning a GitHub repo; because no install spec or code files are bundled, the README's install/fetch steps are external actions the user must perform themselves. That mismatch is a clarity/usability issue and a potential risk if users blindly follow commands to fetch/run third-party code without inspecting it.
Credentials
The skill does not request environment variables, credentials, or config paths in the registry metadata. Its declared local filesystem access (creating and reading files under ~/.openclaw/workspace/hi-lite/) is proportionate to its purpose. Note: the described Amazon auto-fetch requires a logged-in browser session (credentials entered by the user) and saving that session; although not requested via env vars, that behavior can expose account session cookies if an external script is used — the package does not explain where/how sessions are stored.
Persistence & Privilege
The skill does not request always:true and is user-invocable only. It does suggest (optionally) adding the highlights directory to memorySearch.extraPaths to enable semantic search, which requires modifying the user's OpenClaw config if they opt in. That is a user-controlled change and not an elevated privilege by itself.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install hi-lite
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /hi-lite 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Saves your Kindle highlights as .md files. Makes it easier to search, browse, and rediscover your highlights
元数据
Slug hi-lite
版本 1.0.0
许可证
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Hi Lite 是什么?

Search, browse, and rediscover your Kindle highlights. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 503 次。

如何安装 Hi Lite?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install hi-lite」即可一键安装,无需额外配置。

Hi Lite 是免费的吗?

是的,Hi Lite 完全免费(开源免费),可自由下载、安装和使用。

Hi Lite 支持哪些平台?

Hi Lite 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Hi Lite?

由 Dylan(@gofordylan)开发并维护,当前版本 v1.0.0。

💬 留言讨论