← 返回 Skills 市场
yzqzuigao-ui

Document Learning

作者 yzqzuigao-ui · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
69
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install document-learning
功能描述
Comprehensive document learning system with progress tracking, resume capability, and long-term memory integration. Use when you need to read PDF/text docume...
使用说明 (SKILL.md)

Document Learning System

A complete system for reading documents, tracking progress across multiple sessions, and building long-term memory.

Quick Start

To start learning a document:

Please learn this document: [filename]

To resume from where you left off:

Resume learning [filename] from last position

To check current progress:

What's my progress on learning [filename]?

Core Features

1. Large File Support

  • Handles PDF files up to hundreds of MBs
  • Text-based documents (TXT, MD, LOG, etc.)
  • Chunked processing to avoid memory issues
  • Automatic encoding detection and recovery

2. Progress Tracking & Resume

  • Bookmark system: Automatically tracks your position (chapter/page)
  • Resume capability: Pick up exactly where you left off, even days later
  • Progress visualization: See what's been learned vs remaining
  • Manual control: Jump to specific chapters/pages if needed

3. Long-term Memory Integration

  • Extracts key concepts automatically
  • Stores learning summaries in MEMORY.md
  • Creates session notes in memory/YYYY-MM-DD.md
  • Builds a searchable knowledge base over time

How It Works

Session Flow

  1. Start Learning

    • Upload or specify document path
    • System reads and extracts content
    • Creates initial progress entry
  2. Track Progress

    • After each session, note where you stopped
    • Progress is saved automatically
    • Key points are extracted to memory files
  3. Resume Later

    • Say "resume learning [document]"
    • System loads last position
    • Continues from there seamlessly
  4. Build Knowledge Base

    • Over time, all learned content accumulates in MEMORY.md
    • Use memory_search to find specific topics later
    • Create comprehensive summaries periodically

Multi-Document Learning Support 📚🔄

You can now learn multiple PDFs simultaneously and switch between them!

How It Works

The system maintains a progress store that tracks ALL your learning documents:

  • ✅ Track progress for Document A, B, C... all at once
  • ✅ Learn Document A → Pause → Learn Document B → Pause → Resume Document A
  • ✅ Each document remembers its exact position (chapter/page)
  • ✅ Switch context seamlessly between different documents

Multi-Document Workflow Example

Day 1: Start learning PDF-A
User: Please learn this driver manual for me.
Assistant: Started tracking itop-3568-driver.pdf
   Progress: Page 0/450 (not started)

[Read some content]
User: Pause here, let's learn another document
Assistant: ✅ Saved progress for PDF-A at page 120/450
   Now learning PDF-B: user-guide.pdf

[Start PDF-B]
User: Please also learn this user guide
Assistant: Started tracking user-guide.pdf
   Progress: Page 0/80 (not started)

[Read some of PDF-B]
User: Pause, back to the driver manual
Assistant: ✅ Saved progress for PDF-B at page 35/80
   Switching context back to PDF-A...
   Resume from page 120/450

Day 2: Continue learning different documents
User: Let's continue with the user guide today
Assistant: Loading progress for user-guide.pdf...
   Last read: Page 35/80 (PDF-B)
   Continuing from where we left off...

[After reading PDF-B]
User: Show me status of all my learning documents
Assistant: 📚 Document Learning Status
   ==================================
   
   ✅ itop-3568-driver.pdf
      Progress: 120/450 pages (27%)
      Last updated: Today
   
   🔄 user-guide.pdf
      Progress: 50/80 pages (63%)
      Last updated: Just now

Day 3: Query specific document knowledge
User: What did I learn about the driver installation in PDF-A?
Assistant: [Searches memory and provides answer from PDF-A]
Based on my notes from itop-3568-driver.pdf...

Commands for Multi-Document Learning

Start a new document

Please learn this document: /path/to/document.pdf

Pause current, start another

Pause learning [current-doc], let's learn [new-doc]

Switch back to previous document

Resume learning [doc-name] from last position

Check all documents status

What's my progress on all my learning documents?
Show me the status of everything I'm learning

Key Benefits

  1. No interference: Each document has independent progress tracking
  2. Seamless switching: System automatically loads correct position for each doc
  3. Complete history: All learning sessions are preserved and searchable
  4. Flexible workflow: Learn at your own pace across multiple sources

Usage Patterns

Pattern 1: Learning a Technical Manual (e.g., PDF)

User: Here's a driver manual. Please learn it for me.
Assistant: I'll read the document, extract key points, and save progress. Let me start with Chapter 1...

[After reading first chapter]
Assistant: Progress update: Read chapters 1-2 of 15. Saved to memory. Ready to continue when you are!

[Days later - resume]
User: Resume learning the driver manual
Assistant: Loading last position (chapter 3)... Continuing from where we left off...

Pattern 2: Learning a Book/Text Document

User: I want to learn this book. Track my progress chapter by chapter.
Assistant: Got it! I'll read each chapter and save the summary after every session. Progress tracking enabled.

[After Chapter 1]
Assistant: ✅ Chapter 1 complete - Summary saved to memory/YYYY-MM-DD.md
📊 Progress: 1/24 chapters (4%)

[Next day]
User: Continue from last position
Assistant: Resuming at Chapter 2...

Pattern 3: Reference Learning (Look-up as needed)

User: I need to learn this specification document for reference.
Assistant: I'll read it and create a searchable knowledge base. You can later ask questions like "What does section 5 say about API limits?"

Later: What are the API rate limits mentioned in the spec?
Assistant: [Searches memory and provides answer from learned content]

Commands & Tools

Built-in Script

The skill includes scripts/read_large_document.py which handles:

  • PDF reading with pdfplumber
  • Text file encoding detection
  • Progress saving/loading
  • Key point extraction

Memory Integration Files

Read these for detailed integration patterns:

Best Practices

  1. Start small: Begin with one document at a time
  2. Regular updates: Add progress notes after each session, even brief ones
  3. Use summaries: Ask for chapter summaries to reinforce learning
  4. Review periodically: Use memory_search to find what you've learned before
  5. Be specific: When asking about content, mention the document name

Limitations & Notes

  • PDFs must be text-based (not scanned images) for optimal reading
  • Very large PDFs (>200MB) may take longer to process
  • Progress tracking works best with structured documents (chapters/sections)
  • For scanned PDFs, consider OCR tools first

Examples

Example 1: Learning a Driver Manual

User: Please learn this driver manual for me. It's in /workspace/linuxdriver/itop-3568-driver-v2.8.pdf
Assistant: I'll start reading the document now...

[Reads and extracts]
✅ Document loaded: itop-3568-driver-v2.8.pdf (120MB)
📑 Total pages: 450
🔑 Key concepts extracted: Hardware specs, Pinout details, Driver installation...

Progress saved! You can resume anytime with "Resume learning the driver manual"

Example 2: Resume After Days

User: Resume learning the driver manual from last position
Assistant: Loading progress... Last read: page 156 of 450 (Chapter 7)
Continuing from where we left off...

[Reads next section]
✅ Chapter 7 complete - Progress updated to page 234/450

Example 3: Query Learned Content

User: What did I learn about the pinout configuration?
Assistant: [Searches memory and provides answer from driver manual learning session]
Based on my notes from the itop-3568 driver manual (learned on April 6):
- GPIO pins are numbered X, Y, Z...
- Power supply requirements: 5V at 2A...

Ready to start learning? Just tell me which document you want to learn! 📚

安全使用建议
This skill appears to do what it says: reading documents, chunked extraction, and saving progress and summaries to local memory files. Before using: (1) only supply document paths you trust (the scripts will read any path you give them); (2) expect the skill to create/update .multi_doc_learning_progress.json in the current working directory and .document_learning_progress.json alongside documents and to write memory/YYYY-MM-DD.md or MEMORY.md as described; (3) there are no network calls or credential requests, but pdfplumber is needed to read PDFs (install it in a safe environment if required); (4) if you are concerned, review the two included Python scripts or run them in a sandboxed workspace to confirm behavior.
功能分析
Type: OpenClaw Skill Name: document-learning Version: 1.0.0 The skill bundle provides document reading and progress tracking capabilities through Python scripts (read_large_document.py and multi_doc_progress.py) that interact with the filesystem. While the functionality aligns with the stated purpose, the scripts exhibit high-risk behaviors by performing unsanitized file read/write operations based on agent-provided paths, which could be exploited via prompt injection to access sensitive system files (path traversal). Additionally, the scripts create hidden metadata files (.multi_doc_learning_progress.json) in the current directory and potentially in arbitrary document directories, and the SKILL.md instructions direct the agent to automatically modify workspace memory files (MEMORY.md), representing a significant side-effect surface.
能力评估
Purpose & Capability
Name/description match the included scripts and docs: both Python scripts implement reading large PDFs/text, chunking, extracting key points, and tracking progress across multiple documents. There are no unexpected credential or network requirements.
Instruction Scope
SKILL.md and references instruct the agent to read documents (paths provided by the user), save progress, and integrate notes into MEMORY.md / memory/YYYY-MM-DD.md. This is expected, but the skill will read any file path you give it and will create/update local progress and memory files in the workspace.
Install Mechanism
No install spec is present (instruction-only with bundled scripts). The scripts optionally use pdfplumber (standard Python package) if available; nothing is downloaded or executed from remote URLs.
Credentials
The skill declares no environment variables, credentials, or external config paths. The file I/O it performs (reading documents, writing .multi_doc_learning_progress.json and per-document .document_learning_progress.json, and writing memory files) is proportional to its stated purpose.
Persistence & Privilege
The skill is not always-enabled and does not request elevated platform privileges, but it will create/update local progress files ('.multi_doc_learning_progress.json' in cwd and '.document_learning_progress.json' in document directories) and may add entries to MEMORY.md / memory/ files per its documentation. That is normal for this functionality but worth knowing.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install document-learning
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /document-learning 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of the comprehensive document learning system with multi-document support. - Supports reading and learning from large PDF and text documents with automatic encoding detection. - Tracks progress (chapter/page) for each document, allowing you to pause, resume, and switch between multiple documents seamlessly. - Extracts and saves key concepts and learning summaries into a searchable long-term memory (MEMORY.md). - Provides commands for checking progress, resuming from the last position, and reviewing all active learning documents. - Handles large files via chunked processing to ensure stable performance.
元数据
Slug document-learning
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Document Learning 是什么?

Comprehensive document learning system with progress tracking, resume capability, and long-term memory integration. Use when you need to read PDF/text docume... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 69 次。

如何安装 Document Learning?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install document-learning」即可一键安装,无需额外配置。

Document Learning 是免费的吗?

是的,Document Learning 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Document Learning 支持哪些平台?

Document Learning 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Document Learning?

由 yzqzuigao-ui(@yzqzuigao-ui)开发并维护,当前版本 v1.0.0。

💬 留言讨论