← Back to Skills Marketplace
xb19960921

knowledge-health-checker

by xb19960921 · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ✓ Security Clean
64
Downloads
1
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install knowledge-health-checker
Description
Audit and improve Markdown knowledge-base health across Obsidian, Logseq, Notion exports, docs folders, and wiki repositories. Detect empty placeholder notes...
README (SKILL.md)

Knowledge Health Checker

Knowledge Health Checker audits a Markdown-based knowledge base as a living system, not a folder full of files.

It detects whether the knowledge garden is:

  • connected or fragmented
  • dense or hollow
  • current or stale
  • navigable or full of dead links
  • safe to auto-fix or requiring human review

The goal is not only to find problems, but to produce a prioritized, safe, actionable health report.


When to Use

Use this skill for:

  • Obsidian vault cleanup
  • Logseq / Notion Markdown export review
  • documentation repository health checks
  • wiki linting before migration or publishing
  • broken link detection
  • empty placeholder / TODO note detection
  • orphan note and graph fragmentation analysis
  • content density and structure quality review
  • periodic knowledge-base maintenance

Do not use it for semantic fact-checking. This skill checks structure, links, density, freshness, and maintainability, not whether every claim is true.


Core Principle

A healthy knowledge base has four properties:

  1. Substance — notes contain enough content to be useful.
  2. Connectivity — important notes are linked into the graph.
  3. Navigability — links, headings, and structure help readers move through knowledge.
  4. Maintainability — stale, broken, duplicate, or low-value content is visible and repairable.

A knowledge base can be large and still unhealthy. Size is not health.


Default Workflow

Step 1: Confirm scope and safety

Before scanning, identify:

Target path:
Formats: markdown / wiki links / relative links
External URL check: yes/no
Generate fix script: yes/no
Auto-apply fixes: no by default
Exclude directories:
Estimated file count:

Safe default:

scan only → report only → generate fix plan → user reviews → user applies

Never delete, rename, rewrite, or auto-apply fixes without explicit confirmation.

Step 2: Build file and heading index

Index:

  • .md files
  • normalized filenames and aliases
  • headings / anchors
  • relative paths
  • wiki links such as [[note]] and [[note#heading]]
  • markdown links such as [text](path.md)

Exclude by default:

.git/
node_modules/
__pycache__/
.obsidian/
.trash/
dist/
build/

Step 3: Detect hollow or low-value notes

Flag likely hollow notes when they match one or more:

  • fewer than 200 characters
  • no heading
  • only TODO / placeholder text
  • image-heavy with very little explanation
  • template content not filled in
  • empty exported page from Notion/Logseq

Classify severity:

Severity Meaning Typical action
P0 Empty or pure placeholder delete, archive, or fill immediately
P1 Too thin to be useful expand with definition, context, examples
P2 Usable but weak improve structure or add links

Step 4: Detect broken links

Check:

  • wiki file links: [[filename]]
  • wiki heading links: [[filename#heading]]
  • local markdown links: [text](../path/file.md)
  • image/embed paths
  • optional external URLs, only with user confirmation because it can be slow/noisy

For each broken link, report:

source file
link text
target
link type
probable fix if a similar file exists

Step 5: Analyze content density and structure

Measure:

  • word/character count
  • heading depth and hierarchy
  • list/table/code-block usage
  • internal link count
  • external link count
  • last modified time
  • very long files that may need splitting
  • files with no inbound or outbound links

Suggested ranges:

Signal Healthy range Warning
Short note 300+ words or intentionally atomic \x3C200 characters
Long note still navigable with headings >3000 words without structure
Internal links at least 1-3 for durable notes zero links = possible orphan
Freshness depends on domain stale if >90 days and marked active

Step 6: Analyze knowledge graph health

Build a graph:

node = markdown file
edge = internal link

Report:

  • total nodes
  • total edges
  • orphan nodes
  • central nodes
  • weakly connected components
  • one-way links
  • fragmented topic clusters

A perfect graph is not required. The goal is to identify the highest-value repair points.

Step 7: Score health

Default scoring:

Dimension Weight Good state
Hollow note rate 25% few or no empty placeholders
Broken link rate 30% no broken internal links
Content density 25% most notes have useful substance and structure
Network connectivity 20% important notes are connected; few accidental orphans

Health score:

health = weighted score from 0 to 100

Use labels:

Score Label
90-100 Excellent
75-89 Healthy
60-74 Needs maintenance
40-59 Fragile
0-39 Critical

Step 8: Generate report and fix plan

Return a concise summary first. For large scans, provide a full report path.

Fix plans must be safe:

  • generate proposed changes
  • group by risk
  • include reason for each fix
  • require user review before applying destructive changes

Never silently delete or rewrite knowledge files.


Output Format

Use this format:

## Knowledge Health Summary
- Target:
- Files scanned:
- Health score:
- Label:
- Top risks:

## Findings
| Category | Count | Severity | Notes |
|---|---:|---|---|
| Hollow notes |  |  |  |
| Broken links |  |  |  |
| Orphan notes |  |  |  |
| Overlong notes |  |  |  |
| Stale active notes |  |  |  |

## Highest-Impact Fixes
1. P0:
2. P1:
3. P2:

## Safe Fix Plan
- Auto-safe fixes:
- Needs human review:
- Do not auto-apply:

## Artifacts
- Report:
- Fix script:
- Raw JSON:

For small knowledge bases, include concrete file examples. For large ones, include top 10 examples per category and write full details to a report file.


Safe Fix Policy

Classify fixes by risk:

Risk Examples Permission
Low generate report, list broken links, suggest links no extra confirmation
Medium create fix script, add missing backlinks in draft output ask before writing files
High delete notes, rename files, rewrite links globally, split files explicit confirmation required

Default behavior: report and propose, do not mutate.


Bundled Scripts

Use these when available:

  • scripts/health_check.py — core scanner for hollow files, broken links, density, and graph stats.
  • scripts/report_generator.py — HTML report generation.
  • scripts/auto_fix.py — fix-plan or repair-script generation.

Run scripts from the skill directory or pass absolute paths. If a script lacks CLI ergonomics, inspect it and adapt safely rather than guessing destructive behavior.


Example Commands

Basic scan:

python3 scripts/health_check.py /path/to/knowledge-base

Generate a report from scan results if supported:

python3 scripts/report_generator.py results.json --output health-report.html

Generate a fix plan, not auto-apply:

python3 scripts/auto_fix.py results.json --dry-run

If the bundled script does not support these exact flags, read the script first and use its actual interface.


Test Prompts

Use test-prompts.json for Darwin-style regression evaluation. Good test coverage should include:

  • small Markdown folder with broken links
  • Obsidian-style wiki links and missing headings
  • placeholder-heavy exported notes
  • a large graph with orphan clusters
  • request for safe fix plan without auto-apply

Anti-Patterns

Avoid:

  • equating more notes with better knowledge
  • deleting or rewriting files without confirmation
  • checking external URLs by default on large vaults
  • treating all orphan notes as bad; some are intentionally private/draft
  • creating huge reports with no prioritized next action
  • producing a repair script without explaining risk
  • ignoring non-English filenames and encodings

Quality Bar

A good knowledge health check must be:

  • safe: no destructive changes without confirmation
  • specific: names files and link targets
  • prioritized: P0/P1/P2, not a flat dump
  • actionable: includes exact repair suggestions
  • scalable: summarizes large vaults without flooding context
  • portable: works for Obsidian, Logseq, Notion exports, and plain Markdown

If the output only says “you have broken links” without showing where, why it matters, and what to do next, it failed.

Usage Guidance
This skill appears to do what it says: scan Markdown files, produce a prioritized health report, and generate safe fix plans. Before installing or running: (1) run it on a copy or backup of your knowledge base; (2) inspect generated fix scripts before executing—they include rm/sed operations and are executable by default; (3) do not grant the agent/platform permission to auto‑execute generated scripts without manual review; (4) note the skill outputs full file contents in reports—avoid sharing those outputs if they contain sensitive data; (5) test on a small dataset first to confirm behavior and platform compatibility (e.g., sed -i differences on macOS vs Linux). If you want higher assurance, request a quick code review of the truncated/unfinished portions (there is a small coding bug/truncation in health_check.py as provided) before relying on it at scale.
Capability Analysis
Type: OpenClaw Skill Name: knowledge-health-checker Version: 1.0.1 The knowledge-health-checker is a legitimate utility designed to audit and maintain Markdown-based knowledge bases like Obsidian or Logseq. The bundle includes scripts for scanning files (health_check.py), generating HTML reports (report_generator.py), and creating repair scripts (auto_fix.py). While the tool has the capability to suggest file deletions and modifications, the SKILL.md instructions and script logic emphasize safety, dry-runs, and mandatory user review. Internal documentation (code-review-report.md) demonstrates a proactive approach to security by addressing potential command injection and XSS vulnerabilities. No evidence of data exfiltration, unauthorized execution, or malicious prompt injection was found.
Capability Assessment
Purpose & Capability
Name/description match the included artifacts: three Python scripts implement scanning, HTML report generation, and fix‑script generation for Markdown knowledge bases. No unrelated credentials, binaries, or install steps are requested.
Instruction Scope
SKILL.md limits destructive actions (default: report only, require explicit confirmation before applying fixes). It also asks the agent to produce full reports and file contents for review. This is coherent but increases the risk that users will accidentally expose repository contents when sharing outputs—inspect outputs carefully before publishing.
Install Mechanism
No install spec; this is instruction‑plus-code. All code is bundled in the skill (no network installs/downloads). That reduces supply‑chain risk compared with remote fetches.
Credentials
The skill requests no environment variables or external credentials and only manipulates local files under the provided scan path, which is proportionate to its purpose.
Persistence & Privilege
always:false and no cross‑skill/system config changes. However the skill can generate executable scripts (shell + Python) that include rm and sed commands; if the agent/platform is permitted to execute those scripts automatically, that increases potential damage. The SKILL.md default forbids auto‑applying fixes, which mitigates this risk.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install knowledge-health-checker
  3. After installation, invoke the skill by name or use /knowledge-health-checker
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
# Changelog All notable changes to `knowledge-health-checker` will be documented in this file. ## [1.1.0] - 2026-04-25 ### Added - Added ClawHub-ready metadata and public-facing skill positioning. - Added a complete Markdown knowledge-base health audit workflow: - confirm scan scope and safety - build file and heading index - detect hollow or low-value notes - detect broken wiki and markdown links - analyze content density and structure - analyze knowledge graph connectivity - calculate a health score - generate a report and safe fix plan - Added support guidance for multiple knowledge-base formats: - Obsidian vaults - Logseq exports - Notion Markdown exports - documentation repositories - plain Markdown wiki folders - Added a 0-100 knowledge health scoring model. - Added health labels: - Excellent - Healthy - Needs maintenance - Fragile - Critical - Added a safe fix policy with three risk levels: - Low-risk reporting and suggestions - Medium-risk generated fix plans - High-risk destructive or global changes requiring explicit confirmation - Added a standard output format for: - health summary - findings - highest-impact fixes - safe fix plan - generated artifacts - Added Darwin-style regression prompts in `test-prompts.json`. ### Improved - Reframed the skill as a general-purpose Markdown knowledge-base health auditor instead of an internal-only checker. - Improved safety guidance so the default behavior is now: ```text scan only → report only → generate fix plan → user reviews → user applies Improved audit clarity by separating knowledge-base health into four dimensions: hollow note rate broken link rate content density network connectivity Improved scalability guidance for large knowledge bases. Improved compatibility guidance for non-English filenames and mixed Markdown conventions. Improved output usefulness by requiring prioritized P0 / P1 / P2 repair suggestions. Changed Reorganized the original long workflow into a cleaner ClawHub-friendly structure. Changed “automatic repair” guidance into a safer reviewed fix-plan workflow. Moved the skill from a local archived copy into the active skills directory: Copy /Users/mac/.openclaw/skills/knowledge-health-checker Verified Ran a smoke test using a temporary Markdown knowledge-base sample. Verified that the bundled scanner can process Markdown files and generate health output. Command used: Copy python3 scripts/health_check.py <tmp-sample> Result: Copy 扫描完成:发现 3 个Markdown文件 检查完成!健康分:27.0 Files Updated SKILL.md Added CHANGELOG.md Added test-prompts.json Verified scripts/health_check.py
Metadata
Slug knowledge-health-checker
Version 1.0.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is knowledge-health-checker?

Audit and improve Markdown knowledge-base health across Obsidian, Logseq, Notion exports, docs folders, and wiki repositories. Detect empty placeholder notes... It is an AI Agent Skill for Claude Code / OpenClaw, with 64 downloads so far.

How do I install knowledge-health-checker?

Run "/install knowledge-health-checker" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is knowledge-health-checker free?

Yes, knowledge-health-checker is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does knowledge-health-checker support?

knowledge-health-checker is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created knowledge-health-checker?

It is built and maintained by xb19960921 (@xb19960921); the current version is v1.0.1.

💬 Comments