docx-md
/install docx-md
Word DOCX (OOXML) – docx-md
Overview
Three entry points: Read – output compact Markdown (default, token-efficient) or full JSON; Modify – apply AI-returned edits to the docx; Finalize – accept all revisions and remove all comments. Implemented via OOXML (ZIP + XML). No commercial Word libraries required.
Workflow
| Goal | Action |
|---|---|
| Get document for AI | Read: run read script → Markdown (default) or JSON. Markdown includes \x3C!-- b:N --> blockIndex markers for edit targeting. |
| Apply AI edits to docx | Modify: run apply script with docx + edits JSON → new docx with track changes and comments. |
| Deliver final version | Finalize: run finalize script → new docx with no revisions/comments. |
LLM-oriented pipeline
- Read – Parse docx; output Markdown (default) or JSON. Markdown uses
\x3C!-- b:N -->prefix per block; revisions:{+inserted+}{-deleted-}; comments:[comment: text]. - Send the output + task prompt to the model; require the model to output only the edit JSON:
blockIndex,originalContent,content,basis. - Modify – Script infers op from
blockIndex,originalContent,content,basis; converts to OOXML (w:ins/w:del/ comment anchors), then write back to Word. - Finalize – When the user confirms, run finalize to accept all revisions and remove all comments.
See references/llm-pipeline.md for the Markdown format, JSON schema, and edit format.
1. Read
- Parse
word/document.xml(w:bodyonly) andword/comments.xml. - Output Markdown (default) or JSON. Markdown is compact and token-efficient.
Script: scripts/read_docx.py
# Default: Markdown output (token-efficient)
python3 skills/docx-md/scripts/read_docx.py document.docx
python3 skills/docx-md/scripts/read_docx.py document.docx -o result.md
# JSON output (full structure)
python3 skills/docx-md/scripts/read_docx.py document.docx -f json -o result.json
Options:
-o,--output– Output path (default: stdout)-f,--format–md(default) orjson
2. Modify
- Input: docx path + edit JSON
{ modifications: [{ blockIndex, originalContent, content, basis }] }(sameblockIndexas read output). - Flow: Convert JSON to OOXML (
w:ins/w:del/ comments), then write back to Word.
Script: scripts/apply_edits_docx.py. Use - as edits file to read JSON from stdin.
python3 skills/docx-md/scripts/apply_edits_docx.py document.docx edits.json -o output.docx
python3 skills/docx-md/scripts/apply_edits_docx.py document.docx - -o output.docx # stdin
Options: --author (default: "Review")
3. Finalize
- Accept all revisions (flatten to final text), remove all comments. Save as new docx.
- Uses
docx-revisionsto accept revisions (preserves encoding), then removes comment markup via regex on raw bytes.
Script: scripts/finalize_docx.py
Requires: pip install docx-revisions (see requirements.txt)
python3 skills/docx-md/scripts/finalize_docx.py input.docx -o output.docx
Resources
scripts/
- read_docx.py – Read:
python3 scripts/read_docx.py document.docx [-o out.md] [-f md|json] - apply_edits_docx.py – Modify:
python3 scripts/apply_edits_docx.py document.docx edits.json -o output.docx - finalize_docx.py – Finalize:
python3 scripts/finalize_docx.py input.docx -o output.docx
references/
- ooxml.md – OOXML layout (document.xml, comments.xml, revisions, comments)
- llm-pipeline.md – Pipeline: read → Markdown/JSON → model edits → modify; defines Markdown format, JSON shape (blockIndex, originalContent, content, basis)
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install docx-md - After installation, invoke the skill by name or use
/docx-md - Provide required inputs per the skill's parameter spec and get structured output
What is docx-md?
Low-level docx format tool for AI document review. Three operations: (1) read docx → output compact Markdown or JSON; (2) apply edits JSON back to docx (trac... It is an AI Agent Skill for Claude Code / OpenClaw, with 663 downloads so far.
How do I install docx-md?
Run "/install docx-md" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is docx-md free?
Yes, docx-md is completely free (open-source). You can download, install and use it at no cost.
Which platforms does docx-md support?
docx-md is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created docx-md?
It is built and maintained by yanweiliang323868-del (@yanweiliang323868-del); the current version is v1.0.1.