PDF to Word Converter
A skill to extract text from scanned PDF documents and convert them into reusable Word (.docx) files using the free, local docr OCR engine.
Prerequisites
- Initialize the OCR engine by downloading the binaries:
  bash scripts/install.sh
- Install the required Python dependencies:
  pip install -r scripts/requirements.txt
Usage
Run the Python script, passing the input PDF file and the desired output .docx file path. Any additional standard docr arguments (such as the engine selection) can be appended after the two paths.
python scripts/pdf2word.py <input.pdf> <output.docx> [docr_args...]
Examples
Convert a single file with the default local engine:
python scripts/pdf2word.py sample.pdf sample_output.docx
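To convert a whole folder rather than a single file, the same command can be driven from a small wrapper. The sketch below is only an illustration: `build_command` and `convert_folder` are hypothetical helper names, and it assumes the script exits with a non-zero status when a conversion fails, which this README does not confirm.

```python
import subprocess
import sys
from pathlib import Path

# Script path taken from the usage line; adjust if the skill lives elsewhere.
SCRIPT = Path("scripts/pdf2word.py")


def build_command(pdf, out_dir, extra_args=()):
    """Build the argv list for one conversion: script, input, output, extra args."""
    pdf = Path(pdf)
    output = Path(out_dir) / (pdf.stem + ".docx")
    return [sys.executable, str(SCRIPT), str(pdf), str(output), *extra_args]


def convert_folder(in_dir, out_dir, extra_args=()):
    """Convert every *.pdf in in_dir and return the inputs whose run failed."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    failed = []
    for pdf in sorted(Path(in_dir).glob("*.pdf")):
        result = subprocess.run(build_command(pdf, out_dir, extra_args))
        # Assumption: a non-zero exit code signals a failed conversion.
        if result.returncode != 0:
            failed.append(pdf)
    return failed
```

Calling `convert_folder("scans", "converted")` would write one .docx per PDF into `converted/`, and any extra docr arguments can be passed through `extra_args`.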
Using Other API Engines
By default, the script uses the local RapidOCR engine. The underlying docr tool also supports other engines like the Google Gemini API for potentially higher recognition accuracy on complex layouts.
To use Gemini, first configure your API key:
mkdir -p ~/.ocr
echo "gemini_api_key=your_gemini_key" > ~/.ocr/config
Then pass the -engine gemini argument to the script:
python scripts/pdf2word.py sample.pdf sample_output.docx -engine gemini
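The key configuration step can also be done from Python, for example in a setup script. This is a minimal sketch, not part of the skill: `set_config_value` is a hypothetical helper, and it assumes the config file is a plain list of `key=value` lines (only the single `gemini_api_key` line is shown in this README). Unlike the `echo ... >` command, it keeps any other lines already in the file.

```python
from pathlib import Path


def set_config_value(config_path, key, value):
    """Write or update a key=value line in the config file and return its path."""
    path = Path(config_path).expanduser()
    path.parent.mkdir(parents=True, exist_ok=True)
    lines = path.read_text(encoding="utf-8").splitlines() if path.exists() else []
    # Drop any previous value for this key, keep unrelated lines untouched.
    kept = [line for line in lines if not line.startswith(key + "=")]
    kept.append(f"{key}={value}")
    path.write_text("\n".join(kept) + "\n", encoding="utf-8")
    # The file holds a secret, so restrict it to the current user where supported.
    path.chmod(0o600)
    return path
```

For the setup described here, the call would be `set_config_value("~/.ocr/config", "gemini_api_key", "your_gemini_key")`.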
If your document has tables, you can force Gemini to output them in Markdown format so the script can parse them into native Word tables:
python scripts/pdf2word.py sample.pdf sample_output.docx -engine gemini -prompt "Extract all text and preserve tables in Markdown format using | symbols."
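How the script itself turns Markdown tables into Word tables is not shown in this README. As a rough sketch of the idea, pipe-delimited lines can be split into rows and then written through python-docx's `add_table`. The function names below are hypothetical, and escaped pipes inside cells are not handled.

```python
def parse_markdown_table(lines):
    """Turn Markdown pipe-table lines into a list of rows (lists of cell strings)."""
    rows = []
    for line in lines:
        line = line.strip()
        if not line.startswith("|"):
            continue  # not part of a pipe table
        # Drop the outer pipes, then split the remainder into cells.
        inner = line[1:-1] if line.endswith("|") and len(line) > 1 else line[1:]
        cells = [cell.strip() for cell in inner.split("|")]
        # A row made only of dashes and colons is the header separator.
        if all(cell and set(cell) <= set("-: ") for cell in cells):
            continue
        rows.append(cells)
    return rows


def add_word_table(document, rows):
    """Append rows to a python-docx Document as a native table."""
    if not rows:
        return None
    n_cols = max(len(row) for row in rows)
    table = document.add_table(rows=len(rows), cols=n_cols)
    table.style = "Table Grid"  # built-in style in the default template
    for r, row in enumerate(rows):
        for c, text in enumerate(row):
            table.cell(r, c).text = text
    return table
```

With python-docx installed, `add_word_table(Document(), parse_markdown_table(text.splitlines()))` would produce the table, where `Document` comes from `from docx import Document`.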
How it Works
- The script calls docr, which uses the specified OCR model (RapidOCR by default) to read text from the scanned PDF.
- The extracted text is temporarily stored.
- The python-docx library is used to read the temporary text and construct a formatted Word document.
- Temporary files are cleaned up automatically.
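The steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not the script's actual code: the docr command line is not documented here, so the OCR step is passed in as a callable (`run_ocr`), and `convert`, `split_paragraphs`, and `write_docx` are hypothetical names. Only the python-docx calls (`Document`, `add_paragraph`, `save`) are real library API.

```python
import tempfile
from pathlib import Path


def split_paragraphs(text):
    """Split OCR text into paragraphs on blank lines, dropping empty chunks."""
    return [chunk.strip() for chunk in text.split("\n\n") if chunk.strip()]


def write_docx(text, docx_path):
    """Build a Word document with one paragraph per block of OCR text."""
    from docx import Document  # python-docx, imported lazily

    document = Document()
    for paragraph in split_paragraphs(text):
        document.add_paragraph(paragraph)
    document.save(str(docx_path))


def convert(pdf_path, docx_path, run_ocr, writer=write_docx):
    """Run OCR into a temporary text file, then build the .docx from it."""
    with tempfile.TemporaryDirectory() as tmp:
        text_file = Path(tmp) / "ocr_output.txt"
        # run_ocr(pdf, text_file) is supplied by the caller and must write
        # the recognized text to text_file (for example by invoking docr).
        run_ocr(Path(pdf_path), text_file)
        text = text_file.read_text(encoding="utf-8")
        writer(text, Path(docx_path))
    # Leaving the with-block deletes the temporary directory and its files.
```

Using `TemporaryDirectory` as a context manager means the intermediate text is removed even when the OCR step or the document writer raises an exception.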
- Make sure OpenClaw is installed (local or Docker).
- Run the install command in chat: /install pdf2word-skills
- After installation, invoke the skill by name or use /pdf2word-skills
- Provide the required inputs per the skill's parameter spec and get structured output.
What is Pdf2word Skills?
Pdf2word Skills converts scanned PDF documents into Word documents using a free, local OCR engine or a remote API. It is an AI Agent Skill for Claude Code / OpenClaw, with 203 downloads so far.
How do I install Pdf2word Skills?
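Run "/install pdf2word-skills" in the OpenClaw or Claude Code chat to install the skill in one step. The OCR binaries and Python dependencies listed under Prerequisites still need to be installed before the first conversion.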
Is Pdf2word Skills free?
Yes, Pdf2word Skills is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Pdf2word Skills support?
Pdf2word Skills is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created Pdf2word Skills?
It is built and maintained by scottkiss (@scottkiss); the current version is v1.0.0.