← Back to Skills Marketplace
camscanner-ai

CamScanner-Pdf2Office

by CamScanner-AI · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
94
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install camscanner-pdf2office-office
Description
Use CamScanner to convert PDF documents to editable Word (.docx) or Excel (.xlsx) format, with intelligent content recognition and accurate format preservati...
README (SKILL.md)

CamScanner PDF to Office

Overview

CamScanner provides document conversion capabilities that convert PDF documents to Word or Excel documents while preserving original formatting. The workflow is a 3-step pipeline: upload the PDF, convert it, then download the result.

When to Use

  • User wants to convert a PDF to Word (.docx) or Excel (.xlsx)
  • User wants to make a PDF editable
  • User has a PDF and needs it as an Office document

Privacy & Data

Important: Privacy & Data Flow Notice

  • Third-party service: This skill sends your files to CamScanner's official servers (ai-tools.camscanner.com) for processing.
  • Data retention: CamScanner servers process your files in real-time. Files are not permanently stored on the server.
  • Local files: Output files are saved to your local filesystem at the path you specify.

API Reference

Base URL: https://ai-tools.camscanner.com

Supported Conversions

source_type target_type Output
pdf word .docx
pdf excel .xlsx

Step 1: Upload PDF

BASE="https://ai-tools.camscanner.com"

IN_FILE_ID=$(curl -sS -X POST "$BASE/v1/tools/upload_file/execute" \
  -H "Content-Type: application/octet-stream" \
  --data-binary "@/path/to/document.pdf" | jq -r '.tool_result.data.file_id')

Response:

{
  "code": 200,
  "tool": "upload_file",
  "tool_result": {
    "success": true,
    "data": {
      "file_id": "file_1741857600_ab12cd34ef56",
      "size": 24576
    }
  }
}

Step 2: Convert PDF

OUT_FILE_ID=$(curl -sS -X POST "$BASE/v1/tools/convert_pdf/execute" \
  -H "Content-Type: application/json" \
  -d "{\"file_id\":\"$IN_FILE_ID\",\"source_type\":\"pdf\",\"target_type\":\"TARGET\",\"output_mode\":\"file_id\"}" \
  | jq -r '.tool_result.data.file_id')

Replace TARGET with one of: word, excel.

Response:

{
  "code": 200,
  "tool": "convert_pdf",
  "tool_result": {
    "success": true,
    "data": {
      "file_id": "file_1741857722_ddeeff001122",
      "target_type": "word"
    }
  }
}

Step 3: Download Result

curl -sS -X POST "$BASE/v1/tools/download_file/execute?response_mode=raw" \
  -H "Content-Type: application/json" \
  -d "{\"file_id\":\"$OUT_FILE_ID\"}" \
  -o /path/to/output.docx

Critical: The response_mode=raw query parameter is required to get the binary file. Without it, the response is JSON.

Quick Reference: Complete Pipeline

BASE="https://ai-tools.camscanner.com"
INPUT_PDF="/path/to/document.pdf"
TARGET_TYPE="word"          # word | excel
OUTPUT_FILE="/path/to/output.docx"

# Upload
IN_FILE_ID=$(curl -sS -X POST "$BASE/v1/tools/upload_file/execute" \
  -H "Content-Type: application/octet-stream" \
  --data-binary "@$INPUT_PDF" | jq -r '.tool_result.data.file_id')

# Convert
OUT_FILE_ID=$(curl -sS -X POST "$BASE/v1/tools/convert_pdf/execute" \
  -H "Content-Type: application/json" \
  -d "{\"file_id\":\"$IN_FILE_ID\",\"source_type\":\"pdf\",\"target_type\":\"$TARGET_TYPE\",\"output_mode\":\"file_id\"}" \
  | jq -r '.tool_result.data.file_id')

# Download
curl -sS -X POST "$BASE/v1/tools/download_file/execute?response_mode=raw" \
  -H "Content-Type: application/json" \
  -d "{\"file_id\":\"$OUT_FILE_ID\"}" \
  -o "$OUTPUT_FILE"

File Extension Mapping

target_type Extension
word .docx
excel .xlsx

Common Mistakes

Mistake Fix
Forgetting response_mode=raw on download Always append ?response_mode=raw to the download URL
Wrong Content-Type on upload Upload uses application/octet-stream, not multipart/form-data
Using GET instead of POST All three endpoints use POST
Missing source_type in convert request Always include "source_type": "pdf"
Missing output_mode in convert request Always include "output_mode": "file_id" to get a downloadable file_id
Wrong output extension Match extension to target_type (see table above)

Error Handling

Check each step before proceeding:

# After upload
if [ -z "$IN_FILE_ID" ] || [ "$IN_FILE_ID" = "null" ]; then
  echo "Upload failed"; exit 1
fi

# After convert
if [ -z "$OUT_FILE_ID" ] || [ "$OUT_FILE_ID" = "null" ]; then
  echo "Conversion failed"; exit 1
fi
Usage Guidance
This skill uploads the PDF you specify to CamScanner's API (ai-tools.camscanner.com) and returns a converted file — that behavior is expected and consistent. Before using it, confirm you are comfortable sending the document to a third party (do not upload sensitive or regulated documents unless you have verified CamScanner's privacy/retention policies). Test with non-sensitive files first, ensure curl and jq are installed, and verify TLS/network access to the API endpoint. If you need offline conversion or cannot share data externally, do not use this skill.
Capability Analysis
Type: OpenClaw Skill Name: camscanner-pdf2office-office Version: 1.0.0 The skill provides a legitimate interface for converting PDF documents to Word or Excel formats using the CamScanner API (ai-tools.camscanner.com). The workflow is clearly documented in SKILL.md, including data privacy notices regarding the third-party service, and the shell commands (curl, jq) are standard for API interaction without any signs of obfuscation or malicious intent.
Capability Assessment
Purpose & Capability
The name/description (PDF → Word/Excel) matches the instructions and required binaries (curl, jq). No unrelated credentials or tools are requested.
Instruction Scope
Instructions explicitly upload a local PDF to ai-tools.camscanner.com, request conversion, and download the resulting file — this is exactly what the skill says it does. Note: it transmits user files to a third-party service (expected for this use-case).
Install Mechanism
Instruction-only skill with no install spec or code files; requires only common CLI tools (curl, jq). No downloads or archives are written to disk by the skill itself.
Credentials
No environment variables, credentials, or config paths are required. The lack of secrets is proportionate to the described functionality.
Persistence & Privilege
Skill is not marked always:true and does not request persistent system privileges or modify other skills. Autonomous invocation is allowed (platform default) but not combined with other red flags.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install camscanner-pdf2office-office
  3. After installation, invoke the skill by name or use /camscanner-pdf2office-office
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of the CamScanner PDF to Office skill. - Convert PDF files to editable Word (.docx) or Excel (.xlsx) documents with accurate format preservation. - Simple 3-step command-line workflow: upload, convert, and download files. - Requires curl and jq; processes documents via CamScanner’s official servers. - Comprehensive usage guide and API reference included for easy integration.
Metadata
Slug camscanner-pdf2office-office
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is CamScanner-Pdf2Office?

Use CamScanner to convert PDF documents to editable Word (.docx) or Excel (.xlsx) format, with intelligent content recognition and accurate format preservati... It is an AI Agent Skill for Claude Code / OpenClaw, with 94 downloads so far.

How do I install CamScanner-Pdf2Office?

Run "/install camscanner-pdf2office-office" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is CamScanner-Pdf2Office free?

Yes, CamScanner-Pdf2Office is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does CamScanner-Pdf2Office support?

CamScanner-Pdf2Office is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created CamScanner-Pdf2Office?

It is built and maintained by CamScanner-AI (@camscanner-ai); the current version is v1.0.0.

💬 Comments