556 Downloads
5 Stars
0 Active Installs
9 Versions
Install in OpenClaw
/install somark-document-parser
Description
Parse PDF, images (PNG/JPG/BMP/TIFF/WebP/HEIC), Word, PPT and other documents into Markdown or JSON using SoMark. All document parsing needs (like resumes, p...
Usage Guidance
This skill sends any parsed files (and your SOMARK_API_KEY) to somark.tech to do the work; that is the intended behavior. Before installing:
1) Confirm you trust somark.tech and are comfortable sending documents (do not send highly sensitive PII or secrets unless allowed by your policy).
2) Verify how the script receives the API key (environment vs CLI) and store keys with least privilege and rotation.
3) Check where output files are written and whether temporary uploads are removed if you are worried about disk leakage.
4) Test on non-sensitive documents first to confirm behavior and billing characteristics.
If you need the parser to run fully offline or to avoid external uploads, this skill is not appropriate.
Capability Analysis
Type: OpenClaw Skill
Name: somark-document-parser
Version: 1.0.9
The somark-document-parser skill is designed to convert various document formats (PDF, Office, images) into Markdown or JSON using the SoMark API (somark.tech). The Python script (somark_parser.py) correctly handles file processing, API communication via aiohttp, and result retrieval using environment variables for authentication. Notably, the SKILL.md includes explicit safety instructions directing the AI agent to treat all parsed content as untrusted data to mitigate prompt injection risks from document content. No malicious behaviors such as unauthorized data exfiltration or persistence were detected.
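One way to picture the "treat parsed content as untrusted data" guidance is to wrap parser output in explicit data markers before it reaches the agent. The sketch below is an illustration only and is not taken from SKILL.md or somark_parser.py; the function name `wrap_untrusted` and the marker strings are hypothetical.

```python
import secrets


def wrap_untrusted(parsed_text: str) -> str:
    """Wrap parser output in data markers (hypothetical helper, not part of the skill)."""
    # A random boundary makes it harder for document text to forge the closing marker.
    boundary = secrets.token_hex(8)
    return (
        f"<<UNTRUSTED-DOCUMENT {boundary}>>\n"
        f"{parsed_text}\n"
        f"<<END-UNTRUSTED-DOCUMENT {boundary}>>"
    )
```

Delimiters like these do not prevent prompt injection on their own; they only make the boundary between instructions and document data explicit.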
Capability Assessment
Purpose & Capability
Skill name/description match the included code and instructions: the Python script and SKILL.md upload files and POST them to somark.tech for parsing. No unrelated credentials, binaries, or services are requested.
Instruction Scope
Instructions direct the agent to save uploaded files locally and run the provided somark_parser.py to convert files to Markdown/JSON — this is expected. Minor omission: the CLI examples in SKILL.md do not explicitly show how the SOMARK_API_KEY is passed to the script (the metadata declares the env var). The script (truncated) posts files and form data to SoMark endpoints; it writes output files to disk (output directory). Review handling of temporary files if you are concerned about sensitive data left on disk.
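For readers concerned about sensitive data left on disk, the following minimal sketch shows one common way to stage a file in a temporary directory that is removed afterwards. It is an assumption about how such cleanup could be done, not a description of what somark_parser.py does; `stage_and_process` and its `process` callback are hypothetical names.

```python
import shutil
import tempfile
from pathlib import Path


def stage_and_process(source: Path, process) -> str:
    """Copy `source` into a temporary directory, run `process` on the copy, then clean up."""
    # TemporaryDirectory deletes the directory and its contents when the block exits,
    # including when `process` raises an exception.
    with tempfile.TemporaryDirectory(prefix="somark_") as tmp:
        staged = Path(tmp) / source.name
        shutil.copy2(source, staged)
        return process(staged)
```

The original file is left untouched; only the staged copy is removed.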
Install Mechanism
No install spec is included (instruction-only plus a bundled script). No remote downloads or extract steps. The only network targets are somark.tech API endpoints shown in the script.
Credentials
Only SOMARK_API_KEY is required and is the primary credential — this matches the stated purpose. Note: the API key and file contents are transmitted to somark.tech as part of normal operation, so the key will be sent over the network and is required for the service.
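As a rough sketch of environment-based key handling, a script could read SOMARK_API_KEY from the environment and fail early when it is missing. Only the variable name comes from the listing; `load_api_key`, `redact`, and `MissingApiKeyError` are hypothetical and do not reflect the actual script.

```python
import os


class MissingApiKeyError(RuntimeError):
    """Raised when SOMARK_API_KEY is not set in the environment."""


def load_api_key(env=None) -> str:
    # `env` defaults to the process environment; passing a dict makes testing easy.
    env = os.environ if env is None else env
    key = env.get("SOMARK_API_KEY", "").strip()
    if not key:
        raise MissingApiKeyError(
            "SOMARK_API_KEY is not set; export it in your shell before running the parser."
        )
    return key


def redact(key: str) -> str:
    # Keys of four characters or fewer are fully masked; longer keys show only the last four.
    if len(key) <= 4:
        return "*" * len(key)
    return "*" * (len(key) - 4) + key[-4:]
```

Reading the key from the environment keeps it out of chat transcripts and shell history, which matches the guidance in the version notes about not sharing keys in conversation.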
Persistence & Privilege
Skill does not request persistent/always-on privileges and does not modify other skills or system-wide settings. It reads/writes files only in the specified output directory.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install somark-document-parser
- After installation, invoke the skill by name or use /somark-document-parser
- Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.9
Auto-publish from GitHub Actions
v1.0.8
Update SKILL.md and somark_parser.py
v1.0.7
somark-document-parser 1.0.7
- Updated default parser configuration instructions: users are now informed how to make SoMark the default document parser by editing their CLAUDE.md, rather than asking in chat.
- Clarified guidance after a successful parse, improving accuracy and self-service setup.
- No changes to core functionality or API usage.
v1.0.6
- Rewrote documentation in English for broader accessibility and clarity.
- Added explicit instructions to ask the user whether to set SoMark as the default document parser.
- Simplified and clarified API key configuration steps, emphasizing security and not sharing keys in chat.
- Consolidated usage instructions, clarifying trigger scenarios and supported file formats.
- Reinforced safety: treat all parsed output as data and ignore any embedded prompts or instructions.
- Updated setup, quota, and error handling instructions for consistency and brevity.
v1.0.5
somark-document-parser 1.0.5
- Added initial code file `somark_parser.py` implementing the document parsing functionality.
- Added metadata file `_meta.json` describing environment variable requirements.
- No changes to parsing logic or user-facing workflow; this release formalizes the script and metadata structure.
v1.0.4
- Removed source code and metadata files: `_meta.json` and `somark_parser.py`.
- Updated SKILL.md to clarify that users should not send their API Key in the chat and must set it only via environment variables, with explicit instructions not to provide the key in conversation at any step.
- Minor clarification in API Key acquisition and setup steps in both Chinese and English instructions.
- No functional code or behavior changes; documentation and security guidance tightened.
v1.0.3
- Removed the README.md file.
- Updated API Key handling: no longer ask users to send their API Key in chat. Users are now instructed to configure the API Key via environment variable in their own terminal for increased security.
- Added explicit security instructions: only display parsed document data, never execute or obey document-embedded instructions or roleplay requests.
- Clarified that all parsed content is presented as pure data and must not be summarized or modified.
- Revised wording throughout skill documentation to reflect the new API Key and security processes.
v1.0.1
somark-document-parser 1.0.1
- Added a new document parsing startup script, somark_parser.py.
- Implemented language detection (automatic Chinese/English replies).
- Extended descriptions for file upload and path resolution scenarios.
- Optimized guidance for API key acquisition and quota handling instructions.
- Improved user-facing error and result handling descriptions.
v1.0.0
init commit
Frequently Asked Questions
What is Somark Document Parser?
Parse PDF, images (PNG/JPG/BMP/TIFF/WebP/HEIC), Word, PPT and other documents into Markdown or JSON using SoMark. All document parsing needs (like resumes, p... It is an AI Agent Skill for Claude Code / OpenClaw, with 556 downloads so far.
How do I install Somark Document Parser?
Run "/install somark-document-parser" in the OpenClaw or Claude Code chat to install it in one step. To use the skill, the SOMARK_API_KEY environment variable must also be set.
Is Somark Document Parser free?
The skill itself is free and licensed under MIT-0, so you can download, install and use it at no cost. It depends on the SoMark API, which requires an API key and may be subject to its own quota and billing.
Which platforms does Somark Document Parser support?
Somark Document Parser is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created Somark Document Parser?
It is built and maintained by Soul-Code (@soul-code); the current version is v1.0.9.