← Back to Skills Marketplace
jacob-g-park

polaris-datainsight-doc-extract

by jacob-g-park · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
292
Downloads
1
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install polaris-datainsight-doc-extract
Description
Extract structured data from Office documents (DOCX, PPTX, XLSX, HWP, HWPX) using the Polaris AI DataInsight Doc Extract API. Use when the user wants to pars...
Usage Guidance
This skill appears to do what it says (send a document to Polaris DataInsight and parse the returned ZIP), but the registry metadata failing to declare the required POLARIS_DATAINSIGHT_API_KEY is a red flag you should resolve before installing. Actions to take before use: 1) Ask the publisher to update the registry metadata to list POLARIS_DATAINSIGHT_API_KEY as a required credential and to provide a homepage/source for verification. 2) Verify the API hostname (datainsight-api.polarisoffice.com) and the service terms/privacy policy on the official Polaris/Polaris Office site. 3) Only supply an API key you control; avoid putting long-lived/high-privilege credentials into shared environments. Prefer scoped or ephemeral keys if Polaris supports them. 4) Test with non-sensitive documents first to confirm where data is transmitted and how results are returned. 5) If you cannot confirm the publisher or metadata, treat the skill as untrusted and do not expose sensitive documents or secrets to it.
Capability Analysis
Type: OpenClaw Skill Name: polaris-datainsight-doc-extract Version: 1.0.0 The skill is designed to extract structured data from Office documents by uploading them to the Polaris AI DataInsight API at `https://datainsight-api.polarisoffice.com`. It retrieves an API key from the `POLARIS_DATAINSIGHT_API_KEY` environment variable, which is a standard practice for handling secrets. The Python code in `SKILL.md` demonstrates reading a local file, making an HTTPS POST request to the specified endpoint, and processing the ZIP response. There is no evidence of arbitrary command execution, unauthorized file system access beyond the input document, or data exfiltration to unrelated external endpoints. The instructions for the AI agent are aligned with the skill's stated purpose and do not contain malicious prompt injection attempts.
Capability Assessment
Purpose & Capability
The skill's stated purpose (extracting DOCX/PPTX/XLSX/HWP/HWPX via Polaris DataInsight) matches the runtime instructions (POST to datainsight-api.polarisoffice.com with x-po-di-apikey). However, the registry metadata lists no required environment variables or primary credential while the SKILL.md explicitly requires POLARIS_DATAINSIGHT_API_KEY — an incoherence between claimed requirements and runtime needs.
Instruction Scope
The SKILL.md stays within the stated purpose: it shows how to POST the file, parse the ZIP response, and return structured JSON. It does not instruct reading unrelated system files, harvesting other environment variables, or sending data to third parties besides the documented Polaris endpoints. It does instruct the agent to invoke the skill broadly when users mention document extraction, which is consistent with the skill's function.
Install Mechanism
This is instruction-only (no install spec, no code files). That minimizes install-time risk because nothing is downloaded or written by the skill itself.
Credentials
The SKILL.md requires an API key via the environment variable POLARIS_DATAINSIGHT_API_KEY and shows the x-po-di-apikey header, but the skill metadata did not declare any required env vars or a primary credential. Requesting a service API key is proportionate to the purpose, but the metadata omission is a mismatch that could mislead users about what secrets are needed and expected.
Persistence & Privilege
always is false and there are no install scripts or indications the skill will modify agent-wide settings or other skills. The skill does not request permanent platform privileges beyond normal autonomous invocation.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install polaris-datainsight-doc-extract
  3. After installation, invoke the skill by name or use /polaris-datainsight-doc-extract
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Polaris DataInsight Doc Extract 1.0.0 — Initial Release - Extracts structured data from DOCX, PPTX, XLSX, HWP, and HWPX files using the Polaris AI DataInsight API. - Supports extracting text, images, tables (with CSV/HTML/JSON), charts (with CSV), shapes, equations, headers, and footers from Office-format documents. - Returns a unified, page-based JSON schema for all elements, packaged in a ZIP file response. - Ready for parsing document structure, building RAG pipelines, or automating Office data extraction workflows. - Includes usage examples in Python and curl; outlines authentication and API key requirements. - Designed for easy integration and scales for data analysis, document parsing, and content conversion tasks.
Metadata
Slug polaris-datainsight-doc-extract
Version 1.0.0
License
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is polaris-datainsight-doc-extract?

Extract structured data from Office documents (DOCX, PPTX, XLSX, HWP, HWPX) using the Polaris AI DataInsight Doc Extract API. Use when the user wants to pars... It is an AI Agent Skill for Claude Code / OpenClaw, with 292 downloads so far.

How do I install polaris-datainsight-doc-extract?

Run "/install polaris-datainsight-doc-extract" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is polaris-datainsight-doc-extract free?

Yes, polaris-datainsight-doc-extract is completely free (open-source). You can download, install and use it at no cost.

Which platforms does polaris-datainsight-doc-extract support?

polaris-datainsight-doc-extract is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created polaris-datainsight-doc-extract?

It is built and maintained by jacob-g-park (@jacob-g-park); the current version is v1.0.0.

💬 Comments