← 返回 Skills 市场

Azure Content Understanding Layout

Name: Azure Content Understanding Layout
Author: zwcih

作者 zwcih · GitHub ↗ · v1.3.0 · MIT-0

cross-platform ✓ 安全检测通过

174

总下载

当前安装

版本数

在 OpenClaw 中安装

/install azure-content-layout

功能描述

Extract document structure, text, tables, and figures from documents using Azure Content Understanding prebuilt-layout analyzer. Converts PDF, images, Office...

使用说明 (SKILL.md)

Azure Content Understanding — Layout Analyzer

Extract structured content from documents using Azure's prebuilt-layout analyzer. Outputs Markdown and structured JSON with text, tables, figures, and document hierarchy.

Setup

Set environment variables:

export AZURE_CU_ENDPOINT="https://YOUR_RESOURCE.services.ai.azure.com/"
export AZURE_CU_API_KEY="YOUR_KEY_HERE"

Optional: set API version (defaults to 2025-05-01-preview):

export AZURE_CU_API_VERSION="2025-11-01"

Quick Usage

Analyze a URL and print Markdown

node scripts/analyze.mjs --url "https://example.com/document.pdf"

Analyze a local file (pipe via stdin)

cat invoice.pdf | node scripts/analyze.mjs --stdin --markdown output.md --output result.json

Save both Markdown and full JSON

node scripts/analyze.mjs --url "https://example.com/report.pdf" \
  --markdown report.md \
  --output report.json

Direct API Call

When the script isn't available, use curl:

# Submit analysis (preview API)
curl -s -X POST "$AZURE_CU_ENDPOINT/contentunderstanding/analyzers/prebuilt-layout:analyze?api-version=2025-05-01-preview" \
  -H "Ocp-Apim-Subscription-Key: $AZURE_CU_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"url":"https://example.com/doc.pdf"}'

# Response includes Operation-Location header — poll that URL for results

For GA API (2025-11-01), the body format changes:

{"inputs": [{"url": "https://example.com/doc.pdf"}]}

Output

Markdown

The analyzer produces GitHub Flavored Markdown preserving:

Headings (h1–h6)
Tables (as HTML \x3Ctable> blocks)
Selection marks (☒ checked, ☐ unchecked)
Figures (with references)
Paragraphs with reading order

Structured JSON

The full result includes detailed per-element data:

pages — dimensions, word/line counts per page
paragraphs — text blocks with bounding regions and semantic roles
tables — cells with row/column spans
figures — detected images/charts with bounding regions
sections — hierarchical document structure

Supported Formats

PDF, JPEG, PNG, BMP, TIFF, HEIF, DOCX, XLSX, PPTX, HTML

Best Practices

Async operation — the API returns 202; poll Operation-Location for results
Poll interval — 3 seconds is reasonable; results typically arrive in 5–60 seconds
Large documents — up to 2,000 pages supported; processing time scales linearly
File upload — use Content-Type: application/octet-stream with binary body
Tables — rendered as HTML in markdown for complex layouts (merged cells, etc.)

API Reference

See references/api.md for full request/response details.

安全使用建议

This appears to be a straightforward Azure document-layout helper. Before installing: 1) Only provide AZURE_CU_API_KEY and AZURE_CU_ENDPOINT that you trust — the key grants access to your Azure resource. Prefer a least-privilege or short-lived key if available. 2) Be aware that sending a document or a URL sends data to Azure (URL mode causes Azure to fetch the URL). Do not send sensitive documents unless you accept that. 3) The script runs locally with Node; review or run it on a trusted host. 4) The metadata marks AZURE_CU_API_VERSION as required but the script will default it — you can safely omit it. 5) If you need higher assurance, call the Azure API directly (curl or official SDK) or inspect/execute the included analyze.mjs locally rather than granting broad agent-level access.

功能分析

Type: OpenClaw Skill Name: azure-content-layout Version: 1.3.0 The skill provides a legitimate interface for Azure Content Understanding's Layout Analyzer, allowing users to extract structured content from documents. The Node.js script (scripts/analyze.mjs) correctly implements the asynchronous polling pattern required by the Azure API and handles both URL-based and local file inputs without any evidence of malicious behavior, data exfiltration, or prompt injection.

能力评估

✓ Purpose & Capability

Name/description, SKILL.md, and scripts/analyze.mjs all implement calls to Azure Content Understanding (prebuilt-layout) and only request the Azure endpoint, API key, and an API version. No unrelated services, binaries, or credentials are required.

ℹ Instruction Scope

Runtime instructions and the script only read stdin or a user-supplied URL and call the Azure analyzer. Important: when you supply a URL, the Azure service will fetch that URL (so the remote host and Azure will see it). The SKILL.md metadata lists AZURE_CU_API_VERSION as required but the code provides a default value — minor inconsistency.

✓ Install Mechanism

No install spec; this is an instruction-only skill plus a single Node script (analyze.mjs). Nothing is downloaded from external sites and no installers run — low install risk. The script requires a Node runtime to execute.

ℹ Credentials

Requiring AZURE_CU_ENDPOINT and AZURE_CU_API_KEY is proportional and expected. AZURE_CU_API_VERSION is declared required in metadata but the script defaults it if unset — the extra 'required' declaration is unnecessary but not harmful. PrimaryEnv correctly set to the API key.

✓ Persistence & Privilege

always:false and the skill does not request persistent system modifications or modify other skills' configs. Autonomous invocation is allowed but is the platform default and not in itself a concern here.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install azure-content-layout
安装完成后，直接呼叫该 Skill 的名称或使用 /azure-content-layout 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.3.0

Remove direct file read from script; use stdin pipe instead to avoid file-read+network-send pattern flagged by static analysis

v1.2.0

Declare AZURE_CU_API_VERSION in requires.env for full credential transparency

v1.1.0

Add metadata.openclaw.requires.env and primaryEnv declarations for AZURE_CU_ENDPOINT and AZURE_CU_API_KEY

v1.0.1

Fix: remove patterns that triggered false-positive suspicious content flag

v1.0.0

Initial release: document layout analysis with markdown and JSON output via Azure Content Understanding prebuilt-layout analyzer

元数据

Slug azure-content-layout

版本 1.3.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 5

常见问题