Multimedia To Obsidian

Name: Multimedia To Obsidian
Author: aidescend
Version: v1.0.0

cross-platform ⚠ suspicious

Total downloads: 374

Install in OpenClaw

/install multimedia-to-obsidian

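Description

Imports any multimedia document into an Obsidian knowledge base. Supports PPT, PDF, DOCX, images, and other formats; automatically extracts every page or image, uses a multimodal model to understand the content, and stores the generated text descriptions in Obsidian. Suitable for: (1) organizing training courseware, (2) migrating notes to Obsidian, (3) turning image material into structured knowledge.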

Security Recommendations

This skill appears to implement its stated feature (extract images and call multimodal models), but there are mismatches and privacy concerns you should consider before installing:

1) Metadata omission: the registry claims no required binaries or env vars, but the SKILL.md and script require pandoc, poppler (pdftoppm), python-pptx, pillow, and likely LibreOffice ('soffice'); confirm and install these intentionally.
2) API keys: you must provide an API key for one of the providers; review which provider you trust. The script will upload raw image bytes (base64) to the chosen provider. Do NOT use this on sensitive documents unless you trust the destination.
3) Unknown default host: the default MINIMAX_API_HOST (https://api.minimaxi.com) is not a well-known vendor; verify this endpoint and its privacy/security policy before setting MINIMAX_API_KEY or leaving the default host.
4) Inspect and/or run in isolation: review the script (it is included) and consider running it on non-sensitive sample files or inside a disposable environment/container.
5) Fix metadata: if you plan to use it, request that the publisher update the registry metadata to list required binaries and env vars so users aren't surprised.

If you need higher assurance, prefer using a known provider (OpenAI/Anthropic) or a local/offline processing path for sensitive data.
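The prerequisites named in the recommendations can be checked before the skill is run. The following is a minimal sketch, not part of the skill: the function name `missing_prerequisites` is hypothetical, and the binary and package lists are taken from the review text above rather than from the skill's registry metadata. It relies on the fact that python-pptx is imported as `pptx` and pillow as `PIL`.

```python
import importlib.util
import shutil

# Binaries named in the review; the skill's real requirements may differ.
REQUIRED_BINARIES = ("pandoc", "pdftoppm", "soffice")
# Import name -> pip package name, for the Python packages named in the review.
REQUIRED_MODULES = {"pptx": "python-pptx", "PIL": "pillow"}


def missing_prerequisites(which=shutil.which, find_spec=importlib.util.find_spec):
    """Return the names of binaries and packages that cannot be found."""
    # shutil.which returns None when a binary is not on PATH.
    missing = [name for name in REQUIRED_BINARIES if which(name) is None]
    # find_spec returns None when a top-level module is not installed.
    missing += [pkg for module, pkg in REQUIRED_MODULES.items()
                if find_spec(module) is None]
    return missing
```

Calling `missing_prerequisites()` with no arguments inspects the current environment; an empty list means everything named in the review was found. The `which` and `find_spec` parameters exist only so the check can be exercised without touching the real system.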

Functional Analysis

Type: OpenClaw Skill
Name: multimedia-to-obsidian
Version: 1.0.0

The skill is a utility for importing multimedia files (PPT, PDF, DOCX, images) into Obsidian by extracting images and generating descriptions using multimodal AI models (MiniMax, OpenAI, or Anthropic). The implementation in `scripts/media_to_obsidian.py` uses standard system tools like Pandoc, Poppler, and LibreOffice via subprocess calls and handles API keys through environment variables as expected for its stated purpose. No evidence of malicious intent, unauthorized data exfiltration, or prompt injection was found.
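For readers unfamiliar with how a script drives Poppler through subprocess calls, here is a rough sketch of rendering PDF pages to PNG with `pdftoppm`. It is an illustration only and is not taken from `scripts/media_to_obsidian.py`; the function names, the `page` output prefix, and the 150 DPI default are assumptions.

```python
import subprocess
from pathlib import Path


def pdftoppm_command(pdf_path, out_prefix, dpi=150):
    """Build the argument list that renders each PDF page to a PNG file."""
    # -png selects PNG output, -r sets the resolution in DPI.
    return ["pdftoppm", "-png", "-r", str(dpi), str(pdf_path), str(out_prefix)]


def render_pages(pdf_path, out_dir, dpi=150):
    """Run pdftoppm and return the generated PNG paths, sorted by file name."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    # Passing a list (and no shell=True) keeps file names from being
    # interpreted by a shell.
    subprocess.run(pdftoppm_command(pdf_path, out_dir / "page", dpi), check=True)
    return sorted(out_dir.glob("page-*.png"))
```

`check=True` makes a failed conversion raise `subprocess.CalledProcessError` instead of silently producing no images.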

Capability Assessment

⚠ Purpose & Capability

The skill's description matches the code: it extracts pages/images, sends them to multimodal models, and writes Markdown into an Obsidian vault. However, the registry metadata lists no required binaries or env vars, while the SKILL.md and the script require pandoc, poppler (pdftoppm), python-pptx, and pillow, and the script also calls 'soffice'. This mismatch is inconsistent and could surprise users. The network/API access (MINIMAX/OPENAI/ANTHROPIC) is appropriate for the stated purpose, but the default MINIMAX_API_HOST is an unfamiliar domain.

ℹ Instruction Scope

Runtime instructions are within the task scope (install tools, set one of several model API keys, run the script). Important privacy-sensitive behavior: the script encodes image bytes as base64 and transmits them to third-party endpoints (MiniMax/OpenAI/Anthropic). That is expected for model calls but should be explicitly highlighted because sensitive documents will be uploaded. The SKILL.md omits that the script invokes 'soffice' and 'rm -rf' and does not warn about large uploads or rate/size limits.
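To make the upload behavior concrete, the sketch below shows what base64-encoding image bytes looks like and how a size guard could be placed in front of it. It is not the skill's code: the function names and the 5 MiB default limit are assumptions chosen for illustration, and real provider limits should be checked in each provider's documentation.

```python
import base64


def encode_image(image_bytes):
    """Return the base64 text that would be placed in a model request."""
    return base64.b64encode(image_bytes).decode("ascii")


def encode_with_limit(image_bytes, max_encoded_bytes=5 * 1024 * 1024):
    """Encode an image, refusing payloads above an assumed size limit."""
    encoded = encode_image(image_bytes)
    # Base64 output is about 4/3 the size of the input, so the check is
    # made on the encoded text, which is what actually gets transmitted.
    if len(encoded) > max_encoded_bytes:
        raise ValueError(
            f"encoded image is {len(encoded)} bytes, limit is {max_encoded_bytes}"
        )
    return encoded
```

Base64 is an encoding, not encryption: anyone who receives the payload can recover the original image bytes.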

ℹ Install Mechanism

There is no automated install spec (lowest risk). The SKILL.md instructs manual installs via brew and pip (pandoc, poppler, python-pptx, pillow) which are reasonable. The script additionally calls 'soffice' (LibreOffice) and 'pdftoppm' but 'soffice' is not listed in prerequisites — this omission is inconsistent and may lead to runtime failures. No downloads from unknown URLs are performed by an installer.

⚠ Credentials

The registry metadata declared no required environment variables, but both SKILL.md and the script require at least one API key (MINIMAX_API_KEY or OPENAI_API_KEY or ANTHROPIC_API_KEY). Requiring model provider API keys is proportionate to the task, but the default MINIMAX_API_HOST (https://api.minimaxi.com) is not a known major provider — sending base64 images to an unfamiliar host is a privacy/exfiltration risk. The number of env vars requested is reasonable, but the metadata omission is a red flag.
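One way to act on this concern is to choose the provider explicitly rather than fall back to a default host. The following sketch assumes a policy that is not part of the skill: it prefers Anthropic, then OpenAI, then MiniMax, and refuses to use MiniMax unless MINIMAX_API_HOST has been set deliberately. The function name `choose_provider` and the preference order are hypothetical; only the environment variable names come from the review.

```python
import os

# Provider name -> environment variable holding its API key (from the review).
PROVIDER_KEYS = {
    "anthropic": "ANTHROPIC_API_KEY",
    "openai": "OPENAI_API_KEY",
    "minimax": "MINIMAX_API_KEY",
}


def choose_provider(env=None, preferred=("anthropic", "openai", "minimax")):
    """Return the first preferred provider whose API key is set."""
    env = os.environ if env is None else env
    for name in preferred:
        if not env.get(PROVIDER_KEYS[name]):
            continue
        # Assumed policy: never rely on the script's default MiniMax host.
        if name == "minimax" and not env.get("MINIMAX_API_HOST"):
            raise RuntimeError("set MINIMAX_API_HOST explicitly before using MiniMax")
        return name
    raise RuntimeError("no provider API key is set")
```

For example, `choose_provider(env={"OPENAI_API_KEY": "k"})` returns `"openai"`, while an environment containing only MINIMAX_API_KEY raises until the host has been reviewed and set.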

✓ Persistence & Privilege

The skill is not always-enabled and is user-invocable. It writes Markdown files into the user-specified Obsidian directory and removes temporary files; it does not request system-wide configuration or modify other skills. No elevated persistence is requested.
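The write-then-clean-up pattern described here can be sketched without any 'rm -rf' call by using a temporary directory that removes itself. This is an illustration under assumptions, not the skill's implementation: the function names, the `## Page N` note layout, and the `describe_pages` callback are all hypothetical.

```python
import tempfile
from pathlib import Path


def write_note(vault_dir, category, title, descriptions):
    """Write one Markdown note with a section per page description."""
    note_dir = Path(vault_dir) / category
    note_dir.mkdir(parents=True, exist_ok=True)
    lines = [f"# {title}", ""]
    for index, text in enumerate(descriptions, start=1):
        lines += [f"## Page {index}", "", text, ""]
    note_path = note_dir / f"{title}.md"
    note_path.write_text("\n".join(lines), encoding="utf-8")
    return note_path


def import_document(vault_dir, category, title, describe_pages):
    """Produce descriptions in a scratch directory, then write the note."""
    # TemporaryDirectory deletes the scratch files when the block exits,
    # even if describe_pages raises.
    with tempfile.TemporaryDirectory() as scratch:
        descriptions = describe_pages(Path(scratch))
    return write_note(vault_dir, category, title, descriptions)
```

Here `describe_pages` stands in for whatever extracts images into the scratch directory and returns one description per page; only the finished Markdown file ends up inside the vault.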

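How to Use

Make sure OpenClaw is installed (locally or via Docker).
Enter the install command in the chat box: /install multimedia-to-obsidian
Once installation finishes, invoke the Skill by name or trigger it with /multimedia-to-obsidian
Provide the inputs described in the Skill's parameter documentation to receive structured output.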

Version History

v1.0.0

Initial release: Seamlessly import multimedia documents into Obsidian with automatic content understanding.
- Supports PPT, PDF, DOCX, and image (PNG, JPG) formats.
- Automatically extracts and analyzes each page or image using a multimodal model.
- Generates text descriptions and stores them in your chosen Obsidian vault/category.
- Multiple models supported: MiniMax, OpenAI, Anthropic.
- Simple command-line usage with flexible options for format, model, and category.

Metadata

Slug: multimedia-to-obsidian
Version: 1.0.0
License: not specified
Total installs: 1
Current installs: 1
Version count: 1

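FAQ

What is Multimedia To Obsidian?

It imports any multimedia document into an Obsidian knowledge base. It supports PPT, PDF, DOCX, images, and other formats; automatically extracts every page or image, uses a multimodal model to understand the content, and stores the generated text descriptions in Obsidian. Suitable for: (1) organizing training courseware, (2) migrating notes to Obsidian, (3) turning image material into structured knowledge. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 374 total downloads to date.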

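How do I install Multimedia To Obsidian?

Run the command "/install multimedia-to-obsidian" in the OpenClaw or Claude Code chat box to install it. The skill's own prerequisites (pandoc, poppler, python-pptx, pillow, and LibreOffice) and a model provider API key still have to be set up separately.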

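Is Multimedia To Obsidian free?

Yes. Multimedia To Obsidian is listed as free and open source, and can be freely downloaded, installed, and used. Note that the listing's metadata does not specify a license.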

Which platforms does Multimedia To Obsidian support?

Multimedia To Obsidian is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops Multimedia To Obsidian?

It is developed and maintained by aidescend (@aidescend); the current version is v1.0.0.