← Back to Skills Marketplace
DiePre Vision Cognition
by
KingOfZhao
· GitHub ↗
· v1.1.0
· MIT-0
99
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install diepre-vision-cognition
Description
DiePre 视觉认知 Skill —— 将包装/模切机器视觉感知与 SOUL 推理融合的认知框架
README (SKILL.md)
DiePre Vision Cognition Skill
元数据
| 字段 | 值 |
|---|---|
| 名称 | diepre-vision-cognition |
| 版本 | 1.0.0 |
| 作者 | KingOfZhao |
| 发布日期 | 2026-03-31 |
| 置信度 | 96% |
学术参考文献
本视觉框架的技术路线受以下前沿研究启发:
- Generating CAD Code with Vision-Language Models — VLM生成CAD代码+迭代验证(CADCodeVerify),直接升级照片→DXF管道
- From 2D CAD to 3D Parametric via VLM — 2D图纸→参数化3D,解决透视矫正和参数化问题
- Tool-Augmented VLLMs as Generic CAD Task Solvers (ICCV 2025) — VLLM+工具调用做通用CAD,封装OpenCV管道为可调用Skill
- Efficient Vision-Language-Action Models — VLA高效优化(低延迟+内存优化),适合本地部署
- Vlaser: Synergistic Embodied Reasoning — 具身推理VLA,未来"照片→动作决策"的理论基础
核心能力
将 DiePre(模切压痕)机器视觉感知与 SOUL 认知框架融合:
- 视觉已知/未知分离:从图像中提取确定特征(已知)与模糊区域(未知)
- 文件记忆:每次检测结果写入
vision_log/YYYY-MM-DD.jsonl - 四向视觉碰撞:正视角、反转、侧光、整体布局四个维度同时分析
- 人机闭环质检:AI 初判 → 人类复核 → 标注反馈 → 模型持续进化
- 置信度质检输出:低于 90% 置信度的缺陷自动升级为人工复核
安装命令
clawhub install diepre-vision-cognition
# 或手动安装
cp -r skills/diepre-vision-cognition ~/.openclaw/skills/
调用方式
from skills.diepre_vision_cognition import DiePrevisionCognition
vision = DiePrevisionCognition(workspace=".")
result = vision.analyze(
image_path="path/to/dieline.png",
context={"material": "corrugated", "thickness_mm": 3.0}
)
print(result.confidence) # 置信度
print(result.defects) # 检测到的缺陷列表
print(result.collision_log) # 四向分析详情
Usage Guidance
Do not install or enable this skill until the author provides the missing implementation and dependency details. Specific questions and checks to request before use: (1) Provide the Python module/package that implements DiePrevisionCognition (the registry package currently has only docs). (2) List required binaries, Python packages, model weights, and whether any external APIs or VLM/CAD services are called—if so, what credentials are required. (3) Show the code paths that write vision_log/ and perform model updates and confirm how image data is protected (is any image ever sent externally?). (4) Explain how the claimed "interception of outgoing HTTP requests" is implemented locally and provide tests proving no images are exfiltrated. Until you get concrete code and a security review, treat this skill as untrusted: run it in a sandbox, avoid using production or sensitive images, and require a code review for any component that writes or uploads images or modifies model weights.
Capability Assessment
Purpose & Capability
The skill claims a full-featured DiePre vision+SOUL cognition framework (VLM→CAD pipelines, model fine-tuning, four-way analysis) and documents a Python class API, yet the package in the registry is instruction-only with no code files, no required binaries, and no declared model weights or external service credentials. It is unclear how the described capabilities would be implemented or where required models/tools come from.
Instruction Scope
Runtime instructions require writing persistent logs (vision_log/YYYY-MM-DD.jsonl), performing four-view image analyses, supporting add_human_label, and performing local model updates. The SKILL.md and VERIFICATION_PROTOCOL also assert interception of outgoing HTTP requests containing images and forbidding external image exfiltration—but there is no implementation code to enforce these behaviors. The instructions are therefore underspecified and grant broad discretion (file I/O, local model modification, potential network activity) without clear boundaries.
Install Mechanism
No install spec is included (instruction-only), so nothing will be downloaded or written by an installer. That lowers supply-chain risk, but also means the documented Python API is not actually provided by the package as published.
Credentials
The skill requests no environment variables or credentials, yet claims to integrate VLMs, CAD pipelines and to perform model fine-tuning. If external models or services are required, credentials or dependency declarations are missing. Conversely, the skill expects filesystem access to create logs and model weight files—this local persistence is not reflected in declared requirements.
Persistence & Privilege
The skill is not flagged always:true and requests no special privileges, but its documented behavior includes persistent logging (vision_log/...), model weight checks/updates, and a heartbeat check that inspects model file integrity. These are normal for a vision skill, but because there is no code provided the actual persistence/privilege footprint is unknown and should be confirmed.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install diepre-vision-cognition - After installation, invoke the skill by name or use
/diepre-vision-cognition - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.1.0
新增5篇arXiv论文(VLM/CAD/VLA技术路线)
v1.0.0
初始发布 v1.0.0 —— DiePre视觉认知框架 by KingOfZhao
Metadata
Frequently Asked Questions
What is DiePre Vision Cognition?
DiePre 视觉认知 Skill —— 将包装/模切机器视觉感知与 SOUL 推理融合的认知框架. It is an AI Agent Skill for Claude Code / OpenClaw, with 99 downloads so far.
How do I install DiePre Vision Cognition?
Run "/install diepre-vision-cognition" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is DiePre Vision Cognition free?
Yes, DiePre Vision Cognition is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does DiePre Vision Cognition support?
DiePre Vision Cognition is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created DiePre Vision Cognition?
It is built and maintained by KingOfZhao (@kingofzhao); the current version is v1.1.0.
More Skills