← Back to Skills Marketplace
aipoch-ai

Data Management Plan Creator

by AIpoch · GitHub ↗ · v0.1.1 · MIT-0
cross-platform ⚠ suspicious
229
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install data-management-plan-creator
Description
Automatically generate NIH 2023-compliant Data Management and Sharing Plan (DMSP) drafts following FAIR principles
README (SKILL.md)

Data Management Plan (DMP) Creator

Automatically generate draft Data Management and Sharing Plans (DMSP) compliant with NIH 2023 policy requirements and FAIR principles.

Overview

This Skill generates comprehensive Data Management and Sharing Plans (DMSP) that meet NIH's 2023 Final Policy for Data Management and Sharing. The output follows FAIR principles (Findable, Accessible, Interoperable, Reusable) to ensure research data is properly managed and shared.

Requirements

  • Python 3.8+
  • No external dependencies required (uses standard library only)

Usage

Command Line

python scripts/main.py \
    --project-title "Your Research Project Title" \
    --pi-name "Principal Investigator Name" \
    --data-types "genomic,imaging,clinical" \
    --repository "GEO,Figshare" \
    --output dmsp_draft.md

Interactive Mode

python scripts/main.py --interactive

As a Module

from scripts.main import DMSPCreator

creator = DMSPCreator(
    project_title="Cancer Genomics Study",
    pi_name="Dr. Jane Smith",
    institution="National Cancer Institute",
    data_types=["genomic sequencing", "clinical metadata"],
    estimated_size_gb=500,
    repositories=["dbGaP", "GEO"],
    sharing_timeline="6 months after study completion"
)

dmsp = creator.generate_plan()
creator.save_to_file("dmsp_output.md")

Parameters

Parameter Type Default Required Description
--project-title string - Yes Title of the research project
--pi-name string - Yes Name of the Principal Investigator
--institution string - Yes Research institution or organization
--data-types string - Yes Comma-separated list of data types (e.g., "genomic,imaging,clinical")
--estimated-size float - No Estimated data size in GB
--repository string - Yes Comma-separated list of target repositories
--sharing-timeline string No later than the end of the award period No When data will be shared
--access-restrictions string - No Any access restrictions (e.g., "controlled-access for sensitive data")
--format-standards string - No Data format standards to be used
--output string dmsp_[timestamp].md No Output file path
--interactive flag - No Run in interactive mode

NIH DMSP Required Elements

The generated plan addresses all six required elements per NIH policy:

  1. Data Type - Types and estimated amount of scientific data
  2. Related Tools, Software and/or Code - Tools needed to access/manipulate data
  3. Standards - Standards for data/metadata to be applied
  4. Data Preservation, Access, and Associated Timelines - Repository selection and sharing timeline
  5. Access, Distribution, or Reuse Considerations - Factors affecting subsequent access
  6. Oversight of Data Management and Sharing - Plans for compliance monitoring

FAIR Principles Implementation

Findable

  • Persistent identifiers (DOIs)
  • Rich metadata with standard vocabularies
  • Registration in searchable repositories

Accessible

  • Standardized communication protocols
  • Metadata available even if data is no longer available
  • Access procedures clearly documented

Interoperable

  • Standard data formats
  • Standard terminologies and vocabularies
  • Qualified references to other data

Reusable

  • Detailed provenance information
  • Clear usage licenses
  • Domain-relevant community standards

Example Output

The generated DMSP includes:

  • Executive summary
  • NIH-compliant section headers
  • Specific language for data type descriptions
  • FAIR-aligned metadata standards
  • Repository recommendations
  • Timeline for data sharing
  • Access control procedures
  • Roles and responsibilities

References

License

MIT License - See project root for details.

Risk Assessment

Risk Indicator Assessment Level
Code Execution Python/R scripts executed locally Medium
Network Access No external API calls Low
File System Access Read input files, write output files Medium
Instruction Tampering Standard prompt guidelines Low
Data Exposure Output files saved to workspace Low

Security Checklist

  • No hardcoded credentials or API keys
  • No unauthorized file system access (../)
  • Output does not expose sensitive information
  • Prompt injection protections in place
  • Input file paths validated (no ../ traversal)
  • Output directory restricted to workspace
  • Script execution in sandboxed environment
  • Error messages sanitized (no stack traces exposed)
  • Dependencies audited

Prerequisites

# Python dependencies
pip install -r requirements.txt

Evaluation Criteria

Success Metrics

  • Successfully executes main functionality
  • Output meets quality standards
  • Handles edge cases gracefully
  • Performance is acceptable

Test Cases

  1. Basic Functionality: Standard input → Expected output
  2. Edge Case: Invalid input → Graceful error handling
  3. Performance: Large dataset → Acceptable processing time

Lifecycle Status

  • Current Stage: Draft
  • Next Review Date: 2026-03-06
  • Known Issues: None
  • Planned Improvements:
    • Performance optimization
    • Additional feature support
Usage Guidance
This skill appears to do what it says (generate NIH-compliant DMSP drafts) and does not request credentials or network access in the manifest, but a few conservative checks are recommended before installing or using it on sensitive data: - Inspect the full scripts/main.py file yourself (or ask the author) to confirm there are no hidden network calls, telemetry, or use of exec/subprocess. The provided file is large and was truncated in the review material; fully review it. - The SKILL.md claims 'standard library only' but also suggests running pip install -r requirements.txt; requirements.txt lists 'dataclasses' which is part of Python 3.8+. If you run pip, do so in a fresh virtualenv to avoid pulling unwanted packages. - Run the script in a sandbox or isolated environment first and test with dummy inputs to see what files it writes and where. Confirm it enforces safe path handling (no '../' traversal) and that output files are only written to the intended workspace. - Grep the code for network libraries (requests, urllib, socket) and for subprocess/exec usage. If any network or shell execution appears, require justification before use. - If you will feed real or sensitive data, ensure outputs do not inadvertently include identifying information; validate sanitization/de-identification steps. If you are not comfortable reviewing the code, consider asking the skill author for a short security statement or running the tool on synthetic data only.
Capability Analysis
Type: OpenClaw Skill Name: data-management-plan-creator Version: 0.1.1 The skill is a template generator for NIH Data Management Plans, but it is classified as suspicious due to a significant discrepancy between its documentation and implementation. While SKILL.md explicitly claims that 'Input file paths validated (no ../ traversal)' and 'Output directory restricted to workspace,' the actual implementation in scripts/main.py lacks any path sanitization or validation for the --output argument, allowing for potential path traversal and arbitrary file overwrites. The code lacks clear malicious intent but contains this unmitigated vulnerability despite documentation suggesting otherwise.
Capability Assessment
Purpose & Capability
Name/description (NIH DMSP, FAIR) align with the provided code and docs: the repository contains a DMSP template, README, SKILL.md, and a Python script that generates plan text. Requested capabilities (none: no env vars, no binaries) are appropriate for this task.
Instruction Scope
SKILL.md instructs running scripts/main.py (CLI, interactive, or as module), which is consistent with generating DMSPs. The documentation claims 'No external API calls' and 'uses standard library only', which appears plausible from visible imports; however SKILL.md also includes a pip install -r requirements.txt step (requirements.txt lists 'dataclasses'), an inconsistency worth noting. The script reads/writes workspace files (expected for this tool) but you should confirm there is no unexpected file access or network I/O in the parts of main.py not shown in the truncated excerpt.
Install Mechanism
There is no install specification in the registry (instruction-only), which is low-risk. SKILL.md still instructs 'pip install -r requirements.txt' even though it claims to use only the standard library; requirements.txt contains only 'dataclasses' (a stdlib type in Python 3.8+), so the pip step is unnecessary for the declared Python version — minor inconsistency but not itself malicious.
Credentials
The skill declares no required environment variables, no credentials, and no config paths. The code shown imports only standard modules (json, os, re, pathlib, etc.). This is proportionate to the described purpose.
Persistence & Privilege
Flags show always: false and normal model invocation behavior. The skill does not request permanent agent presence or elevated privileges in the registry metadata.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install data-management-plan-creator
  3. After installation, invoke the skill by name or use /data-management-plan-creator
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.1
- No changes detected in this version. - Version and contents remain the same as the previous release.
v0.1.0
Initial release: Automatically generates NIH 2023-compliant Data Management and Sharing Plan drafts following FAIR principles. - Command line, interactive, and module usage supported - No external dependencies; uses Python standard library - Covers all 6 required NIH DMSP elements - Includes FAIR (Findable, Accessible, Interoperable, Reusable) implementation - Outputs structured, ready-to-edit DMSP drafts with recommended repositories, compliance language, and sharing timelines
Metadata
Slug data-management-plan-creator
Version 0.1.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 2
Frequently Asked Questions

What is Data Management Plan Creator?

Automatically generate NIH 2023-compliant Data Management and Sharing Plan (DMSP) drafts following FAIR principles. It is an AI Agent Skill for Claude Code / OpenClaw, with 229 downloads so far.

How do I install Data Management Plan Creator?

Run "/install data-management-plan-creator" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Data Management Plan Creator free?

Yes, Data Management Plan Creator is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Data Management Plan Creator support?

Data Management Plan Creator is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Data Management Plan Creator?

It is built and maintained by AIpoch (@aipoch-ai); the current version is v0.1.1.

💬 Comments