Description

Create and populate Annotated Research Contexts (ARCs) following the nfdi4plants ARC specification. Use when creating a new ARC, adding studies/assays/workflows/runs, annotating ISA metadata, organizing research data into ARC structure, or pushing ARCs to a DataHUB. Guides the user interactively through all required and optional metadata fields.

README (SKILL.md)

ARC Creator

Name: ARC Creator
Author: ingogiebel

Create FAIR Digital Objects following the nfdi4plants ARC specification v3.0.0.

Prerequisites

git and git-lfs installed
ARC Commander CLI at ~/bin/arc (optional but recommended)
For DataHUB sync: Personal Access Token for git.nfdi4plants.org or datahub.hhu.de

Interactive ARC Creation Workflow

Guide the user through these phases in order. Ask questions conversationally — don't dump all questions at once. Batch 2-4 related questions per message.

Phase 1: Investigation Setup

Ask the user:

Investigation identifier (short, lowercase-hyphenated, e.g. cold-stress-arabidopsis)
Title (concise name for the investigation)
Description (textual description of the research goals)
Where to store the ARC locally (suggest /home/uranus/arc-projects/\x3Cidentifier>/)

Then run scripts/create_arc.sh \x3Cpath> \x3Cidentifier> and set investigation metadata via:

arc investigation update -i "\x3Cid>" --title "\x3Ctitle>" --description "\x3Cdesc>"

Phase 2: Studies

For each study, ask:

Study identifier (e.g. plant-growth)
Title and description
Organism (for Characteristic [Organism])
Growth conditions (temperature, light, medium, etc.)
Source materials (what goes in — seeds, cell lines, etc.)
Sample materials (what comes out — leaves, roots, extracts, etc.)
Protocols — does the user have protocol documents to include?
Factors — what experimental variables are being tested? (e.g., temperature, genotype, treatment)

Create with:

arc study init --studyidentifier "\x3Cid>"
arc study update --studyidentifier "\x3Cid>" --title "\x3Ctitle>" --description "\x3Cdesc>"

Copy protocol files to studies/\x3Cid>/protocols/. Copy resource files to studies/\x3Cid>/resources/.

Phase 3: Assays

For each assay, ask:

Assay identifier (e.g. proteomics-ms, rnaseq, sugar-measurement)
Measurement type (e.g., protein expression profiling, transcription profiling, metabolite profiling)
Technology type (e.g., mass spectrometry, nucleotide sequencing, plate reader)
Technology platform (e.g., Illumina NovaSeq, Bruker timsTOF)
Data files — where are the raw data files? (will go into assays/\x3Cid>/dataset/)
Processed data — any processed output files?
Protocols — assay-specific protocols?
Performers — who performed this assay? (name, affiliation, role)

Create with:

arc assay init -a "\x3Cid>" --measurementtype "\x3Ctype>" --technologytype "\x3Ctech>"

Copy data to assays/\x3Cid>/dataset/, protocols to assays/\x3Cid>/protocols/.

Phase 4: Workflows (optional)

Ask if there are computational analysis steps. For each:

Workflow identifier (e.g. deseq2-analysis, heatmap-generation)
Description of what it does
Code files (scripts, notebooks)
Dependencies (Python packages, R libraries, Docker image)

Place code in workflows/\x3Cid>/. Note: workflow.cwl is REQUIRED by spec but often created later. Inform user.

Phase 5: Runs (optional)

Ask if there are computation outputs. For each:

Run identifier
Which workflow produced it
Output files (figures, tables, processed data)

Place outputs in runs/\x3Cid>/.

Phase 6: Contacts & Publications

Ask:

Investigation contacts (name, email, affiliation, role — at minimum the PI)
Publications (if any — DOI, PubMed ID, title, authors)

Add via:

arc investigation person register --lastname "\x3Clast>" --firstname "\x3Cfirst>" --email "\x3Cemail>" --affiliation "\x3Caff>"

Phase 7: Git Commit & DataHUB Sync

Configure git user:

git config user.name "\x3Cname>"
git config user.email "\x3Cemail>"

Commit:

git add -A
git commit -m "Initial ARC: \x3Cinvestigation title>"

Ask if the user wants to push to a DataHUB. If yes:
- Ask which host (git.nfdi4plants.org, datahub.hhu.de, etc.)
- Create remote repo (via browser or API)
- Set remote and push

ISA Metadata Reference

For detailed ISA-XLSX fields, annotation table columns, and ontology references, read references/arc-spec.md.

Key Reminders

Assay data is immutable — never modify files in assays/\x3Cid>/dataset/ after initial placement
Studies describe materials, assays describe measurements
Workflows are code, runs are outputs
Git LFS for files > 100 MB: git lfs track "*.fastq.gz" "*.bam" "*.raw"
Don't store ARCs on OneDrive/Dropbox — Git + cloud sync causes conflicts
ARC Commander CLI reference: arc \x3Csubcommand> --help

Usage Guidance

This skill appears to do what it says (create ARC repositories and guide metadata entry), but its metadata is incomplete: SKILL.md requires git, git-lfs, and optionally an ARC Commander CLI and a DataHUB personal access token, yet none of these are declared in the skill manifest. Before installing or running: 1) review and understand scripts/create_arc.sh (it will mkdir, cd into the target, run arc init if available or git init otherwise); 2) ensure git and git-lfs are installed and be prepared to supply DataHUB credentials if you choose to push; 3) be cautious when the agent asks to create or push a remote repository — only provide tokens or create remotes for hosts you trust; 4) the skill will run shell commands and modify files under the chosen path, so avoid running it with elevated privileges and verify the target path; 5) the recommendation to store projects under '/home/uranus/...' is just an example and should be changed to a path you control. If you want higher assurance, ask the developer to update the skill manifest to list required binaries and any env vars (e.g., DATAHUB_TOKEN) explicitly.

Capability Analysis

Type: OpenClaw Skill Name: arc-creator Version: 1.0.0 The skill is designed to manage research contexts, involving extensive file system operations (creating directories, copying files to user-specified paths) and network activity (Git pushes to a DataHUB). While these capabilities are necessary for its stated purpose, the `SKILL.md` instructs the agent to take user input for critical paths and command arguments (e.g., `ARC_PATH` in `scripts/create_arc.sh`, file copy destinations, `git config` values). This design creates a significant attack surface, as a malicious user could exploit the agent by providing crafted input to perform unintended file system manipulations or data exfiltration, even though the skill itself does not explicitly instruct the agent to act maliciously. The reliance on user input for sensitive operations without explicit sanitization instructions for the agent makes it suspicious.

Capability Assessment

ℹ Purpose & Capability

The name and description match the instructions and included script: this is an ARC creation helper that initializes a directory, creates ARC subdirectories, updates ARC metadata with an 'arc' CLI if present, and guides the user to add studies/assays/workflows and push to a DataHUB. However the skill metadata lists no required binaries or env vars while SKILL.md explicitly lists prerequisites (git, git-lfs, optional ARC Commander CLI and DataHUB Personal Access Token). That mismatch is an inconsistency (likely sloppy metadata) but not proof of malicious intent.

✓ Instruction Scope

The SKILL.md stays within the stated purpose: it interactively collects identifiers/metadata and instructs the agent to run a small create script, run 'arc' subcommands, copy files into repository folders, run git commands, and optionally push to a remote DataHUB. It does not instruct arbitrary system enumeration or to read unrelated files. It does recommend creating/pushing remotes (which requires credentials provided by the user) and references external hosts (git.nfdi4plants.org, datahub.hhu.de) in a way consistent with the described workflow.

✓ Install Mechanism

There is no install specification (instruction-only skill plus a small helper script). The included script (create_arc.sh, 877 bytes) is short, readable, and performs local filesystem and git initialization only. No downloads, extraction, or third-party package installs are performed by the skill itself.

⚠ Credentials

The skill metadata declares no required environment variables or primary credential, yet the SKILL.md references a 'Personal Access Token for git.nfdi4plants.org or datahub.hhu.de' for DataHUB sync and expects git/git-lfs and optionally an ARC Commander CLI binary. The absence of declared env vars/binaries in metadata is an omission that could lead an agent to attempt to use credentials or binaries from the environment without explicit requirements being visible to the user. This is a proportionality / transparency concern (not direct evidence of exfiltration).

✓ Persistence & Privilege

The skill does not request persistent or elevated privileges; always is false and it does not modify other skills or system-wide agent settings. The runtime actions are limited to creating directories, initializing git, and running local arc/git commands (subject to user consent).

Version History

v1.0.0

Initial release: interactive ARC creation following nfdi4plants spec v3.0.0

Metadata

Slug arc-creator

Version 1.0.0

License —

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is ARC Creator?

Create and populate Annotated Research Contexts (ARCs) following the nfdi4plants ARC specification. Use when creating a new ARC, adding studies/assays/workflows/runs, annotating ISA metadata, organizing research data into ARC structure, or pushing ARCs to a DataHUB. Guides the user interactively through all required and optional metadata fields. It is an AI Agent Skill for Claude Code / OpenClaw, with 1063 downloads so far.

How do I install ARC Creator?

Run "/install arc-creator" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is ARC Creator free?

Yes, ARC Creator is completely free (open-source). You can download, install and use it at no cost.

Which platforms does ARC Creator support?

ARC Creator is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created ARC Creator?

It is built and maintained by IngoGiebel (@ingogiebel); the current version is v1.0.0.

More Skills

ARC Creator

ARC Creator

Prerequisites

Interactive ARC Creation Workflow

Phase 1: Investigation Setup

Phase 2: Studies

Phase 3: Assays

Phase 4: Workflows (optional)

Phase 5: Runs (optional)

Phase 6: Contacts & Publications

Phase 7: Git Commit & DataHUB Sync

ISA Metadata Reference

Key Reminders

What is ARC Creator?

How do I install ARC Creator?

Is ARC Creator free?

Which platforms does ARC Creator support?

Who created ARC Creator?

💬 Comments