← Back to Skills Marketplace
107
Downloads
0
Stars
0
Active Installs
6
Versions
Install in OpenClaw
/install persona-model-trainer
Description
Fine-tune any HuggingFace instruction-tuned model (Gemma 4, Qwen 3, Llama, Phi, Mistral, and more) on persona data from anyone-skill. Produces a self-contain...
Usage Guidance
This skill largely does what it says (local persona fine-tuning) but you should not install blindly. Before running it: 1) Review training data for PII and remove/redact sensitive content (prepare_data.py helps but is not perfect). 2) Inspect scripts/train.py, scripts/export.py and any generated root-level wrappers for unexpected network calls or hardcoded remote endpoints. 3) Note the skill assumes external tools (Python 3.11+, torch/peft/bitsandbytes, ollama, llama.cpp convert script, vLLM, mlx-lm, Unsloth) — install them from official sources and audit any third-party wheels. 4) Be cautious with the optional autoresearch integration: it edits project scripts and runs training loops autonomously — run it only in an isolated environment and after reading .agents/skills/autoresearch/SKILL.md. 5) If you plan to publish or push adapter weights, obtain explicit consent from any people whose data was used and ensure you configure HuggingFace (or other) auth tokens securely. 6) If you see any 'ignore previous instructions' or system-override text in the SKILL.md or generated notebooks, treat it as suspicious and remove or sanitize before running. If you want, I can (a) list the exact locations of prompt-injection-like strings inside SKILL.md and scripts, or (b) produce a checklist of binaries and environment setup commands to run in a safe sandbox.
Capability Analysis
Type: OpenClaw Skill
Name: persona-model-trainer
Version: 0.3.3
The persona-model-trainer skill bundle is a legitimate and well-documented toolset for fine-tuning HuggingFace models on persona data. It provides a complete pipeline including data preparation (prepare_data.py), environment validation (check_env.py), multi-backend training (train.py), and model export (export.py). The bundle demonstrates strong security and privacy awareness by including explicit instructions in SKILL.md to treat training data as untrusted to prevent prompt injection, implementing a PII scanner in prepare_data.py, and providing comprehensive documentation on data handling (privacy.md). All high-risk capabilities, such as shell execution and network access for model uploads, are strictly aligned with the stated purpose of model training and deployment.
Capability Assessment
Purpose & Capability
The SKILL.md, reference docs, and included scripts (train.py, export.py, eval_probe.py, pipeline.sh, etc.) implement fine-tuning, evaluation, and export flows consistent with the skill description. However, the manifest declares no required binaries or environment variables even though the instructions expect tools like Python ≥3.11, Ollama, llama.cpp conversion scripts, vLLM, and optional HuggingFace pushes (which typically require HF credentials). The omission of required binaries/credentials is an inconsistency (likely intentional to keep things optional) but worth surfacing.
Instruction Scope
Instructions operate on local training data (training/ directory) as expected, but they also: (1) recommend pushing adapter weights and possibly training data to HuggingFace Hub (which entails credentials and external upload), (2) instruct integration with an external 'autoresearch' skill that will modify root-level train.py to run iterative hyperparameter search, and (3) add or overwrite files (root train.py, prepare.py, merged model directories, exported artifacts). The pre-scan detected prompt-injection patterns (e.g., 'system-prompt-override' style content) in SKILL.md; while some use of system prompts is expected for persona training, the presence of generic 'ignore-previous-instructions' / system-override patterns is a red flag because this skill directs the agent to modify and run code and to follow other skills' SKILL.md content — that combination increases the attack surface if a malicious autoresearch or external dependency is present.
Install Mechanism
No formal install spec (instruction-only) — lowest disk-write risk. The package includes many executable scripts and uses subprocess calls to local tools (ollama, llama.cpp converter, uv pip installs, vLLM launch scripts). There are no remote download URLs or opaque archives in the provided files; exports rely on local tools and Python packages. This is relatively low risk, but you must manually ensure the expected external binaries are installed from trustworthy sources.
Credentials
The skill declares no required environment variables or primary credentials, which is reasonable for a local training pipeline. However, some optional flows (pushing to HuggingFace via 'version.py push' or using cloud backends or third-party services) implicitly require credentials or configured CLI auth (HUGGINGFACE_TOKEN, ollama account, etc.) that are not declared. The absence of declared env requirements is not inherently malicious but is an omission you should be aware of before attempting 'push' or cloud upload steps.
Persistence & Privilege
always:false and no system config paths are requested — good. One area to note: the skill explicitly instructs using an autoresearch skill that will modify project-level scripts (root train.py wrapper) and may iterate by editing scripts/train.py hyperparameters. That grants the agent the ability to modify code in this skill's workspace and run it — acceptable for automated hyperparameter tuning but increases risk if you also grant the agent access to other skills or untrusted code. The skill does not request persistent global privileges or attempt to modify other skills' configuration files, but cross-skill code modification is present and should be treated cautiously.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install persona-model-trainer - After installation, invoke the skill by name or use
/persona-model-trainer - Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.3.3
MLX export format, HF Model Card + Dataset Card auto-generation, fix shell injection in pipeline.sh, add lora_alpha to training summary, encoding='utf-8' across all file I/O
v0.3.2
Fix Model Card showing '?' for LoRA rank and Train turns (field name mismatch lora_r vs lora_rank, samples vs train_samples); auto-create HF repo before upload_folder.
v0.3.1
Patch: fix 3 High bugs in export.py (broken Ollama Modelfile on GGUF failure, missing MLX format, merge-fail leakthrough); fix 5 Medium in pipeline.sh (path-with-spaces, encoding, archive nesting, formats inject); fix 1 High in train.py (Unsloth install URL) + add lora_alpha to all training summaries; clarify conversations.jsonl input format in docs.
v0.3.0
Auto-generate HuggingFace Model Card and Dataset Card on version push; fix 3 bugs in push flow (archive pollution, tag idempotency, empty hash display).
v0.2.1
fix: exclude agent client files from published package
v0.2.0
Evaluation layer (perplexity + probe), model versioning, Gemma 4 preset, pipeline orchestrator, Colab sync, 138 tests.
Metadata
Frequently Asked Questions
What is persona-model-trainer?
Fine-tune any HuggingFace instruction-tuned model (Gemma 4, Qwen 3, Llama, Phi, Mistral, and more) on persona data from anyone-skill. Produces a self-contain... It is an AI Agent Skill for Claude Code / OpenClaw, with 107 downloads so far.
How do I install persona-model-trainer?
Run "/install persona-model-trainer" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is persona-model-trainer free?
Yes, persona-model-trainer is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does persona-model-trainer support?
persona-model-trainer is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created persona-model-trainer?
It is built and maintained by acnlabs (@neiljo-gy); the current version is v0.3.3.
More Skills