← 返回 Skills 市场

persona-model-trainer

Name: persona-model-trainer
Author: neiljo-gy

作者 acnlabs · GitHub ↗ · v0.3.3 · MIT-0

cross-platform ⚠ suspicious

107

总下载

当前安装

版本数

在 OpenClaw 中安装

/install persona-model-trainer

功能描述

Fine-tune any HuggingFace instruction-tuned model (Gemma 4, Qwen 3, Llama, Phi, Mistral, and more) on persona data from anyone-skill. Produces a self-contain...

安全使用建议

This skill largely does what it says (local persona fine-tuning) but you should not install blindly. Before running it: 1) Review training data for PII and remove/redact sensitive content (prepare_data.py helps but is not perfect). 2) Inspect scripts/train.py, scripts/export.py and any generated root-level wrappers for unexpected network calls or hardcoded remote endpoints. 3) Note the skill assumes external tools (Python 3.11+, torch/peft/bitsandbytes, ollama, llama.cpp convert script, vLLM, mlx-lm, Unsloth) — install them from official sources and audit any third-party wheels. 4) Be cautious with the optional autoresearch integration: it edits project scripts and runs training loops autonomously — run it only in an isolated environment and after reading .agents/skills/autoresearch/SKILL.md. 5) If you plan to publish or push adapter weights, obtain explicit consent from any people whose data was used and ensure you configure HuggingFace (or other) auth tokens securely. 6) If you see any 'ignore previous instructions' or system-override text in the SKILL.md or generated notebooks, treat it as suspicious and remove or sanitize before running. If you want, I can (a) list the exact locations of prompt-injection-like strings inside SKILL.md and scripts, or (b) produce a checklist of binaries and environment setup commands to run in a safe sandbox.

功能分析

Type: OpenClaw Skill Name: persona-model-trainer Version: 0.3.3 The persona-model-trainer skill bundle is a legitimate and well-documented toolset for fine-tuning HuggingFace models on persona data. It provides a complete pipeline including data preparation (prepare_data.py), environment validation (check_env.py), multi-backend training (train.py), and model export (export.py). The bundle demonstrates strong security and privacy awareness by including explicit instructions in SKILL.md to treat training data as untrusted to prevent prompt injection, implementing a PII scanner in prepare_data.py, and providing comprehensive documentation on data handling (privacy.md). All high-risk capabilities, such as shell execution and network access for model uploads, are strictly aligned with the stated purpose of model training and deployment.

能力评估

ℹ Purpose & Capability

The SKILL.md, reference docs, and included scripts (train.py, export.py, eval_probe.py, pipeline.sh, etc.) implement fine-tuning, evaluation, and export flows consistent with the skill description. However, the manifest declares no required binaries or environment variables even though the instructions expect tools like Python ≥3.11, Ollama, llama.cpp conversion scripts, vLLM, and optional HuggingFace pushes (which typically require HF credentials). The omission of required binaries/credentials is an inconsistency (likely intentional to keep things optional) but worth surfacing.

⚠ Instruction Scope

Instructions operate on local training data (training/ directory) as expected, but they also: (1) recommend pushing adapter weights and possibly training data to HuggingFace Hub (which entails credentials and external upload), (2) instruct integration with an external 'autoresearch' skill that will modify root-level train.py to run iterative hyperparameter search, and (3) add or overwrite files (root train.py, prepare.py, merged model directories, exported artifacts). The pre-scan detected prompt-injection patterns (e.g., 'system-prompt-override' style content) in SKILL.md; while some use of system prompts is expected for persona training, the presence of generic 'ignore-previous-instructions' / system-override patterns is a red flag because this skill directs the agent to modify and run code and to follow other skills' SKILL.md content — that combination increases the attack surface if a malicious autoresearch or external dependency is present.

ℹ Install Mechanism

No formal install spec (instruction-only) — lowest disk-write risk. The package includes many executable scripts and uses subprocess calls to local tools (ollama, llama.cpp converter, uv pip installs, vLLM launch scripts). There are no remote download URLs or opaque archives in the provided files; exports rely on local tools and Python packages. This is relatively low risk, but you must manually ensure the expected external binaries are installed from trustworthy sources.

ℹ Credentials

The skill declares no required environment variables or primary credentials, which is reasonable for a local training pipeline. However, some optional flows (pushing to HuggingFace via 'version.py push' or using cloud backends or third-party services) implicitly require credentials or configured CLI auth (HUGGINGFACE_TOKEN, ollama account, etc.) that are not declared. The absence of declared env requirements is not inherently malicious but is an omission you should be aware of before attempting 'push' or cloud upload steps.

ℹ Persistence & Privilege

always:false and no system config paths are requested — good. One area to note: the skill explicitly instructs using an autoresearch skill that will modify project-level scripts (root train.py wrapper) and may iterate by editing scripts/train.py hyperparameters. That grants the agent the ability to modify code in this skill's workspace and run it — acceptable for automated hyperparameter tuning but increases risk if you also grant the agent access to other skills or untrusted code. The skill does not request persistent global privileges or attempt to modify other skills' configuration files, but cross-skill code modification is present and should be treated cautiously.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install persona-model-trainer
安装完成后，直接呼叫该 Skill 的名称或使用 /persona-model-trainer 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v0.3.3

MLX export format, HF Model Card + Dataset Card auto-generation, fix shell injection in pipeline.sh, add lora_alpha to training summary, encoding='utf-8' across all file I/O

v0.3.2

Fix Model Card showing '?' for LoRA rank and Train turns (field name mismatch lora_r vs lora_rank, samples vs train_samples); auto-create HF repo before upload_folder.

v0.3.1

Patch: fix 3 High bugs in export.py (broken Ollama Modelfile on GGUF failure, missing MLX format, merge-fail leakthrough); fix 5 Medium in pipeline.sh (path-with-spaces, encoding, archive nesting, formats inject); fix 1 High in train.py (Unsloth install URL) + add lora_alpha to all training summaries; clarify conversations.jsonl input format in docs.

v0.3.0

Auto-generate HuggingFace Model Card and Dataset Card on version push; fix 3 bugs in push flow (archive pollution, tag idempotency, empty hash display).

v0.2.1

fix: exclude agent client files from published package

v0.2.0

Evaluation layer (perplexity + probe), model versioning, Gemma 4 preset, pipeline orchestrator, Colab sync, 138 tests.

元数据

Slug persona-model-trainer

版本 0.3.3

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 6

常见问题

persona-model-trainer 是什么？

Fine-tune any HuggingFace instruction-tuned model (Gemma 4, Qwen 3, Llama, Phi, Mistral, and more) on persona data from anyone-skill. Produces a self-contain... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 107 次。

如何安装 persona-model-trainer？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install persona-model-trainer」即可一键安装，无需额外配置。

persona-model-trainer 是免费的吗？

是的，persona-model-trainer 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

persona-model-trainer 支持哪些平台？

persona-model-trainer 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 persona-model-trainer？

由 acnlabs（@neiljo-gy）开发并维护，当前版本 v0.3.3。