← Back to Skills Marketplace
Advanced ML Classification Skill
by
bamboo9805
· GitHub ↗
· v1.0.0
· MIT-0
298
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install advanced-ml-classification-skill
Description
自动化生成工业级机器学习分类算法代码、调用算法做预测、输出准确率对比和可视化结果,支持新手友好的结果解读。
README (SKILL.md)
AdvancedMLClassificationSkill
输入参数
data_path: str(必填)CSV 数据集路径target_col: str(必填)预测目标列名algorithms: list[str](可选)默认[ "逻辑回归", "决策树", "随机森林", "XGBoost", "LightGBM" ]test_size: float(可选)默认0.2random_state: int(可选)默认42
输出结构
accuracy_results: dict[str, float|None]interpretation: strgenerated_codes: dict[str, str]visualization_data: dict
关键流程
- 自动预处理(缺失值、类别编码、数值标准化)
- 按算法生成训练代码(优先
code-davinci-002,失败回退本地模板) - 执行算法代码并统计准确率(失败时返回具体错误)
- 可选交叉验证(
StratifiedKFold/KFold/RepeatedStratifiedKFold) - 可选参数搜索(
GridSearchCV/RandomizedSearchCV) - 生成置换特征重要性排序(默认对最佳算法)
- 生成新手友好中文解读(优先
gpt-3.5-turbo) - 输出可视化数据(柱状图/折线图)
运行示例
cd /Users/bamboo/skills/advanced-ml-classification-skill/scripts
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
python generate_complex_demo.py
python advanced_ml_skill.py --data-path ./demo_complex.csv --target-col target_label --enable-cv --enable-search
streamlit run app.py
Usage Guidance
What to consider before installing or running this skill:
- The skill can optionally use your OpenAI key to generate Python training functions (Codex) and to create natural-language interpretations. That key is sensitive; the manifest did not declare it. Expect prompts and possibly dataset metadata to be sent to OpenAI if you provide the key.
- The skill may execute Python code returned by the LLM. Generated code can be arbitrary and could read files, open network connections, or exfiltrate data. If you plan to run it, do so only in an isolated environment (container or VM) and with non-sensitive example data until you audit the generated code.
- If you do not want any data leaving your environment, do NOT provide an OpenAI key and do not enable features that call external LLMs; instead let the skill fall back to its bundled local templates (the code indicates it has local templates as a fallback).
- Before trusting results: inspect the generated code (the UI exposes generated_codes) and verify it only uses sklearn/XGBoost/LightGBM and does not perform I/O or network calls you don't expect.
- Prefer running the demo in a disposable virtualenv/VM, review requirements.txt before pip installing, and avoid running with production or sensitive datasets unless you have audited the runtime behavior and are comfortable with the LLM calls.
Confidence is medium because parts of advanced_ml_skill.py were truncated in the provided listing; reviewing the complete run()/code-generation/execution implementation would reduce uncertainty.
Capability Analysis
Type: OpenClaw Skill
Name: advanced-ml-classification-skill
Version: 1.0.0
The skill bundle implements an automated machine learning pipeline that dynamically generates and executes Python code using an LLM (Codex) via the `exec()` function in `advanced_ml_skill.py`. While this aligns with the stated goal of 'automated code generation,' it introduces a critical Remote Code Execution (RCE) vulnerability, as the agent executes unvalidated code produced by an external model. There is no evidence of intentional malice, data exfiltration, or backdoors, but the architectural reliance on executing untrusted generated strings makes the bundle high-risk.
Capability Assessment
Purpose & Capability
The name/description (automated ML classification benchmarking) matches the included code, example datasets, and UI. Requiring scikit-learn, xgboost, lightgbm, plotly and streamlit (via requirements.txt) is proportionate. However the skill optionally uses OpenAI/Codex to generate algorithm code and GPT to produce plain-language interpretation — this external LLM dependency is not declared in the registry metadata (no required env vars). Using an LLM to generate training code is a plausible design choice for this purpose, but it should be explicitly documented as a required/optional capability.
Instruction Scope
The runtime executes training functions that may be generated by an external model (code-davinci-002 / Codex) and then runs that generated Python. Executing arbitrary generated Python at runtime expands scope beyond purely local, deterministic ML steps and can lead to arbitrary file system and network access, plus potential exfiltration of dataset contents. SKILL.md explicitly instructs use of Codex and GPT for code/interpretation; the app UI also accepts an OpenAI API key. There is no explicit guidance or sandboxing for inspecting or restricting generated code before execution.
Install Mechanism
There is no remote download/install spec in the skill definition; all code files are bundled and the provided shell helpers create a local venv and pip-install the packages listed in requirements.txt (packages from PyPI). No external arbitrary archive downloads or unknown URLs are used. This is low-to-moderate install risk typical for Python projects.
Credentials
The code reads an OpenAI API key (OPENAI_API_KEY env or user-supplied via the Streamlit UI) to call OpenAI (Codex/GPT). The skill metadata declared no required environment variables or primary credential; that's a mismatch. An OpenAI key is a sensitive credential and should be declared as optional in metadata so users understand the credential will be used and (potentially) sent dataset-related prompts.
Persistence & Privilege
The skill is not always-enabled and does not request system-wide permissions in the manifest. It does not appear to modify other skills or global agent settings. The main privilege concern is runtime: executing externally generated code, but this is not a persistence/privilege escalation issue in the skill metadata itself.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install advanced-ml-classification-skill - After installation, invoke the skill by name or use
/advanced-ml-classification-skill - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release
Metadata
Frequently Asked Questions
What is Advanced ML Classification Skill?
自动化生成工业级机器学习分类算法代码、调用算法做预测、输出准确率对比和可视化结果,支持新手友好的结果解读。 It is an AI Agent Skill for Claude Code / OpenClaw, with 298 downloads so far.
How do I install Advanced ML Classification Skill?
Run "/install advanced-ml-classification-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Advanced ML Classification Skill free?
Yes, Advanced ML Classification Skill is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Advanced ML Classification Skill support?
Advanced ML Classification Skill is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Advanced ML Classification Skill?
It is built and maintained by bamboo9805 (@bamboo9805); the current version is v1.0.0.
More Skills