← 返回 Skills 市场
xueyetianya

Genai Toolkit

作者 bytesagain4 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
147
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install genai-toolkit
功能描述
Bridge AI models to databases through MCP with config and evaluation tools. Use when setting up DB tools, comparing engines, or evaluating prompt quality.
使用说明 (SKILL.md)

Genai Toolkit

Genai Toolkit v2.0.0 — an AI toolkit for managing generative AI workflows from the command line. Log configurations, benchmarks, prompts, evaluations, fine-tuning runs, cost tracking, and optimization notes. Each entry is timestamped and persisted locally. Works entirely offline — your data never leaves your machine.

Why Genai Toolkit?

  • Works entirely offline — your data never leaves your machine
  • Simple command-line interface with no GUI dependency
  • Export to JSON, CSV, or plain text at any time for sharing or archival
  • Automatic activity history logging across all commands
  • Each domain command doubles as both a logger and a viewer

Commands

Domain Commands

Each domain command works in two modes: log mode (with arguments) saves a timestamped entry, view mode (no arguments) shows the 20 most recent entries.

Command Description
genai-toolkit configure \x3Cinput> Log a configuration note such as model parameters, API keys, or environment settings. Use this to record setup changes and track which configurations were active during experiments.
genai-toolkit benchmark \x3Cinput> Log a benchmark result or performance observation. Record latency, throughput, accuracy, or other metrics to compare across runs and model versions.
genai-toolkit compare \x3Cinput> Log a comparison note between models, configurations, or approaches. Useful for side-by-side evaluations like GPT-4 vs Claude on specific tasks.
genai-toolkit prompt \x3Cinput> Log a prompt template or prompt engineering note. Track iterations on prompt design, record what worked, and document prompt versioning.
genai-toolkit evaluate \x3Cinput> Log an evaluation result or quality metric. Record accuracy scores, F1 metrics, human ratings, or any qualitative assessment of model outputs.
genai-toolkit fine-tune \x3Cinput> Log a fine-tuning run or hyperparameter note. Track epochs, learning rates, dataset sizes, and resulting model performance after fine-tuning.
genai-toolkit analyze \x3Cinput> Log an analysis observation or insight. Record patterns found in data, failure mode analysis, or trends across experiments.
genai-toolkit cost \x3Cinput> Log cost tracking data including API costs, compute expenses, and token consumption. Essential for budget monitoring across projects and providers.
genai-toolkit usage \x3Cinput> Log usage metrics or consumption data. Track request volumes, token counts, rate limit encounters, and daily/monthly consumption patterns.
genai-toolkit optimize \x3Cinput> Log optimization attempts or performance improvements. Record what was changed, the expected vs actual impact, and next steps.
genai-toolkit test \x3Cinput> Log test results or test case notes. Record pass/fail outcomes, edge cases discovered, and regression test results.
genai-toolkit report \x3Cinput> Log a report entry or summary finding. Capture weekly summaries, milestone reports, or executive-level findings from AI workflows.

Utility Commands

Command Description
genai-toolkit stats Show summary statistics across all log files, including entry counts per category and total data size on disk.
genai-toolkit export \x3Cfmt> Export all data to a file in the specified format. Supported formats: json, csv, txt. Output is saved to the data directory.
genai-toolkit search \x3Cterm> Search all log entries for a term using case-insensitive matching. Results are grouped by log category for easy scanning.
genai-toolkit recent Show the 20 most recent entries from the unified activity log, giving a quick overview of recent work across all commands.
genai-toolkit status Health check showing version, data directory path, total entry count, disk usage, and last activity timestamp.
genai-toolkit help Show the built-in help message listing all available commands and usage information.
genai-toolkit version Print the current version (v2.0.0).

Data Storage

All data is stored locally at ~/.local/share/genai-toolkit/. Each domain command writes to its own log file (e.g., configure.log, benchmark.log). A unified history.log tracks all actions across commands. Use export to back up your data at any time.

Requirements

  • Bash (4.0+)
  • No external dependencies — pure shell script
  • No network access required

When to Use

  • Tracking AI model benchmarks and comparisons across different providers and versions over time
  • Logging prompt engineering iterations to understand what improvements actually moved the needle
  • Monitoring API costs and token usage across multiple projects and billing periods
  • Evaluating fine-tuning experiments with detailed hyperparameter and metric tracking
  • Building a searchable knowledge base of optimization attempts and analysis insights

Examples

# Log a benchmark result
genai-toolkit benchmark "GPT-4o latency: avg 1.2s, p99 3.8s on summarization task, 500 samples"

# Track a cost entry
genai-toolkit cost "March batch processing: $42.50 across 15k requests, avg $0.0028/req"

# Compare two models
genai-toolkit compare "Claude 3.5 vs GPT-4o on code generation — Claude 15% faster, GPT-4o 5% more accurate"

# Log a prompt iteration
genai-toolkit prompt "v3: Added chain-of-thought instruction, reduced hallucination rate from 12% to 3%"

# Record a fine-tuning run
genai-toolkit fine-tune "SQL-gen model epoch 5: accuracy=0.96, loss=0.12, lr=2e-5, dataset=50k rows"

# View all statistics
genai-toolkit stats

# Export everything to JSON
genai-toolkit export json

# Search for entries mentioning latency
genai-toolkit search latency

# Check recent activity
genai-toolkit recent

# Health check
genai-toolkit status

Powered by BytesAgain | bytesagain.com | [email protected]

安全使用建议
This appears to be a simple local logging CLI. Before installing: (1) Review the script if you want to be certain it won’t be changed; (2) be aware it stores all entries in plaintext at ~/.local/share/genai-toolkit — do not log secrets (API keys, passwords) there; (3) consider setting restrictive permissions on that directory (chmod 700) or using an encrypted workspace if you must record sensitive data; (4) exports (json/csv/txt) are created as files and could be shared accidentally, so handle exported files carefully.
功能分析
Type: OpenClaw Skill Name: genai-toolkit Version: 1.0.0 The Genai Toolkit is a local-only Bash-based logging utility designed to track AI development metrics such as benchmarks, prompts, and costs. Analysis of scripts/script.sh and SKILL.md confirms the tool operates entirely offline, storing data in ~/.local/share/genai-toolkit/ without any network calls, obfuscation, or unauthorized execution patterns. While it allows users to log sensitive information like API keys for configuration tracking, this behavior is consistent with its stated purpose and lacks any indicators of malicious intent or data exfiltration.
能力评估
Purpose & Capability
Name/description promise is a CLI toolkit for logging and evaluating generative-AI experiments. The included shell script implements the described commands and stores data under ~/.local/share/genai-toolkit, so required capabilities match the stated purpose.
Instruction Scope
SKILL.md instructs only local logging, exporting, searching, and status checks. The script reads/writes only within the data directory and standard system utilities (date, grep, tail, wc, du). There are no instructions to read unrelated system files, environment secrets, or to transmit data externally.
Install Mechanism
No install spec is provided (instruction-only skill). A single bash script is included; it is a plain shell implementation and does not download or extract remote code. Risk from install mechanism is low.
Credentials
The skill requests no environment variables or credentials. It does use $HOME to create ~/.local/share/genai-toolkit (expected). SKILL.md suggests logging items like 'API keys or environment settings' — this is a usage note rather than an automatic collection, but users should avoid storing secrets in plaintext log files.
Persistence & Privilege
Skill is not marked always:true and does not modify other skills or system-wide settings. It persists only its own data under the user's home directory and does not request elevated privileges.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install genai-toolkit
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /genai-toolkit 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
publish v1.0.0
元数据
Slug genai-toolkit
版本 1.0.0
许可证 MIT-0
累计安装 1
当前安装数 1
历史版本数 1
常见问题

Genai Toolkit 是什么?

Bridge AI models to databases through MCP with config and evaluation tools. Use when setting up DB tools, comparing engines, or evaluating prompt quality. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 147 次。

如何安装 Genai Toolkit?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install genai-toolkit」即可一键安装,无需额外配置。

Genai Toolkit 是免费的吗?

是的,Genai Toolkit 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Genai Toolkit 支持哪些平台?

Genai Toolkit 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Genai Toolkit?

由 bytesagain4(@xueyetianya)开发并维护,当前版本 v1.0.0。

💬 留言讨论