Genai Toolkit
/install genai-toolkit
Genai Toolkit v2.0.0 is a command-line toolkit for managing generative AI workflows. It logs configurations, benchmarks, prompts, evaluations, fine-tuning runs, cost tracking, and optimization notes. Each entry is timestamped and persisted locally. The tool works entirely offline: your data never leaves your machine.
Why Genai Toolkit?
- Works entirely offline — your data never leaves your machine
- Simple command-line interface with no GUI dependency
- Export to JSON, CSV, or plain text at any time for sharing or archival
- Automatic activity history logging across all commands
- Each domain command doubles as both a logger and a viewer
Commands
Domain Commands
Each domain command works in two modes. Log mode (with arguments) saves a timestamped entry; view mode (no arguments) shows the 20 most recent entries.
| Command | Description |
|---|---|
| `genai-toolkit configure <input>` | Log a configuration note such as model parameters, API keys, or environment settings. Use this to record setup changes and track which configurations were active during experiments. |
| `genai-toolkit benchmark <input>` | Log a benchmark result or performance observation. Record latency, throughput, accuracy, or other metrics to compare across runs and model versions. |
| `genai-toolkit compare <input>` | Log a comparison note between models, configurations, or approaches. Useful for side-by-side evaluations like GPT-4 vs Claude on specific tasks. |
| `genai-toolkit prompt <input>` | Log a prompt template or prompt engineering note. Track iterations on prompt design, record what worked, and document prompt versioning. |
| `genai-toolkit evaluate <input>` | Log an evaluation result or quality metric. Record accuracy scores, F1 metrics, human ratings, or any qualitative assessment of model outputs. |
| `genai-toolkit fine-tune <input>` | Log a fine-tuning run or hyperparameter note. Track epochs, learning rates, dataset sizes, and resulting model performance after fine-tuning. |
| `genai-toolkit analyze <input>` | Log an analysis observation or insight. Record patterns found in data, failure mode analysis, or trends across experiments. |
| `genai-toolkit cost <input>` | Log cost tracking data including API costs, compute expenses, and token consumption. Essential for budget monitoring across projects and providers. |
| `genai-toolkit usage <input>` | Log usage metrics or consumption data. Track request volumes, token counts, rate limit encounters, and daily/monthly consumption patterns. |
| `genai-toolkit optimize <input>` | Log optimization attempts or performance improvements. Record what was changed, the expected vs actual impact, and next steps. |
| `genai-toolkit test <input>` | Log test results or test case notes. Record pass/fail outcomes, edge cases discovered, and regression test results. |
| `genai-toolkit report <input>` | Log a report entry or summary finding. Capture weekly summaries, milestone reports, or executive-level findings from AI workflows. |
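The two modes described above can be pictured with a small stand-alone sketch. This is not the toolkit's actual source: the function name `domain_cmd`, the temporary `DEMO_DIR`, and the `timestamp|text` line format are all assumptions made for illustration.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of a dual-mode command; NOT genai-toolkit's real code.
# DEMO_DIR and the "timestamp|text" line format are assumptions.
DEMO_DIR="$(mktemp -d)"

domain_cmd() {
  local name="$1"
  shift
  local log_file="$DEMO_DIR/$name.log"
  if [ "$#" -gt 0 ]; then
    # Log mode: append one timestamped entry built from the remaining arguments.
    printf '%s|%s\n' "$(date '+%Y-%m-%d %H:%M:%S')" "$*" >> "$log_file"
  elif [ -f "$log_file" ]; then
    # View mode: print the 20 most recent entries.
    tail -n 20 "$log_file"
  fi
}

domain_cmd benchmark "latency: avg 1.2s"   # log mode: writes an entry
domain_cmd benchmark                       # view mode: prints what was logged
```

Keeping one append-only file per command is what makes the view mode cheap here: `tail` only needs the end of the file.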
Utility Commands
| Command | Description |
|---|---|
| `genai-toolkit stats` | Show summary statistics across all log files, including entry counts per category and total data size on disk. |
| `genai-toolkit export <fmt>` | Export all data to a file in the specified format. Supported formats: json, csv, txt. Output is saved to the data directory. |
| `genai-toolkit search <term>` | Search all log entries for a term using case-insensitive matching. Results are grouped by log category for easy scanning. |
| `genai-toolkit recent` | Show the 20 most recent entries from the unified activity log, giving a quick overview of recent work across all commands. |
| `genai-toolkit status` | Health check showing version, data directory path, total entry count, disk usage, and last activity timestamp. |
| `genai-toolkit help` | Show the built-in help message listing all available commands and usage information. |
| `genai-toolkit version` | Print the current version (v2.0.0). |
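To make "case-insensitive matching, grouped by log category" concrete, here is a rough sketch built on plain `grep`. It only illustrates the idea and may differ from what `genai-toolkit search` really does; `search_logs`, `SEARCH_DEMO_DIR`, and the sample log lines are invented for the example.

```shell
#!/usr/bin/env bash
# Illustrative only: a grouped, case-insensitive search over per-category logs.
# SEARCH_DEMO_DIR, search_logs, and the sample entries are assumptions.
SEARCH_DEMO_DIR="$(mktemp -d)"
printf '%s\n' '2024-03-01 10:00:00|GPT-4o Latency: avg 1.2s' > "$SEARCH_DEMO_DIR/benchmark.log"
printf '%s\n' '2024-03-02 11:00:00|March batch: 15k requests' > "$SEARCH_DEMO_DIR/cost.log"
printf '%s\n' '2024-03-03 12:00:00|Cut latency by caching prompts' > "$SEARCH_DEMO_DIR/optimize.log"

search_logs() {
  local term="$1" file
  for file in "$SEARCH_DEMO_DIR"/*.log; do
    [ -f "$file" ] || continue
    # -i ignores case; -F treats the term as a literal string, not a regex.
    if grep -qiF -- "$term" "$file"; then
      printf '[%s]\n' "$(basename "$file" .log)"
      grep -iF -- "$term" "$file"
    fi
  done
}

search_logs latency
```

Printing the category name once before its matches is a simple way to get grouped output without any post-processing.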
Data Storage
All data is stored locally at ~/.local/share/genai-toolkit/. Each domain command writes to its own log file (e.g., configure.log, benchmark.log). A unified history.log tracks all actions across commands. Use export to back up your data at any time.
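Because everything lives under one directory, archiving that directory is another way to take a backup alongside `export`. The sketch below assumes only the documented path; `backup_genai_data` and the `GENAI_DATA_DIR` override are hypothetical names, not features of the tool.

```shell
#!/usr/bin/env bash
# Hypothetical helper: archive the documented data directory with tar.
# GENAI_DATA_DIR is an assumed override for testing, not a toolkit setting.
backup_genai_data() {
  local data_dir="${GENAI_DATA_DIR:-$HOME/.local/share/genai-toolkit}"
  local dest_dir="${1:-$HOME}"
  local stamp
  stamp="$(date '+%Y%m%d-%H%M%S')"
  local archive="$dest_dir/genai-toolkit-backup-$stamp.tar.gz"
  if [ ! -d "$data_dir" ]; then
    echo "no data directory at $data_dir" >&2
    return 1
  fi
  # -C changes into the data directory so the archive stores relative paths.
  tar -czf "$archive" -C "$data_dir" . && printf '%s\n' "$archive"
}

# Usage: backup_genai_data /path/to/backup/folder
```

On success the function prints the archive path, so it can be captured in a variable or passed to another command.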
Requirements
- Bash (4.0+)
- No external dependencies — pure shell script
- No network access required
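A quick way to confirm the Bash requirement before installing is to compare the shell's reported version against 4.0. The helper below is a sketch, not something the toolkit ships: `version_at_least` is a hypothetical name, and it parses a version string shaped like the one Bash exposes in `BASH_VERSION`.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeed when a version string such as
# "5.1.16(1)-release" is at least the requested major.minor.
version_at_least() {
  local ver="$1" want_major="$2" want_minor="$3"
  local major="${ver%%.*}" minor rest
  case "$ver" in
    *.*) rest="${ver#*.}"; minor="${rest%%.*}" ;;
    *)   minor=0 ;;
  esac
  # Drop anything after the leading digits, e.g. "0(1)-release" becomes "0".
  major="${major%%[!0-9]*}"
  minor="${minor%%[!0-9]*}"
  if [ -z "$major" ] || [ -z "$minor" ]; then
    return 1
  fi
  [ "$major" -gt "$want_major" ] ||
    { [ "$major" -eq "$want_major" ] && [ "$minor" -ge "$want_minor" ]; }
}

if version_at_least "${BASH_VERSION:-}" 4 0; then
  echo "Bash ${BASH_VERSION} meets the 4.0+ requirement"
else
  echo "Bash 4.0 or newer is required" >&2
fi
```

This matters mostly on macOS, where the system `/bin/bash` has long been a 3.2 release.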
When to Use
- Tracking AI model benchmarks and comparisons across different providers and versions over time
- Logging prompt engineering iterations to understand what improvements actually moved the needle
- Monitoring API costs and token usage across multiple projects and billing periods
- Evaluating fine-tuning experiments with detailed hyperparameter and metric tracking
- Building a searchable knowledge base of optimization attempts and analysis insights
Examples
# Log a benchmark result
genai-toolkit benchmark "GPT-4o latency: avg 1.2s, p99 3.8s on summarization task, 500 samples"
# Track a cost entry
genai-toolkit cost 'March batch processing: $42.50 across 15k requests, avg $0.0028/req'
# Compare two models
genai-toolkit compare "Claude 3.5 vs GPT-4o on code generation — Claude 15% faster, GPT-4o 5% more accurate"
# Log a prompt iteration
genai-toolkit prompt "v3: Added chain-of-thought instruction, reduced hallucination rate from 12% to 3%"
# Record a fine-tuning run
genai-toolkit fine-tune "SQL-gen model epoch 5: accuracy=0.96, loss=0.12, lr=2e-5, dataset=50k rows"
# View all statistics
genai-toolkit stats
# Export everything to JSON
genai-toolkit export json
# Search for entries mentioning latency
genai-toolkit search latency
# Check recent activity
genai-toolkit recent
# Health check
genai-toolkit status
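Entries that contain a dollar sign need care with quoting. Inside double quotes Bash expands `$4` and `$0` as parameters, so an amount like `$42.50` would be altered before the toolkit ever sees it. Single quotes, or a backslash before the `$`, keep the text literal. A small demonstration using only the shell itself (the `entry` variable is just for illustration):

```shell
#!/usr/bin/env bash
# Single quotes keep "$" literal, so the amount reaches the command unchanged.
entry='March batch processing: $42.50 across 15k requests, avg $0.0028/req'
printf '%s\n' "$entry"

# Inside double quotes, escape each dollar sign with a backslash instead.
printf '%s\n' "avg \$0.0028/req"
```

The stored string can then be passed along as `genai-toolkit cost "$entry"`, since expanding a variable does not re-interpret the dollar signs inside its value.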
Powered by BytesAgain | bytesagain.com
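- Make sure OpenClaw is installed (locally or via Docker)
- Enter the install command in the chat box: /install genai-toolkit
- Once installation finishes, invoke the Skill by name or trigger it with /genai-toolkit
- Provide the inputs described in the Skill's parameter notes to get structured output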
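What is Genai Toolkit?
Bridge AI models to databases through MCP with config and evaluation tools. Use when setting up DB tools, comparing engines, or evaluating prompt quality. It is an AI Agent Skill plugin for Claude Code / OpenClaw and has been downloaded 147 times so far.
How do I install Genai Toolkit?
Run the command "/install genai-toolkit" in the OpenClaw or Claude Code chat box to install it in one step, with no extra configuration.
Is Genai Toolkit free?
Yes. Genai Toolkit is completely free and released under the MIT-0 license, so you can download, install, and use it freely.
Which platforms does Genai Toolkit support?
Genai Toolkit is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.
Who develops Genai Toolkit?
It is developed and maintained by bytesagain4 (@xueyetianya). Current version: v1.0.0.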