← 返回 Skills 市场
xueyetianya

Agent Learner

作者 bytesagain4 · GitHub ↗ · v2.0.2 · MIT-0
cross-platform ✓ 安全检测通过
421
总下载
0
收藏
1
当前安装
13
版本数
在 OpenClaw 中安装
/install agent-learner
功能描述
Benchmark and compare agent prompts and evaluation results. Use when tuning strategies, evaluating outputs, or comparing configurations.
使用说明 (SKILL.md)

Agent Learner

An AI toolkit for configuring, benchmarking, comparing, and optimizing agent prompts and evaluation results. Agent Learner provides persistent, file-based logging for each command category with timestamped entries, summary statistics, multi-format export, and full-text search across all records.

Commands

Command Description
configure Configure agent settings — log configuration entries or view recent ones
benchmark Benchmark agent performance — log benchmark results or view history
compare Compare agent outputs — log comparison data or view recent comparisons
prompt Prompt management — log prompt variations or view recent prompts
evaluate Evaluate agent outputs — log evaluation results or view history
fine-tune Fine-tune parameters — log fine-tuning sessions or view recent ones
analyze Analyze agent behavior — log analysis entries or view recent analyses
cost Cost tracking — log cost data or view recent cost entries
usage Usage monitoring — log usage metrics or view recent usage data
optimize Optimize configurations — log optimization runs or view history
test Test agent behavior — log test results or view recent tests
report Report generation — log report entries or view recent reports
stats Show summary statistics across all log categories (entry counts, data size, first entry date)
export \x3Cfmt> Export all data in json, csv, or txt format to the data directory
search \x3Cterm> Full-text search across all log files (case-insensitive)
recent Show the 20 most recent entries from the activity history log
status Health check — show version, data directory, total entries, disk usage, and last activity
help Show the full help message with all available commands
version Print the current version string

Each data command (configure, benchmark, compare, etc.) works in two modes:

  • Without arguments: displays the 20 most recent entries from that category
  • With arguments: saves the input as a new timestamped entry and reports the total count

Data Storage

All data is stored in plain text files under the data directory:

  • Category logs: $DATA_DIR/\x3Ccommand>.log — one file per command (e.g., configure.log, benchmark.log, prompt.log), each entry is timestamp|value
  • History log: $DATA_DIR/history.log — audit trail of every command executed with timestamps
  • Export files: $DATA_DIR/export.\x3Cfmt> — generated by the export command in json, csv, or txt format

Default data directory: ~/.local/share/agent-learner/

Requirements

  • Bash (with set -euo pipefail support)
  • Standard Unix utilities: grep, cat, date, echo, wc, du, head, tail, basename
  • No external dependencies or API keys required

When to Use

  1. Benchmarking agent performance — When you need to track and compare benchmark results across different agent configurations, models, or prompt strategies
  2. Prompt engineering iteration — When you're testing multiple prompt variations and want to log each version with results for later comparison
  3. Cost and usage tracking — When you need to monitor API costs and usage metrics over time to optimize spending
  4. Fine-tuning experiments — When running fine-tuning sessions and you want to log parameters, results, and observations for reproducibility
  5. Cross-category analysis — When you need to search across all logged data (benchmarks, prompts, evaluations, costs) to find patterns or specific entries

Examples

# Initialize and check status
agent-learner status

# Log a benchmark result
agent-learner benchmark "GPT-4o on MMLU: 88.7% accuracy, 1.2s avg latency"

# Log a prompt variation
agent-learner prompt "System: You are a helpful coding assistant. Always explain your reasoning step by step."

# Compare two configurations
agent-learner compare "GPT-4o vs Claude-3.5: GPT-4o 12% faster, Claude 5% more accurate on code tasks"

# Track costs
agent-learner cost "March batch: 12,450 tokens input, 3,200 tokens output, $0.47 total"

# View all recent benchmarks
agent-learner benchmark

# Search across all logs for a specific term
agent-learner search "accuracy"

# Export all data as JSON
agent-learner export json

# View summary statistics
agent-learner stats

# Show recent activity
agent-learner recent

Output

All commands return output to stdout. Export files are written to the data directory:

agent-learner export json   # → ~/.local/share/agent-learner/export.json
agent-learner export csv    # → ~/.local/share/agent-learner/export.csv
agent-learner export txt    # → ~/.local/share/agent-learner/export.txt

Every command execution is logged to $DATA_DIR/history.log for auditing purposes.


Powered by BytesAgain | bytesagain.com | [email protected]

安全使用建议
This skill is coherent and appears safe to use, but it persistently stores everything you send it (prompts, evaluation text, costs, etc.) as plain text under ~/.local/share/agent-learner. Do not enter API keys, secrets, or other sensitive data into the tool. Inspect the data directory and set appropriate file permissions or periodically delete logs if you need confidentiality. If you want extra assurance, review the included scripts before running or run the tool in a restricted environment (container or limited user account).
功能分析
Type: OpenClaw Skill Name: agent-learner Version: 2.0.2 The agent-learner skill is a local utility for logging and benchmarking agent performance metrics. It manages timestamped entries in a local data directory (~/.local/share/agent-learner/) and provides basic search, status, and export functionalities. Analysis of scripts/script.sh and SKILL.md reveals no network activity, data exfiltration, or unauthorized execution patterns; the code uses standard Unix utilities for simple file-based record keeping.
能力评估
Purpose & Capability
Name/description, SKILL.md, and the bundled bash script are consistent: the tool stores timestamped category logs, supports search, export, stats, and status. It only requires standard Unix utilities and writes data to a user-local directory (~/.local/share/agent-learner), which matches the stated purpose.
Instruction Scope
Instructions and the script operate only on files under the DATA_DIR and stdout; they do not call external networks or other system services. Important note: the skill persistently logs any input (prompts, evaluation text, costs, etc.) to disk in plain text, so sensitive prompts, credentials, or model outputs pasted into entries will be stored locally. Also there are minor formatting mismatches between SKILL.md (pipe-separated history entries) and the _log() format used for history.log, but this is an implementation detail, not a security problem.
Install Mechanism
No install spec; this is instruction-only plus an included shell script. Nothing is downloaded or installed from external URLs at runtime, and no archives are extracted. Risk from installation actions is minimal.
Credentials
The skill requests no environment variables, no credentials, and uses only HOME to build a per-user data directory. No unexpected secrets or external service tokens are requested.
Persistence & Privilege
The skill persistently writes logs and export files into the user's home (~/.local/share/agent-learner). It does not request elevated privileges, always:true is not set, and it does not modify other skills or system-wide configs. The main consideration is local data persistence of potentially sensitive content (see instruction_scope).
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install agent-learner
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /agent-learner 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v2.0.2
update
v2.0.1
SKILL.md rewritten to align with script.sh. All commands documented. Added Data Storage, Requirements, When to Use, Examples.
v2.0.0
v2.5 standard: Use-when desc, homepage, source, security fix
v2.3.6
old template -> domain-specific v2.0.0
v2.3.5
old template -> domain-specific v2.0.0
v2.3.4
Quality upgrade
v2.3.3
Quality upgrade: custom functionality
v2.3.2
De-template, unique content, script cleanup
v2.3.1
Quality improvement: better docs, examples, cleaner text
v2.3.0
Quality fixes: removed third-party references, aligned docs with implementation
v2.2.0
Enhanced descriptions for better AI triggering
v1.0.1
probe
v1.0.0
Initial release introducing Agent Learner for AI self-improvement. - Log mistakes and user corrections for continual learning. - Analyze error patterns and generate targeted improvement plans. - Self-test with quizzes based on previous mistakes. - Track progress and review performance over time. - Extract and output best practices for future reference. - Includes commands: mistake, correction, pattern, improve, quiz, report, reset, best-practices.
元数据
Slug agent-learner
版本 2.0.2
许可证 MIT-0
累计安装 2
当前安装数 1
历史版本数 13
常见问题

Agent Learner 是什么?

Benchmark and compare agent prompts and evaluation results. Use when tuning strategies, evaluating outputs, or comparing configurations. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 421 次。

如何安装 Agent Learner?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install agent-learner」即可一键安装,无需额外配置。

Agent Learner 是免费的吗?

是的,Agent Learner 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Agent Learner 支持哪些平台?

Agent Learner 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Agent Learner?

由 bytesagain4(@xueyetianya)开发并维护,当前版本 v2.0.2。

💬 留言讨论