AutoResearch Pipeline
/install autoresearch-pipeline
autoresearch — Nightly Research Pipeline
A zero-cost nightly research aggregator that rotates through 3 topic tracks, pulls from 4 independent sources, synthesises a structured markdown report, and prints a 3-line Telegram teaser to stdout.
Quick Start
# Dry run — fetches real data, prints teaser, no file writes
cd ~/.openclaw/workspace
uv run --with httpx python skills/autoresearch/scripts/run.py --dry-run
# Full run — writes to memory/ and advances state
uv run --with httpx python skills/autoresearch/scripts/run.py
# Force a specific track
uv run --with httpx python skills/autoresearch/scripts/run.py --track crypto
# Verbose output (debug logs to stderr)
uv run --with httpx python skills/autoresearch/scripts/run.py --dry-run --verbose
Tracks
The pipeline rotates through 3 tracks in order (persisted in state.json):
| Track | Display Name | Sources Focus |
|---|---|---|
ai |
AI & Agents | cs.AI/MA/CL/LG arXiv, Python/Rust/TS GitHub, LLM HN keywords |
crypto |
Crypto & DeFi | cs.CR/DC arXiv, Solidity/Rust GitHub, crypto HN keywords |
devtools |
Developer Tools | cs.SE/PL arXiv, Rust/Go/TS/Python GitHub, CLI/editor HN keywords |
Sources
| Source | API | Auth | Fallback |
|---|---|---|---|
| arXiv | Atom API (export.arxiv.org) |
None | Returns [] on error |
| GitHub Trending | Public HTML scrape | None | Returns [] on structure change |
| Hacker News | Firebase JSON API | None | Returns partial results |
| Web Search | Brave Search API | BRAVE_API_KEY env |
Skipped silently if no key |
Output Files
| File | Description |
|---|---|
memory/autoresearch-latest.md |
Overwritten each run — latest report |
memory/autoresearch-archive.md |
Append-only — all runs with date markers |
memory/autoresearch-errors.log |
Stderr from cron runs |
CLI Flags
| Flag | Default | Description |
|---|---|---|
--track ai|crypto|devtools |
Rotate | Override track rotation for this run |
--dry-run |
off | Fetch + synthesise but skip file writes and state advance |
--verbose |
off | Print DEBUG logs to stderr |
Exit Codes
| Code | Meaning |
|---|---|
0 |
Success |
1 |
All sources failed OR disk write failed |
2 |
Config/state error (config.json missing, bad --track value) |
Configuration
Edit config.json to customise per-track queries:
{
"tracks": {
"ai": {
"arxiv_categories": ["cs.AI", "cs.MA", "cs.CL", "cs.LG"],
"arxiv_keywords": ["agent", "LLM", ...],
"github_languages": ["python", "rust", "typescript"],
"github_topics": ["ai-agent", "llm", ...],
"hn_keywords": ["AI", "GPT", "Claude", ...],
"web_queries": ["AI agent framework news 2026", ...]
}
}
}
Cron Integration
# Add to OpenClaw cron: 1 AM Sydney (14:00 UTC previous day)
# The cron wrapper captures stdout and sends to Telegram
0 14 * * * cd ~/.openclaw/workspace && uv run --with httpx python skills/autoresearch/scripts/run.py 2>>~/.openclaw/workspace/memory/autoresearch-errors.log
The script prints a 3-line teaser to stdout:
🔬 **Nightly Research: AI & Agents**
• Top paper: Scaling Laws for Agent Reasoning… — We study how reasoning…
• Trending: microsoft/autogen ⭐342 | HN: Show HN: I built…
The cron agent captures stdout and sends it to Telegram via the message tool.
State
State is persisted in state.json:
{
"current_track_index": 1,
"last_run": "2026-03-15T14:02:31.123456+00:00",
"last_tracks": ["ai"]
}
State only advances on a successful run (exit 0). If all sources fail, state stays at the same track so tomorrow retries the same track.
Dependencies
- httpx — all HTTP (via
uv run --with httpx) - xml.etree.ElementTree — arXiv Atom XML parsing (stdlib)
- json, re, asyncio, argparse, pathlib — stdlib
No additional dependencies needed. No pyproject.toml required in the skill dir.
Integration with Book Draft
Other cron jobs or agents can read the latest report directly:
cat ~/.openclaw/workspace/memory/autoresearch-latest.md
Or in Python:
from pathlib import Path
report = Path.home() / ".openclaw/workspace/memory/autoresearch-latest.md"
content = report.read_text()
File Structure
skills/autoresearch/
├── SKILL.md # This file
├── PLAN.md # Architecture and spec
├── config.json # Track definitions + source config
├── state.json # Runtime state (auto-managed)
└── scripts/
├── run.py # CLI entrypoint (main pipeline)
├── sources.py # Data fetchers (arXiv, GitHub, HN, web)
├── synthesise.py # Report builder (markdown synthesis)
└── state.py # Track rotation state machine
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install autoresearch-pipeline - 安装完成后,直接呼叫该 Skill 的名称或使用
/autoresearch-pipeline触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
AutoResearch Pipeline 是什么?
Automates nightly research by rotating topics, fetching from arXiv, GitHub, Hacker News, Brave Search, and generating structured markdown reports with Telegr... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 104 次。
如何安装 AutoResearch Pipeline?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install autoresearch-pipeline」即可一键安装,无需额外配置。
AutoResearch Pipeline 是免费的吗?
是的,AutoResearch Pipeline 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
AutoResearch Pipeline 支持哪些平台?
AutoResearch Pipeline 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 AutoResearch Pipeline?
由 bowen31337(@bowen31337)开发并维护,当前版本 v1.0.0。