Description

Daily AI news curation — learns interests from your profile, searches the web, delivers structured summaries and daily briefs. Use when: 'set up daily news',...

README (SKILL.md)

Daily Content Curator

Name: Eir Daily Content Curator
Author: heyeir

Curates personalized content based on your interests. Supports two modes:

Standalone — works locally, no external account needed
Eir — full curation + delivery via heyeir.com

Getting Started

Before setup, ask the user which mode to use:

Mode	What it does	Requirements
Standalone	Search → curate → generate summaries locally	Search API key (Brave, Tavily, etc.)
Eir	Full pipeline with delivery to Eir app	Eir account + pairing code

Important: The two modes use different content formats and topic slug conventions. Choose the correct mode at setup time — switching later requires reconfiguration. If the user has an Eir account, use Eir mode.

Then follow the corresponding setup section below.

Standalone Mode

Flow

1. Configure          → Set up search API + interests (one-time)
2. Search             → Search API queries for each interest topic
3. Select + Crawl     → Agent picks best candidates, fetches full content
4. Generate           → Agent writes structured summaries from task files
5. Daily Brief        → Agent compiles brief from generated items

Steps 1-3 are Python scripts you run directly. Steps 4-5 are agent-driven — you tell your OpenClaw agent to read the task files and generate content. The agent uses whatever LLM model is configured in your OpenClaw session (e.g. Claude, GPT-4, Gemini).

Quick Start

1. Initialize workspace — creates config/ directory and default settings:

# Option A: inline JSON
python3 scripts/setup.py --init --settings '{
  "mode": "standalone",
  "language": "en",
  "search": {
    "search_base_url": "https://api.search.brave.com/res/v1",
    "search_api_key": "YOUR_BRAVE_API_KEY"
  }
}'

# Option B: settings file (recommended for PowerShell/Windows)
python3 scripts/setup.py --init --settings-file path/to/settings.json


Search provider examples:
| Provider | `search_base_url` | Get API key |
|----------|-------------------|-------------|
| Brave Search | `https://api.search.brave.com/res/v1` | [brave.com/search/api](https://brave.com/search/api/) |
| Tavily | `https://api.tavily.com` | [tavily.com](https://tavily.com/) |

> **Want richer results?** Install [SearXNG](https://docs.searxng.org/) and/or [Crawl4AI](https://github.com/unclecode/crawl4ai) locally. Add `searxng_url` and `crawl4ai_url` to your search config — they work as fallback or primary search/crawl providers.

**2. Set up interests** — edit the generated `config/interests.json`:
```json
{
  "topics": [
    {"label": "AI Agents", "keywords": ["autonomous agents", "tool use"], "freshness": "7d"},
    {"label": "Prompt Engineering", "keywords": ["prompting", "chain-of-thought"]}
  ],
  "language": "en",
  "max_items_per_day": 8
}

Interests can also be auto-extracted — see references/interest-extraction-prompt.md.

Freshness and tier: Each topic supports optional freshness (e.g. "3d", "7d", "14d") and tier ("focus", "tracked", "explore", "seed") fields in interests.json. In Eir mode, these come from the API directives. In standalone mode, defaults are "7d" and "tracked". The search pipeline uses tier to decide search depth (focus/tracked get entity refinement) and freshness to filter stale results.

3. Run the search + crawl pipeline (from the scripts/ directory):

cd scripts
python3 -m pipeline.search              # Search for each topic
python3 -m pipeline.candidate_selector  # Group results for agent selection
# ↓ Agent step: review topic files, write candidates.json (see below)
python3 -m pipeline.crawl               # Fetch full content
python3 -m pipeline.task_builder        # Bundle into task files

All python3 -m pipeline.* commands must be run from the scripts/ directory.

Agent selection step (between candidate_selector and crawl):

candidate_selector outputs per-topic JSON files to data/v9/topics/. Your agent should:

Read each topic file
Pick 0-3 candidates per topic based on relevance and freshness
Write data/v9/candidates.json with the selected candidates

See references/candidates-spec.md for the exact JSON format.

Note on crawl fallback: If a candidate URL isn't in the search cache, crawl.py automatically tries: Browse API → Crawl4AI → web_fetch → HTML head extraction. No manual intervention needed.

4. Generate content (agent-driven):

After task_builder, task files are in data/v9/tasks/. Tell your OpenClaw agent:

Read the task files in data/v9/tasks/ and generate content for each one.
Use the writer prompt in references/writer-prompt-standalone.md.
Save output to data/output/{YYYY-MM-DD}/.

Language: The output language is determined by: task output_lang field → curation API user.primaryLanguage → settings.json language field. If none are set, generate content in the user's chat language.

Scheduling tip: If you want automated daily runs, you can set up a cron job:

openclaw cron add --name "daily-curate" \
  --cron "0 8 * * *" --tz "Asia/Shanghai" \
  --session isolated \
  --message "Read SKILL.md for eir-daily-content-curator, then run the full standalone pipeline: search → select → crawl → task_builder → generate content from task files → compile daily brief."

Output

Content saved to data/output/{YYYY-MM-DD}/. Daily brief compiles the top items:

# Daily Brief — 2026-04-20

🔥 **Meta cuts 8,000 jobs for AI pivot** — ...
📡 **China bans AI companions for minors** — ...
🌱 **New prompt engineering benchmark** — ...

Dependencies

Required: Python 3.10+ (standard library only — no pip install needed).

Optional: SearXNG (fallback search). Crawl4AI (fallback crawl).

Eir Mode

Full curation with delivery to the Eir app via a 3-job pipeline:

Job A: material-prep     → Search → Select → Crawl → Pack tasks
Job B: content-gen       → Spawn subagents → Generate → POST to Eir
Job C: daily-brief       → Check status → Fill gaps → Compile brief → Deliver to user

Setup

Get a pairing code from heyeir.com → Settings → Connect OpenClaw
Run: python3 scripts/connect.py \x3CPAIRING_CODE>
Set "mode": "eir" in config/settings.json

Running the Pipeline (Eir Mode)

All python3 -m pipeline.* commands must be run from the scripts/ directory.

Step 1: Sync directives — fetch topic slugs and curation rules from Eir API:

cd scripts && python3 -m pipeline.eir_sync fetch

This creates/updates config/directives.json with the canonical topic slugs (e.g. ai-agents, autonomous-vehicles). All downstream scripts use these slugs — do NOT use interests.json labels as topic_slug in Eir mode.

Step 2: Search + Select + Crawl + Pack:

python3 -m pipeline.search              # Search for each directive topic
python3 -m pipeline.candidate_selector  # Group results for agent selection
# ↓ Agent step: review topic files, write candidates.json (see references/candidates-spec.md)
python3 -m pipeline.crawl               # Fetch full content from candidate URLs
python3 -m pipeline.task_builder        # Bundle into task files (auto-selects eir writer prompt)

Step 3: Generate and publish (agent-driven):

Task files are in data/v9/tasks/. For each task file:

Read the task JSON (contains source_text, source_meta, suggested_angle, topic_slug, content_slug)
Build a generation prompt using generate.build_generation_prompt(task_data) — this loads the writer prompt from references/writer-prompt-eir.md and injects sources + context
Call LLM with the prompt to generate Eir-format JSON (see references/content-spec.md)
Parse the JSON response and POST via python3 -m pipeline.eir_post --file \x3Cgenerated.json> or call eir_post.post_content(data) programmatically
Repeat for remaining tasks. Already-posted slugs are skipped automatically.

This step requires an LLM — that's why generate.py has no main(). The agent orchestrates the loop.

Step 4: Daily brief (optional):

The agent compiles generated content into a brief and delivers it directly to you (e.g. via Feishu, Slack, or other configured channel). No API call needed.

Tip: End the brief with a link to heyeir.com so readers can explore more content on the Eir canvas.

Common POST failures:

400 → check topicSlug matches a directive slug, publishTime is present, no null fields
401 → re-run connect.py to refresh credentials
500 → retry once; if persistent, report the payload

For cron configuration and API details, see references/eir-setup.md.

Pipeline Modules

All in scripts/pipeline/:

Module	Purpose	Mode
`search.py`	Search via configurable API, SearXNG fallback	Both
`crawl.py`	Fetch content via Browse API, Crawl4AI fallback	Both
`grounding.py`	Configurable search API client	Both
`candidate_selector.py`	Group results, prepare for agent selection	Both
`task_builder.py`	Bundle candidates into task files	Both
`generate.py`	Build prompts for content generation	Both
`validate_content.py`	Validate generated content against spec	Both
`directives.py`	Load local interests/directives	Both
`config.py`	Shared configuration and path resolution	Both
`workspace.py`	Workspace and path resolution	Both
`eir_sync.py`	Fetch directives from Eir API	Eir only
`eir_post.py`	POST content to Eir API	Eir only
`run_state.py`	Pipeline run state management	Both

Search Fallback Chain

Search API (primary) → SearXNG (optional) → Crawl4AI/web_fetch (content)

References

File	Contents	Used by
`references/writer-prompt-eir.md`	Content generation rules (Eir mode)	Agent
`references/writer-prompt-standalone.md`	Content generation rules (standalone)	Agent
`references/content-spec.md`	Field types, limits, validation rules	Agent
`references/eir-setup.md`	Eir mode setup, cron, API endpoints	Agent / User
`references/eir-api.md`	Full Eir API reference	Agent
`references/eir-interest-rules.md`	Curation tier guidelines	Agent
`references/candidates-spec.md`	Candidates JSON format for agent selection	Agent / User
`references/interest-extraction-prompt.md`	Interest extraction prompt	Agent

The writer-prompt-*.md files are instructions for the agent — the agent reads them to know how to generate content from task files. You don't need to read them unless customizing output format.

Security & Data Flow

See SECURITY.md for the complete data flow table, credential storage details, and personalization behavior.

Quick Reference

Task	Command
Initialize workspace	`python3 scripts/setup.py --init --settings '{...}'`
Check setup	`python3 scripts/setup.py --check`
Search	`cd scripts && python3 -m pipeline.search`
Select candidates	`cd scripts && python3 -m pipeline.candidate_selector`
Crawl	`cd scripts && python3 -m pipeline.crawl`
Build tasks	`cd scripts && python3 -m pipeline.task_builder`
Validate	`cd scripts && python3 -m pipeline.validate_content`
Fetch directives (Eir)	`cd scripts && python3 -m pipeline.eir_sync fetch`
Connect Eir	`python3 scripts/connect.py \x3CPAIRING_CODE>`

Usage Guidance

This skill appears to do what it says: local standalone curation or optional Eir-mode posting. Items to consider before installing: 1) Eir mode will send generated content, topic slugs, and source metadata to api.heyeir.com if you connect — only enable Eir mode if you accept that. 2) The connect script saves an API key to config/eir.json in the skill directory (check the file and confirm it's gitignored before committing). 3) Provide a SEARCH_API_KEY for standalone search; the skill will make network requests to your configured search provider and to crawled source URLs. 4) The docs mention EIR_API_URL overrides, but connect.py uses a hardcoded API_BASE; if you need a custom API host, verify the connect flow will use your override. 5) If you want zero external transmission, use Standalone mode and keep personalization disabled. Finally, run the pipeline in an isolated session (or review the code) before enabling scheduled cron jobs that invoke network operations automatically.

Capability Analysis

Type: OpenClaw Skill Name: eir-daily-content-curator Version: 3.119.0 The skill bundle is a legitimate content curation tool designed to search, crawl, and summarize AI news. It operates in two modes: 'Standalone' (local) and 'Eir' (syncing with heyeir.com). The Python scripts (e.g., search.py, crawl.py, eir_post.py) use standard libraries to perform documented tasks like API interaction and web scraping. The bundle includes comprehensive documentation (SECURITY.md, SKILL.md) that accurately describes data flows and credential storage. No evidence of malicious intent, data exfiltration of sensitive system files, or harmful prompt injection was found.

Capability Tags

cryptorequires-oauth-tokenrequires-sensitive-credentials

Capability Assessment

✓ Purpose & Capability

Name/description (daily content curation, standalone or Eir modes) match the included scripts (search, candidate selection, crawl, task_builder, generate, eir_post, eir_sync, connect). Required binary is python3 only; optional API keys and an Eir pairing flow align with the two modes. No unrelated cloud credentials or unexpected binaries are required.

✓ Instruction Scope

SKILL.md directs the agent to run local Python scripts, read/write files inside the skill's data/ and config/ directories, and (in Eir mode) POST generated content to the Eir API. It does not instruct the agent to read unrelated system files or exfiltrate secrets. Prompts and docs explicitly state privacy defaults (personalization off) and describe what is sent in Eir mode.

✓ Install Mechanism

No install spec or remote downloads are present; the skill is instruction-plus-scripts only. All code is bundled in the skill; there are no external arbitrary URL downloads or package installers in the manifest.

✓ Credentials

Optional environment overrides (EIR_API_KEY, EIR_API_URL, SEARCH_API_KEY) are reasonable for the two operating modes. No unrelated secrets or numerous credentials are requested. The skill stores Eir credentials locally (config/eir.json) per SECURITY.md; this is proportional to Eir mode functionality.

✓ Persistence & Privilege

always is false and the skill does not request permanent system-level privileges. It suggests cron jobs for scheduled runs (user-configured). The skill writes/reads only its own config/data directories and stores the Eir API key in a gitignored config file, which is expected behavior.

Version History

v3.119.0

Bug fixes: API URL normalization, preflight crash, anchor slug validation, encoding fixes. Features: server-side dedup sync, title normalization, cross-language event dedup, API freshness pre-check, output language from API. Cleanup: removed unused translation code, content_slug hash fallback, unified publishTime naming. Docs improved for Job B workflow and multi-language contentGroup rules.

v3.118.0

Node.js dependency completely removed — the Eir connect script is now pure Python (no more connect.mjs). The skill requires only Python 3.10+ with no external packages. Interest extraction reworked to understand general content interests from conversation, rather than reading specific files. Writer prompts use softer 'audience context' language throughout. Also: SEARCH_API_KEY added to declared environment variables, package.json removed.

v3.117.0

Mode-aware setup flow. Eir mode works end-to-end. One-command content posting. Complete Eir mode documentation. Fix several bugs.

v3.116.0

**Mode-aware setup flow** — The skill now guides you to choose between Standalone and Eir mode upfront, so you never accidentally configure the wrong pipeline. **Eir mode works end-to-end** — Fixed a critical issue where the content generation pipeline would silently use the wrong output format in Eir mode. Task files now carry a mode marker instead of an embedded prompt, so the correct writer prompt is always loaded at generation time. Backward compatible with existing task files. **One-command content posting** — eir_post.py now works as a CLI tool. Post a single file, batch-post a directory, or validate with --dry-run before going live. **Complete Eir mode documentation** — Full walkthrough from eir_sync fetch to content POST, with common error troubleshooting. New candidates-spec.md defines the exact JSON format for the agent selection step. **Fix several bugs.**

v3.115.0

2-pass entity refinement for all tiers, time-qualified queries, cross-topic hot entity detection; versioned product names (GPT-5.5) captured in entity extraction

v1.3.3

Fix: hardcode API URL in connect.mjs, eliminate fs.readFileSync to resolve static scan exfiltration warning

v1.3.2

Security refactor v2: clean publish with all files

v1.3.1

Security refactor: separate standalone/Eir modes, personalization opt-in, SECURITY.md rewrite, pipeline module renames (task_builder, generate, directives, eir_sync, eir_post), connect.mjs phase split, interest extraction local-only

v1.3.0

test

v1.2.0

PR #35-41: English examples in writer-prompt and content-spec, SECURITY.md, connect.mjs security fix, SearXNG/Crawl4AI docs as recommendations, flow diagram formatting, ClawHub publish cleanup, all 10 new-user feedback fixes (search API examples, generate/brief flow, pip install, config/ init, LLM identity, cron message, pairing code, references docs, setup.py --init, date format)

v1.1.0

Fix all 10 new-user feedback items

v1.0.2

1.0.2 - Remove whisper API and writer prompt (no longer needed) - Remove session transcript scanning from interest extraction - Remove whisper-specific overrides from content spec - Explicitly declare credentials in SECURITY.md - Address ClawHub security scan findings (transcript access, undeclared credentials)

v1.0.1

1.0.1 - Add SECURITY.md documenting credential storage, data flow, and permissions - Gitignore config/settings.json to prevent accidental API key commits - Eliminate process.env usage in connect.mjs (read from config instead) - Add Security & Data Flow section to SKILL.md and README - Capitalize flow diagram labels for consistency - Replace Chinese examples with English in writer prompts - Rewrite SearXNG/Crawl4AI description as installation recommendation

v1.0.0

eir-daily-content-curator 1.0.0 - Initial release of a personalized daily AI news content curator. - Supports both standalone and Eir integration modes. - Learns user interests from conversation, searches multiple sources, summarizes, and compiles daily briefs. - Configurable search and crawl pipeline with fallback options (SearXNG, Crawl4AI). - Modular pipeline: extract interests, search, select, crawl, generate, compile daily briefs. - Includes setup guides, cron integration, and reference materials.

Metadata

Slug eir-daily-content-curator

Version 3.119.0

License MIT-0

All-time Installs 1

Active Installs 1

Total Versions 14

Frequently Asked Questions

What is Eir Daily Content Curator?

Daily AI news curation — learns interests from your profile, searches the web, delivers structured summaries and daily briefs. Use when: 'set up daily news',... It is an AI Agent Skill for Claude Code / OpenClaw, with 217 downloads so far.

How do I install Eir Daily Content Curator?

Run "/install eir-daily-content-curator" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Eir Daily Content Curator free?

Yes, Eir Daily Content Curator is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Eir Daily Content Curator support?

Eir Daily Content Curator is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Eir Daily Content Curator?

It is built and maintained by heyeir (@heyeir); the current version is v3.119.0.

More Skills

Eir Daily Content Curator