/install ai-news-pipeline
\r \r
AI News Pipeline\r
\r
Overview\r
\r
This skill is executable by itself. The actual workflow scripts are bundled in scripts/.\r
Run them against the current workspace or pass --workspace /path/to/workspace explicitly.\r
\r
Workspace Requirements\r
\r The target workspace should contain or accept these files and folders:\r \r
config/sources.json\rconfig/international_sources.json\rcompanies.txt\rdata/\rreports/\rstate/\r \r If the folders do not exist, the scripts create them.\r \r
Install Dependencies\r
\r Install Python dependencies before first use:\r \r
python -m pip install -r /path/to/skill/scripts/requirements.txt\r
```\r
\r
## Available Entrypoints\r
\r
Use the bundled Python entrypoints depending on the job type.\r
\r
### Capture Only\r
\r
Use this for high-frequency collection jobs. It only captures feeds, updates deduplication state, and writes raw and incremental data.\r
\r
```bash\r
python /path/to/skill/scripts/run_capture_only.py --workspace /path/to/workspace\r
```\r
\r
### Report Only\r
\r
Use this for scheduled delivery jobs. It reads already-collected data, calls the model for summaries and titles, updates the cumulative Excel files, and rebuilds the Word brief.\r
\r
By default it uses the reporting window from yesterday 00:00 to today 08:00.\r
\r
```bash\r
python /path/to/skill/scripts/run_report_only.py --workspace /path/to/workspace\r
```\r
\r
Optional time window:\r
\r
```bash\r
python /path/to/skill/scripts/run_report_only.py --workspace /path/to/workspace --time-window "2026-03-15 00:00 to 2026-03-16 08:00"\r
```\r
\r
Optional skip-AI mode:\r
\r
```bash\r
python /path/to/skill/scripts/run_report_only.py --workspace /path/to/workspace --disable-ai\r
```\r
\r
## Full Workflow\r
\r
```bash\r
python /path/to/skill/scripts/run_full_workflow.py --workspace /path/to/workspace\r
```\r
\r
Optional time window:\r
\r
```bash\r
python /path/to/skill/scripts/run_full_workflow.py --workspace /path/to/workspace --time-window "2026-03-15 00:00 to 2026-03-15 18:00"\r
```\r
\r
Optional skip-AI mode:\r
\r
```bash\r
python /path/to/skill/scripts/run_full_workflow.py --workspace /path/to/workspace --disable-ai\r
```\r
\r
## What Each Entrypoint Does\r
\r
`run_capture_only.py`\r
1. Collect domestic RSS items into `data/YYYY-MM-DD.jsonl`.\r
2. Collect domestic raw items into `data/domestic_raw_YYYY-MM-DD.jsonl`.\r
3. Collect international raw items into `data/international_raw_YYYY-MM-DD.jsonl`.\r
4. Filter international items into `data/international_YYYY-MM-DD.jsonl`.\r
5. Save per-source snapshots in `snapshots/`.\r
6. Update RSS deduplication and source metrics in `state/feed_state.json`.\r
\r
`run_report_only.py`\r
1. Read the selected time window from collected data.\r
2. Build the cumulative domestic Excel output in `reports/company_mentions.xlsx`.\r
3. Build the cumulative international Excel output in `reports/international_company_mentions.xlsx`.\r
4. Call the model to generate domestic AI titles and AI summaries.\r
5. Call the model to generate international AI titles, AI summaries, and impact scores.\r
6. Build a merged daily Word brief in `reports/`.\r
\r
`run_full_workflow.py`\r
1. Run capture.\r
2. Run domestic reporting.\r
3. Run international reporting.\r
\r
## Inputs\r
\r
- Domestic RSS config: `config/sources.json`\r
- International RSS config: `config/international_sources.json`\r
- Company list: `companies.txt`\r
- Volcengine key: `ARK_API_KEY`\r
- Optional model override: `ARK_MODEL`\r
\r
## Security Review Notes\r
\r
This skill is designed to run an AI news collection and reporting workflow inside a user-provided workspace.\r
\r
It accesses external network resources for only two purposes:\r
1. reading user-configured RSS / Atom feeds to collect public news content;\r
2. calling a user-configured Volcengine model endpoint to generate AI titles, AI summaries, and impact scores.\r
\r
It writes local files because it needs to:\r
1. store raw and incremental collected news data;\r
2. persist deduplication state so repeated runs do not duplicate items;\r
3. generate cumulative Excel reports and a Word brief;\r
4. save feed snapshots and logs for troubleshooting and completeness checks.\r
\r
It does not upload arbitrary local files from the workspace and does not scan unrelated user content. External requests are limited to user-configured RSS URLs and the user-configured model endpoint.\r
\r
Credentials are only taken from user-provided configuration, such as RSS authentication data and `ARK_API_KEY`. These credentials are used only at runtime for the intended service and are not forwarded to unrelated destinations.\r
\r
## Important Behavior\r
\r
- `state/feed_state.json` controls RSS deduplication.\r
- Excel files are cumulative.\r
- The Word brief is rebuilt per run.\r
- The Word international section only includes the top 5 items by impact score inside the selected time window.\r
- International items without a successful AI summary are excluded from the Word brief.\r
- AI cache files are deleted automatically after each run.\r
\r
## Troubleshooting\r
\r
1. If the workflow does not rerun old RSS items, check `state/feed_state.json`.\r
2. If AI columns are empty, check whether `ARK_API_KEY` is set in the execution environment.\r
3. If the user wants a full rebuild, delete the relevant daily `data` files and `state/feed_state.json`, then rerun.\r
4. If the user needs exact commands or cloud prompts, read `references/commands.md`.\r
\r
## References\r
\r
- `references/commands.md`\r
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install ai-news-pipeline - After installation, invoke the skill by name or use
/ai-news-pipeline - Provide required inputs per the skill's parameter spec and get structured output
What is ai-news-pipeline-new?
Run a self-contained Chinese and international AI news workflow inside the current workspace. Use when the user wants either high-frequency RSS capture only... It is an AI Agent Skill for Claude Code / OpenClaw, with 283 downloads so far.
How do I install ai-news-pipeline-new?
Run "/install ai-news-pipeline" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is ai-news-pipeline-new free?
Yes, ai-news-pipeline-new is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does ai-news-pipeline-new support?
ai-news-pipeline-new is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created ai-news-pipeline-new?
It is built and maintained by Nighmat (@nighmat1220); the current version is v1.0.4.