/install ai-news-collection
\r \r
AI News Pipeline\r
\r
Overview\r
\r
This skill is executable by itself. The actual workflow scripts are bundled in scripts/.\r
Run them against the current workspace or pass --workspace /path/to/workspace explicitly.\r
\r
Workspace Requirements\r
\r The target workspace should contain or accept these files and folders:\r \r
config/sources.json\rconfig/international_sources.json\rcompanies.txt\rdata/\rreports/\rstate/\r \r If the folders do not exist, the scripts create them.\r \r
Install Dependencies\r
\r Install Python dependencies before first use:\r \r
python -m pip install -r /path/to/skill/scripts/requirements.txt\r
```\r
\r
## Available Entrypoints\r
\r
Use the bundled Python entrypoints depending on the job type.\r
\r
### Capture Only\r
\r
Use this for high-frequency collection jobs. It only captures feeds, updates deduplication state, and writes raw and incremental data.\r
\r
```bash\r
python /path/to/skill/scripts/run_capture_only.py --workspace /path/to/workspace\r
```\r
\r
### Report Only\r
\r
Use this for scheduled delivery jobs. It reads already-collected data, calls the model for summaries and titles, updates the cumulative Excel files, and rebuilds the Word brief.\r
\r
By default it uses the reporting window from yesterday 00:00 to today 08:00.\r
\r
```bash\r
python /path/to/skill/scripts/run_report_only.py --workspace /path/to/workspace\r
```\r
\r
Optional time window:\r
\r
```bash\r
python /path/to/skill/scripts/run_report_only.py --workspace /path/to/workspace --time-window "2026-03-15 00:00 to 2026-03-16 08:00"\r
```\r
\r
Optional skip-AI mode:\r
\r
```bash\r
python /path/to/skill/scripts/run_report_only.py --workspace /path/to/workspace --disable-ai\r
```\r
\r
## Full Workflow\r
\r
```bash\r
python /path/to/skill/scripts/run_full_workflow.py --workspace /path/to/workspace\r
```\r
\r
Optional time window:\r
\r
```bash\r
python /path/to/skill/scripts/run_full_workflow.py --workspace /path/to/workspace --time-window "2026-03-15 00:00 to 2026-03-15 18:00"\r
```\r
\r
Optional skip-AI mode:\r
\r
```bash\r
python /path/to/skill/scripts/run_full_workflow.py --workspace /path/to/workspace --disable-ai\r
```\r
\r
## What Each Entrypoint Does\r
\r
`run_capture_only.py`\r
1. Collect domestic RSS items into `data/YYYY-MM-DD.jsonl`.\r
2. Collect domestic raw items into `data/domestic_raw_YYYY-MM-DD.jsonl`.\r
3. Collect international raw items into `data/international_raw_YYYY-MM-DD.jsonl`.\r
4. Filter international items into `data/international_YYYY-MM-DD.jsonl`.\r
5. Save per-source snapshots in `snapshots/`.\r
6. Update RSS deduplication and source metrics in `state/feed_state.json`.\r
\r
`run_report_only.py`\r
1. Read the selected time window from collected data.\r
2. Build the cumulative domestic Excel output in `reports/company_mentions.xlsx`.\r
3. Build the cumulative international Excel output in `reports/international_company_mentions.xlsx`.\r
4. Call the model to generate domestic AI titles and AI summaries.\r
5. Call the model to generate international AI titles, AI summaries, and impact scores.\r
6. Build a merged daily Word brief in `reports/`.\r
\r
`run_full_workflow.py`\r
1. Run capture.\r
2. Run domestic reporting.\r
3. Run international reporting.\r
\r
## Inputs\r
\r
- Domestic RSS config: `config/sources.json`\r
- International RSS config: `config/international_sources.json`\r
- Company list: `companies.txt`\r
- Volcengine key: `ARK_API_KEY`\r
- Optional model override: `ARK_MODEL`\r
\r
## Important Behavior\r
\r
- `state/feed_state.json` controls RSS deduplication.\r
- Excel files are cumulative.\r
- The Word brief is rebuilt per run.\r
- The Word international section only includes the top 5 items by impact score inside the selected time window.\r
- International items without a successful AI summary are excluded from the Word brief.\r
- AI cache files are deleted automatically after each run.\r
\r
## Troubleshooting\r
\r
1. If the workflow does not rerun old RSS items, check `state/feed_state.json`.\r
2. If AI columns are empty, check whether `ARK_API_KEY` is set in the execution environment.\r
3. If the user wants a full rebuild, delete the relevant daily `data` files and `state/feed_state.json`, then rerun.\r
4. If the user needs exact commands or cloud prompts, read `references/commands.md`.\r
\r
## References\r
\r
- `references/commands.md`\r
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install ai-news-collection - After installation, invoke the skill by name or use
/ai-news-collection - Provide required inputs per the skill's parameter spec and get structured output
What is xinwencaiji?
Run a self-contained Chinese and international AI news workflow inside the current workspace. Use when the user wants either high-frequency RSS capture only... It is an AI Agent Skill for Claude Code / OpenClaw, with 197 downloads so far.
How do I install xinwencaiji?
Run "/install ai-news-collection" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is xinwencaiji free?
Yes, xinwencaiji is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does xinwencaiji support?
xinwencaiji is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created xinwencaiji?
It is built and maintained by Nighmat (@nighmat1220); the current version is v1.0.0.