← Back to Skills Marketplace
nighmat1220

xinwencaiji

by Nighmat · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
197
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install ai-news-collection
Description
Run a self-contained Chinese and international AI news workflow inside the current workspace. Use when the user wants either high-frequency RSS capture only...
README (SKILL.md)

\r \r

AI News Pipeline\r

\r

Overview\r

\r This skill is executable by itself. The actual workflow scripts are bundled in scripts/.\r Run them against the current workspace or pass --workspace /path/to/workspace explicitly.\r \r

Workspace Requirements\r

\r The target workspace should contain or accept these files and folders:\r \r

  • config/sources.json\r
  • config/international_sources.json\r
  • companies.txt\r
  • data/\r
  • reports/\r
  • state/\r \r If the folders do not exist, the scripts create them.\r \r

Install Dependencies\r

\r Install Python dependencies before first use:\r \r

python -m pip install -r /path/to/skill/scripts/requirements.txt\r
```\r
\r
## Available Entrypoints\r
\r
Use the bundled Python entrypoints depending on the job type.\r
\r
### Capture Only\r
\r
Use this for high-frequency collection jobs. It only captures feeds, updates deduplication state, and writes raw and incremental data.\r
\r
```bash\r
python /path/to/skill/scripts/run_capture_only.py --workspace /path/to/workspace\r
```\r
\r
### Report Only\r
\r
Use this for scheduled delivery jobs. It reads already-collected data, calls the model for summaries and titles, updates the cumulative Excel files, and rebuilds the Word brief.\r
\r
By default it uses the reporting window from yesterday 00:00 to today 08:00.\r
\r
```bash\r
python /path/to/skill/scripts/run_report_only.py --workspace /path/to/workspace\r
```\r
\r
Optional time window:\r
\r
```bash\r
python /path/to/skill/scripts/run_report_only.py --workspace /path/to/workspace --time-window "2026-03-15 00:00 to 2026-03-16 08:00"\r
```\r
\r
Optional skip-AI mode:\r
\r
```bash\r
python /path/to/skill/scripts/run_report_only.py --workspace /path/to/workspace --disable-ai\r
```\r
\r
## Full Workflow\r
\r
```bash\r
python /path/to/skill/scripts/run_full_workflow.py --workspace /path/to/workspace\r
```\r
\r
Optional time window:\r
\r
```bash\r
python /path/to/skill/scripts/run_full_workflow.py --workspace /path/to/workspace --time-window "2026-03-15 00:00 to 2026-03-15 18:00"\r
```\r
\r
Optional skip-AI mode:\r
\r
```bash\r
python /path/to/skill/scripts/run_full_workflow.py --workspace /path/to/workspace --disable-ai\r
```\r
\r
## What Each Entrypoint Does\r
\r
`run_capture_only.py`\r
1. Collect domestic RSS items into `data/YYYY-MM-DD.jsonl`.\r
2. Collect domestic raw items into `data/domestic_raw_YYYY-MM-DD.jsonl`.\r
3. Collect international raw items into `data/international_raw_YYYY-MM-DD.jsonl`.\r
4. Filter international items into `data/international_YYYY-MM-DD.jsonl`.\r
5. Save per-source snapshots in `snapshots/`.\r
6. Update RSS deduplication and source metrics in `state/feed_state.json`.\r
\r
`run_report_only.py`\r
1. Read the selected time window from collected data.\r
2. Build the cumulative domestic Excel output in `reports/company_mentions.xlsx`.\r
3. Build the cumulative international Excel output in `reports/international_company_mentions.xlsx`.\r
4. Call the model to generate domestic AI titles and AI summaries.\r
5. Call the model to generate international AI titles, AI summaries, and impact scores.\r
6. Build a merged daily Word brief in `reports/`.\r
\r
`run_full_workflow.py`\r
1. Run capture.\r
2. Run domestic reporting.\r
3. Run international reporting.\r
\r
## Inputs\r
\r
- Domestic RSS config: `config/sources.json`\r
- International RSS config: `config/international_sources.json`\r
- Company list: `companies.txt`\r
- Volcengine key: `ARK_API_KEY`\r
- Optional model override: `ARK_MODEL`\r
\r
## Important Behavior\r
\r
- `state/feed_state.json` controls RSS deduplication.\r
- Excel files are cumulative.\r
- The Word brief is rebuilt per run.\r
- The Word international section only includes the top 5 items by impact score inside the selected time window.\r
- International items without a successful AI summary are excluded from the Word brief.\r
- AI cache files are deleted automatically after each run.\r
\r
## Troubleshooting\r
\r
1. If the workflow does not rerun old RSS items, check `state/feed_state.json`.\r
2. If AI columns are empty, check whether `ARK_API_KEY` is set in the execution environment.\r
3. If the user wants a full rebuild, delete the relevant daily `data` files and `state/feed_state.json`, then rerun.\r
4. If the user needs exact commands or cloud prompts, read `references/commands.md`.\r
\r
## References\r
\r
- `references/commands.md`\r
Usage Guidance
This skill appears to do what it says (fetch RSS feeds, build Excel/Word reports, and optionally call an external model for summaries). Before installing or running it, consider the following: - The scripts will send collected article content to an external model endpoint using ARK_API_KEY / ARK_API_BASE. The registry metadata did not list these required env vars — verify the skill author and the endpoint before providing keys. - Default ARK_API_BASE points to an external service (https://ark.cn-beijing.volces.com). Confirm this is a trusted API and that your ARK_API_KEY is scoped appropriately. If you do not want outbound data sent, run with --disable-ai or omit ARK_API_KEY. - Feed configs can include authentication (username/password or custom headers). Those credentials may be used by the feed fetcher and stored under workspace state/logs; keep sensitive feeds out of the same workspace or review how credentials are provided. - The skill will create and write files/directories (data/, reports/, state/, logs/, snapshots/) in the chosen workspace. Back up or isolate any existing data you care about. - The package source is unknown and has no homepage; if you intend to run it in a production or sensitive environment, review the full script contents (they are bundled) and verify the model endpoint and data handling behavior. If you trust the code and endpoint: set ARK_API_KEY (and ARK_API_BASE/ARK_MODEL if needed), run dependency installation in an isolated environment, and consider running with --disable-ai first to validate data ingestion without external network calls.
Capability Analysis
Type: OpenClaw Skill Name: ai-news-collection Version: 1.0.0 The skill bundle implements a legitimate AI-driven news aggregation and reporting pipeline. It collects RSS/Atom feeds (collect_feeds.py), processes them using the Volcengine Ark API for summarization and impact scoring (generate_company_report.py, generate_international_report.py), and produces formatted Excel and Word documents (brief_writer.py). The code follows standard practices for data processing, uses environment variables for API keys, and lacks any indicators of data exfiltration, persistence, or malicious prompt injection.
Capability Assessment
Purpose & Capability
The skill's code and SKILL.md align with the stated purpose: collecting RSS/Atom feeds, deduplicating, producing cumulative Excel files and a Word brief, and optionally calling an AI model for titles/summaries. However, the registry metadata did not declare the external model credential (ARK_API_KEY) or model base (ARK_API_BASE) even though SKILL.md and scripts require them; this mismatch is unexpected and should be corrected.
Instruction Scope
Runtime instructions are narrowly scoped to the workspace (config, data, reports, state) and to running the bundled Python scripts. The scripts fetch arbitrary RSS/Atom URLs from user-provided config files and will POST content to an external model API for AI summarization. That behavior is coherent with the skill's purpose but means collected article text (and any credentials present in feed configs) will be transmitted externally.
Install Mechanism
There is no install spec in the registry (instruction-only). The bundled scripts list Python dependencies (openpyxl, python-docx) to be installed via pip — a low-risk, typical approach. No arbitrary binary downloads or obscure installers are present.
Credentials
The SKILL.md and scripts require ARK_API_KEY (and allow ARK_MODEL / ARK_API_BASE overrides) to call an external model service. The registry's required-env list is empty, which is inconsistent. Requesting an API key for the external model is proportionate to the AI-summary functionality, but the missing declaration and the default ARK_API_BASE (https://ark.cn-beijing.volces.com/api/v3) require you to verify the endpoint and trust the operator before supplying credentials.
Persistence & Privilege
The skill does not request always:true and does not modify other skills or system-wide settings. It writes state, logs, caches, reports, and snapshots into the workspace (data/, reports/, state/, logs/), which is normal for this type of tool.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install ai-news-collection
  3. After installation, invoke the skill by name or use /ai-news-collection
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of ai-news-pipeline skill. - Provides a complete AI news workflow for Chinese and international sources, runnable directly in the current workspace. - Adds high-frequency RSS capture ("capture only") and scheduled report delivery ("report only") modes, both generating cumulative Excel outputs and a merged Word brief. - Bundles all necessary scripts under scripts/, with entrypoints for capture, reporting, and full workflow. - No external local repository path required; all outputs, state, and configuration are managed inside the workspace. - Includes comprehensive instructions for setup, inputs, behavior, and troubleshooting in SKILL.md.
Metadata
Slug ai-news-collection
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is xinwencaiji?

Run a self-contained Chinese and international AI news workflow inside the current workspace. Use when the user wants either high-frequency RSS capture only... It is an AI Agent Skill for Claude Code / OpenClaw, with 197 downloads so far.

How do I install xinwencaiji?

Run "/install ai-news-collection" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is xinwencaiji free?

Yes, xinwencaiji is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does xinwencaiji support?

xinwencaiji is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created xinwencaiji?

It is built and maintained by Nighmat (@nighmat1220); the current version is v1.0.0.

💬 Comments