← Back to Skills Marketplace
493
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install batch-processor-1-0-0
Description
Process multiple documents in bulk with parallel execution
README (SKILL.md)
Batch Processor Skill
Overview
This skill enables efficient bulk processing of documents - convert, transform, extract, or analyze hundreds of files with parallel execution and progress tracking.
How to Use
- Describe what you want to accomplish
- Provide any required input data or files
- I'll execute the appropriate operations
Example prompts:
- "Convert 100 PDFs to Word documents"
- "Extract text from all images in a folder"
- "Batch rename and organize files"
- "Mass update document headers/footers"
Domain Knowledge
Batch Processing Patterns
Input: [file1, file2, ..., fileN]
│
▼
┌─────────────┐
│ Parallel │ ← Process multiple files concurrently
│ Workers │
└─────────────┘
│
▼
Output: [result1, result2, ..., resultN]
Python Implementation
from concurrent.futures import ProcessPoolExecutor, as_completed
from pathlib import Path
from tqdm import tqdm
def process_file(file_path: Path) -> dict:
"""Process a single file."""
# Your processing logic here
return {"path": str(file_path), "status": "success"}
def batch_process(input_dir: str, pattern: str = "*.*", max_workers: int = 4):
"""Process all matching files in directory."""
files = list(Path(input_dir).glob(pattern))
results = []
with ProcessPoolExecutor(max_workers=max_workers) as executor:
futures = {executor.submit(process_file, f): f for f in files}
for future in tqdm(as_completed(futures), total=len(files)):
file = futures[future]
try:
result = future.result()
results.append(result)
except Exception as e:
results.append({"path": str(file), "error": str(e)})
return results
# Usage
results = batch_process("/documents/invoices", "*.pdf", max_workers=8)
print(f"Processed {len(results)} files")
Error Handling & Resume
import json
from pathlib import Path
class BatchProcessor:
def __init__(self, checkpoint_file: str = "checkpoint.json"):
self.checkpoint_file = checkpoint_file
self.processed = self._load_checkpoint()
def _load_checkpoint(self):
if Path(self.checkpoint_file).exists():
return json.load(open(self.checkpoint_file))
return {}
def _save_checkpoint(self):
json.dump(self.processed, open(self.checkpoint_file, "w"))
def process(self, files: list, processor_func):
for file in files:
if str(file) in self.processed:
continue # Skip already processed
try:
result = processor_func(file)
self.processed[str(file)] = {"status": "success", **result}
except Exception as e:
self.processed[str(file)] = {"status": "error", "error": str(e)}
self._save_checkpoint() # Resume-safe
Best Practices
- Use progress bars (tqdm) for user feedback
- Implement checkpointing for long jobs
- Set reasonable worker counts (CPU cores)
- Log failures for later review
Installation
# Install required dependencies
pip install python-docx openpyxl python-pptx reportlab jinja2
Resources
Usage Guidance
This skill appears coherent for bulk document processing. Before installing or running: (1) run pip installs in a virtualenv and add tqdm if you want progress bars; (2) ensure the agent/process you give this skill has only the file-system access it needs (it will read input files and create checkpoint.json in the working directory); (3) validate or scan input files (malicious documents can exploit processors); (4) if you plan autonomous runs, restrict the agent's file-path scope to prevent unexpected file access. If you need stronger guarantees, ask the author for explicit input validation and a minimal dependency list.
Capability Analysis
Type: OpenClaw Skill
Name: batch-processor-1-0-0
Version: 1.0.0
The skill is classified as suspicious due to its reliance on high-risk capabilities like `code_execution` and `file_operations` (declared in SKILL.md), and the instruction to `pip install` multiple packages. While these capabilities are plausibly needed for its stated purpose of 'batch processing' and the listed packages are legitimate, the open-ended nature of the `process_file` placeholder and the ability to install dependencies via `pip` present a significant attack surface for potential prompt injection or supply chain vulnerabilities if the agent's guardrails are insufficient or if a malicious dependency were introduced.
Capability Assessment
Purpose & Capability
The name/description (bulk document processing) matches the SKILL.md contents: design patterns, Python code examples using concurrent.futures, checkpointing, and suggested document libraries. The listed pip packages (python-docx, openpyxl, python-pptx, reportlab, jinja2) are relevant for converting and manipulating office files and PDFs.
Instruction Scope
Instructions stick to processing user-provided files, progress tracking, and checkpointing. The examples read and write files within the working directory (e.g., checkpoint.json) which is expected for a batch processor. There are no instructions to exfiltrate data, call unrelated external services, or access system credentials. A minor note: the prose allows the agent discretion ('I'll execute the appropriate operations')—this is normal for instruction skills but means the agent will need permission to run file operations and code execution.
Install Mechanism
This is an instruction-only skill (no install spec). The SKILL.md suggests installing Python packages via pip, which is proportionate. Minor inconsistency: the examples use tqdm for progress bars but tqdm is not included in the pip install command—add it if you expect CLI progress bars. No downloads from unknown hosts or archive extraction are present.
Credentials
The skill requests no environment variables, credentials, or config paths. The only file artifact is a local checkpoint (checkpoint.json) created in the working directory — this is appropriate for resume-safe batch jobs.
Persistence & Privilege
always is false and the skill does not request permanent presence or elevated privileges. It writes/reads its own checkpoint file but does not modify other skills or system-wide settings.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install batch-processor-1-0-0 - After installation, invoke the skill by name or use
/batch-processor-1-0-0 - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of the Batch Processor skill:
- Enables bulk processing of documents with parallel execution for tasks like conversion, extraction, and transformation.
- Supports progress tracking and checkpointing for easy error recovery and resume-safe operation.
- Provides sample Python implementation, including best practices for reliability and efficiency.
- Compatible with Claude Sonnet 4 and Opus 4 models; uses file, code execution, and basic computer tools.
- Includes installation guide and resource links.
Metadata
Frequently Asked Questions
What is Batch Processor 1.0.0?
Process multiple documents in bulk with parallel execution. It is an AI Agent Skill for Claude Code / OpenClaw, with 493 downloads so far.
How do I install Batch Processor 1.0.0?
Run "/install batch-processor-1-0-0" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Batch Processor 1.0.0 free?
Yes, Batch Processor 1.0.0 is completely free (open-source). You can download, install and use it at no cost.
Which platforms does Batch Processor 1.0.0 support?
Batch Processor 1.0.0 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Batch Processor 1.0.0?
It is built and maintained by Leo Wing (@leowing); the current version is v1.0.0.
More Skills