Ca File Processor
/install ca-file-processor
This skill processes the four most common file formats used by Indian CA firms and extracts structured information from them for analysis, summarisation, and answering queries.
Supported formats
- PDF — GST returns, ITR acknowledgements, audit reports, scanned invoices (text-layer and scanned via OCR)
- Excel (.xlsx / .xls) — Trial balance, P&L, balance sheets, payroll registers, GST workings
- CSV — Bank statement exports (HDFC, ICICI, SBI), GSTR-2B downloads, Tally exports
- Images (.jpg / .png) — WhatsApp invoice photos, scanned Form 16, cheque images
How to use
When a file is attached or uploaded, run the router script on it:
python3 scripts/skill_router.py <file_path>
The router auto-detects the file type and calls the correct processor. It returns a structured JSON dict.
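The routing step could look roughly like the sketch below. The internals of `skill_router.py` are not shown in this document, so the names `EXTENSION_KINDS`, `detect_kind` and `route`, and the shape of the returned dict, are assumptions for illustration only. The sketch dispatches on file extension alone; a real router might also inspect file contents.

```python
from pathlib import Path

# Hypothetical mapping from file extension to processor kind.
# The four groups mirror the supported formats listed above.
EXTENSION_KINDS = {
    ".pdf": "pdf",
    ".xlsx": "excel",
    ".xls": "excel",
    ".csv": "csv",
    ".jpg": "image",
    ".png": "image",
}


def detect_kind(file_path: str) -> str:
    """Return the processor kind for a path, or 'unsupported'."""
    suffix = Path(file_path).suffix.lower()
    return EXTENSION_KINDS.get(suffix, "unsupported")


def route(file_path: str) -> dict:
    """Build a structured result; a real router would call a processor here."""
    kind = detect_kind(file_path)
    if kind == "unsupported":
        return {"file": file_path, "status": "error", "error": "unsupported file type"}
    return {"file": file_path, "status": "ok", "kind": kind}
```

Returning an error dict rather than raising keeps the output uniformly structured, so the caller can always inspect the same top-level keys.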
What to do with the output
Once the script returns output, use it to:
- Answer the user's question about the document
- Extract specific fields they asked for (GSTIN, totals, dates)
- Summarise the document in plain language
- Flag anomalies or missing information
- Compare figures across multiple documents
Field extraction — what gets detected automatically
For invoices and PDFs:
- GSTIN (supplier and recipient)
- Invoice number and date
- Total amount / grand total
- PAN number
- Email and phone
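As an illustration of how identifiers like these can be pulled out of extracted text, here is a minimal sketch using regular expressions. The patterns follow the published GSTIN and PAN layouts, but the function name `extract_ids` and the output keys are hypothetical, and this is not necessarily how the skill itself detects them. The sketch checks the shape only and does not validate the GSTIN check character.

```python
import re

# GSTIN: 2-digit state code + 10-character PAN + entity code + 'Z' + check character.
GSTIN_RE = re.compile(r"\b\d{2}[A-Z]{5}\d{4}[A-Z][1-9A-Z]Z[0-9A-Z]\b")
# PAN: 5 letters + 4 digits + 1 letter.
PAN_RE = re.compile(r"\b[A-Z]{5}\d{4}[A-Z]\b")


def extract_ids(text: str) -> dict:
    """Find GSTIN-shaped and PAN-shaped tokens in free text."""
    gstins = GSTIN_RE.findall(text)
    # The word boundaries stop the PAN pattern from matching the PAN
    # embedded inside a GSTIN, because its neighbours are alphanumeric.
    pans = PAN_RE.findall(text)
    return {"gstins": gstins, "pans": pans}
```

Telling supplier from recipient would need surrounding context (labels such as "Bill to"), which this sketch does not attempt.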
For bank statements (CSV):
- Total debits and credits
- Date range of transactions
- Detected bank format
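The bank statement summary can be sketched as follows. This version uses only the standard library so that it runs anywhere; the column names `Date`, `Debit` and `Credit`, the `DD/MM/YYYY` date format, and the function name `summarise_statement` are assumptions, since real exports differ from bank to bank.

```python
import csv
import io
from datetime import datetime
from decimal import Decimal


def summarise_statement(csv_text: str) -> dict:
    """Total debits and credits plus the transaction date range."""
    total_debit = Decimal("0")
    total_credit = Decimal("0")
    dates = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        dates.append(datetime.strptime(row["Date"].strip(), "%d/%m/%Y").date())
        # A blank cell means no movement on that side; strip thousands separators.
        total_debit += Decimal(row["Debit"].replace(",", "").strip() or "0")
        total_credit += Decimal(row["Credit"].replace(",", "").strip() or "0")
    return {
        "total_debits": str(total_debit),
        "total_credits": str(total_credit),
        "date_from": min(dates).isoformat() if dates else None,
        "date_to": max(dates).isoformat() if dates else None,
    }
```

`Decimal` is used instead of `float` so that rupee and paise totals are not affected by binary rounding.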
For Excel files:
- Document type (trial balance / P&L / balance sheet / payroll / GST workings / ledger)
- Sheet names and row counts
- Preview of header rows
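One plausible way to guess the document type is keyword matching over the sheet name and header cells, sketched below. The keyword table, the `classify_sheet` name and the `"unknown"` fallback are all assumptions; the skill's actual detection rules are not documented here. In practice the header cells would be read from the workbook (for example with openpyxl) before being passed in.

```python
# Hypothetical keyword table, keyed by the document types listed above.
DOC_TYPE_KEYWORDS = {
    "trial balance": ["trial balance", "opening balance", "closing balance"],
    "P&L": ["profit and loss", "profit & loss", "p&l"],
    "balance sheet": ["balance sheet", "liabilities", "assets"],
    "payroll": ["payroll", "salary", "basic pay"],
    "GST workings": ["gst", "igst", "cgst", "sgst"],
}


def classify_sheet(sheet_name: str, header_cells: list) -> str:
    """Return the best-matching document type, or 'unknown' if nothing matches."""
    haystack = " ".join([sheet_name, *header_cells]).lower()
    # Score each type by how many of its keywords appear in the text.
    scores = {
        doc_type: sum(keyword in haystack for keyword in keywords)
        for doc_type, keywords in DOC_TYPE_KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "unknown"
```

Substring matching is deliberately crude: it is cheap and tolerant of messy headers, but ties resolve to whichever type appears first in the table.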
OCR notes
- Text-layer PDFs are read directly (fast, accurate)
- Scanned PDFs and images go through Tesseract OCR (English + Hindi)
- Confidence is rated high / medium / low in the output
- Always flag low-confidence results to the user and ask for confirmation on numeric fields
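The high / medium / low rating could be derived from per-word OCR confidences along the lines of the sketch below. The thresholds (85 and 60) and the function name `rate_confidence` are assumptions chosen for illustration, not the skill's documented values. The input is a list of per-word scores on a 0 to 100 scale, such as the `conf` values pytesseract returns from `image_to_data`.

```python
from statistics import mean


def rate_confidence(word_confidences, high=85.0, medium=60.0) -> str:
    """Bucket the mean word confidence into 'high', 'medium' or 'low'."""
    # Tesseract reports -1 for entries that are not recognised words; skip them.
    usable = [c for c in word_confidences if c >= 0]
    if not usable:
        # No recognised words at all is treated as the worst case.
        return "low"
    average = mean(usable)
    if average >= high:
        return "high"
    if average >= medium:
        return "medium"
    return "low"
```

A `"low"` result is the cue to ask the user to confirm any numeric fields before relying on them.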
Trust statement
This skill runs entirely locally on your server. No data is sent to any external service. All processing happens via open-source Python libraries (PyMuPDF, pytesseract, openpyxl, pandas).
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install ca-file-processor
- After installation, invoke the skill by name or use /ca-file-processor
- Provide required inputs per the skill's parameter spec and get structured output
What is Ca File Processor?
Process financial documents for Indian CA firms. Use when any PDF, Excel (.xlsx/.xls), CSV, JPG, or PNG file is received or uploaded — including GST returns,... It is an AI Agent Skill for Claude Code / OpenClaw, with 141 downloads so far.
How do I install Ca File Processor?
Run "/install ca-file-processor" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Ca File Processor free?
Yes, Ca File Processor is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Ca File Processor support?
Ca File Processor is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created Ca File Processor?
It is built and maintained by purvik6062 (@purvik6062); the current version is v1.0.3.