← Back to Skills Marketplace
aipoch-ai

Gene Structure Mapper

by AIpoch · GitHub ↗ · v0.1.0 · MIT-0
cross-platform ✓ Security Clean
183
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install gene-structure-mapper
Description
Visualize gene structure with exon-intron diagrams, domain annotations, and mutation position markers. Produces SVG, PNG, or PDF figures suitable for publica...
README (SKILL.md)

Gene Structure Mapper

Generate exon-intron structure diagrams for any gene symbol using the Ensembl REST API. Optionally overlay protein domain annotations (UniProt) and mark mutation hotspot positions. Outputs publication-ready SVG, PNG, or PDF figures.

IMPLEMENTEDscripts/main.py is fully functional. Ensembl REST API, caching, matplotlib visualization, --domains, --mutations, and --demo are all implemented.

Quick Check

python -m py_compile scripts/main.py
python scripts/main.py --help
python scripts/main.py --demo --output demo.png

When to Use

  • Creating gene structure figures for manuscripts or presentations
  • Visualizing splice variants and isoform differences
  • Marking mutation positions on a gene diagram for functional annotation
  • Overlaying domain boundaries on exon-intron maps

Workflow

  1. Confirm the user objective, required inputs, and non-negotiable constraints before doing detailed work.
  2. Validate that the request matches the documented scope and stop early if the task would require unsupported assumptions.
  3. Use the packaged script path or the documented reasoning path with only the inputs that are actually available.
  4. Return a structured result that separates assumptions, deliverables, risks, and unresolved items.
  5. If execution fails or inputs are incomplete, switch to the fallback path and state exactly what blocked full completion.

Fallback template: If scripts/main.py fails or the gene symbol is unrecognized, report: (a) the failure point, (b) whether a manual Ensembl/UCSC lookup can substitute, (c) which output formats are still generatable.

Parameters

Parameter Type Required Description
--gene, -g string Yes* Gene symbol or Ensembl ID (e.g., TP53, BRCA1, ENSG00000141510)
--species string No Species name for Ensembl lookup (default: homo_sapiens)
--format string No Output format: png, svg, pdf (default: png)
--output, -o string No Output file path (default: \x3Cgene>_structure.\x3Cformat>)
--domains flag No Fetch and overlay UniProt protein domain annotations
--mutations string No Comma-separated codon positions to mark (e.g., 248,273)
--demo flag No Use hardcoded TP53 GRCh38 data — no internet required

*Required unless --demo is used.

Usage

python scripts/main.py --gene TP53 --format png
python scripts/main.py --gene BRCA1 --format png --domains --output brca1_structure.png
python scripts/main.py --gene KRAS --mutations 12,13,61 --format pdf
python scripts/main.py --demo
python scripts/main.py --demo --output demo.png --format svg

Implementation Notes (for script developer)

The script must implement:

  1. Gene lookupGET https://rest.ensembl.org/lookup/symbol/homo_sapiens/{gene}?expand=1 to fetch exon coordinates. Cache response to .cache/{gene}_ensembl.json to avoid repeated API calls. Add a 0.1 s delay between requests for batch lookups. The unauthenticated rate limit is 15 requests/second.
  2. Unknown gene handling — catch HTTP 400/404 from Ensembl and exit with code 1: Error: Gene not found: {gene_name}. Check the gene symbol and try again.
  3. SVG/PNG/PDF output — use matplotlib or svgwrite to draw exon blocks (filled rectangles) and intron lines scaled to genomic coordinates.
  4. --domains flag — fetch UniProt domain annotations and overlay colored domain blocks on the gene structure.
  5. --mutations flag — accept comma-separated codon positions; map to exon coordinates and draw vertical markers.
  6. --demo flag — use hardcoded TP53 GRCh38 exon coordinates (no internet required) to generate a demo visualization.

Known Limitations

  • For genes with multiple isoforms, the script uses the canonical transcript (Ensembl is_canonical flag). Other isoforms are not visualized.
  • Domain overlay (--domains) maps UniProt amino acid positions to genomic coordinates using CDS length; accuracy may vary for genes with complex splicing.
  • Ensembl API responses are cached to .cache/{gene}_ensembl.json. Delete the cache file to force a fresh lookup.
  • The unauthenticated Ensembl REST API rate limit is 15 requests/second; a 0.1 s delay is applied between batch requests.

Features

  • Exon-intron visualization scaled to genomic coordinates
  • Protein domain annotation overlay via UniProt (optional, --domains)
  • Mutation position markers with configurable labels (--mutations)
  • Publication-ready output in SVG, PNG, or PDF
  • Demo mode for offline testing (--demo)
  • Ensembl API response caching to avoid rate-limit issues

Output Requirements

Every response must make these explicit:

  • Objective and deliverable
  • Inputs used and assumptions introduced (e.g., genome build, transcript isoform selected)
  • Workflow or decision path taken
  • Core result: gene structure figure file path
  • Constraints, risks, caveats (e.g., multi-isoform genes, annotation version)
  • Unresolved items and next-step checks

Input Validation

This skill accepts: gene symbol inputs for structure visualization, with optional domain and mutation overlays.

If the request does not involve gene structure visualization — for example, asking to perform sequence alignment, predict protein structure, or analyze expression data — do not proceed. Instead respond:

"gene-structure-mapper is designed to visualize gene exon-intron structure. Your request appears to be outside this scope. Please provide a gene symbol and desired output format, or use a more appropriate tool for your task."

Error Handling

  • If --gene is missing, state that the gene symbol is required and provide an example.
  • If the gene symbol is not found in Ensembl (HTTP 400/404), print: Error: Gene not found: {gene_name}. Check the gene symbol and try again. and exit with code 1.
  • If --mutations contains non-numeric values, reject with: Error: --mutations must be comma-separated integers (codon positions).
  • If the task goes outside the documented scope, stop instead of guessing or silently widening the assignment.
  • If scripts/main.py fails, report the failure point, summarize what still can be completed safely, and provide a manual fallback.
  • Do not fabricate files, citations, data, search results, or execution outcomes.

Response Template

  1. Objective
  2. Inputs Received
  3. Assumptions
  4. Workflow
  5. Deliverable
  6. Risks and Limits
  7. Next Checks
Usage Guidance
This skill appears to be what it claims: a Python script that fetches Ensembl (and optionally EBI/UniProt) data to produce gene diagrams. Before installing or running it, consider: (1) Network access — live mode makes outbound requests to https://rest.ensembl.org and https://www.ebi.ac.uk; if you need to run offline, use --demo which uses hardcoded TP53 data. (2) Local files — the script creates/reads .cache/{gene}_ensembl.json and writes image files to the output path; check that this is acceptable in your environment. (3) No credentials are requested, but verify that running code from an unknown source is acceptable for your risk posture; review the script (scripts/main.py) if you need higher assurance. (4) Domain-to-genome mapping is approximate (as documented); validate figures for publication. If you want extra caution, run the demo first, inspect the code, and consider executing inside an isolated environment (container or VM) before using it on sensitive systems.
Capability Analysis
Type: OpenClaw Skill Name: gene-structure-mapper Version: 0.1.0 The skill bundle is a legitimate bioinformatics tool designed to visualize gene exon-intron structures. The Python script (scripts/main.py) uses standard libraries to fetch genomic data from public, unauthenticated APIs (Ensembl and EBI) and generates diagrams using Matplotlib. There is no evidence of malicious intent, data exfiltration, or dangerous execution patterns like shell injection or dynamic code evaluation. The documentation (SKILL.md) includes robust input validation and clear instructions for the AI agent that align with the stated purpose.
Capability Assessment
Purpose & Capability
Name and description (gene exon/intron visualization with optional domain/mutation overlays) match the included script and SKILL.md. Required operations (Ensembl REST lookup, optional UniProt/EBI features fetch, matplotlib drawing, caching) are appropriate and expected for this functionality.
Instruction Scope
SKILL.md and scripts/main.py stick to the declared task. The runtime instructions and script fetch data from Ensembl and EBI Proteins APIs, cache responses to .cache/{gene}_ensembl.json, and write image output — all coherent with the stated purpose. Note: the skill performs network I/O and file writes (cache + generated figures); these are expected but worth being aware of.
Install Mechanism
No install specification is provided and no external installers are pulled. The skill is delivered as source (Python script) that uses standard libraries plus requests and matplotlib; nothing is downloaded during installation beyond normal runtime network calls to data APIs.
Credentials
The skill requests no environment variables, no credentials, and no unusual config paths. Network access to public Ensembl and EBI endpoints is required for live runs; the --demo mode avoids network use. No secrets/external credentials are asked for or embedded.
Persistence & Privilege
always is false and the skill does not request persistent platform privileges. It writes its own cache file under .cache/ and output files as requested — behavior appropriate for a data-caching visualization tool. It does not modify other skills or system-wide agent settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install gene-structure-mapper
  3. After installation, invoke the skill by name or use /gene-structure-mapper
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.0
- Initial public release (v0.1.0) of gene-structure-mapper. - Visualizes gene structures from a gene symbol using Ensembl API. - Supports exon-intron diagrams, optional UniProt domain overlays (--domains), and mutation marker positions (--mutations). - Outputs publication-ready SVG, PNG, or PDF figures. - Includes demo mode (--demo) using TP53 without internet and implements API response caching. - Comprehensive error handling, scope limits, and fallback procedures provided in documentation.
Metadata
Slug gene-structure-mapper
Version 0.1.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Gene Structure Mapper?

Visualize gene structure with exon-intron diagrams, domain annotations, and mutation position markers. Produces SVG, PNG, or PDF figures suitable for publica... It is an AI Agent Skill for Claude Code / OpenClaw, with 183 downloads so far.

How do I install Gene Structure Mapper?

Run "/install gene-structure-mapper" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Gene Structure Mapper free?

Yes, Gene Structure Mapper is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Gene Structure Mapper support?

Gene Structure Mapper is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Gene Structure Mapper?

It is built and maintained by AIpoch (@aipoch-ai); the current version is v0.1.0.

💬 Comments