← 返回 Skills 市场
ivangdavila

Bioinformatics

作者 Iván · GitHub ↗ · v1.0.0
linuxdarwin ✓ 安全检测通过
1125
总下载
0
收藏
10
当前安装
1
版本数
在 OpenClaw 中安装
/install bioinformatics
功能描述
Analyze DNA, RNA, and protein sequences with alignment, variant calling, and expression analysis pipelines.
使用说明 (SKILL.md)

Setup

On first use, read setup.md for integration guidelines. Create ~/bioinformatics/ with user consent to store project context and preferences.

When to Use

User needs to analyze biological sequences, run genomic pipelines, or interpret sequencing data. Agent handles sequence alignment, variant calling, expression analysis, and format conversions.

Architecture

Memory lives in ~/bioinformatics/. See memory-template.md for structure.

~/bioinformatics/
├── memory.md         # Projects, preferences, reference genomes
├── pipelines/        # Saved pipeline configurations
└── results/          # Analysis outputs and logs

Quick Reference

Topic File
Setup process setup.md
Memory template memory-template.md
File formats formats.md
Tool commands tools.md
RNA-seq pipeline rnaseq.md
Variant calling variants.md

Core Rules

1. Verify Input Quality First

Before any analysis, check input data quality:

  • FASTQ: Run FastQC, check per-base quality, adapter content
  • BAM: Verify sorted, indexed (samtools quickcheck)
  • VCF: Validate format (bcftools view -h)

Bad input → garbage output. Always QC first.

2. Use Reference Genome Consistently

Track which reference is used per project:

  • Human: GRCh38/hg38 (prefer) or GRCh37/hg19
  • Mouse: GRCm39/mm39 or GRCm38/mm10
  • Mixing references = invalid results

Store reference info in ~/bioinformatics/memory.md per project.

3. Preserve Raw Data

NEVER modify original FASTQ/BAM files:

  • Work on copies
  • Keep originals read-only
  • Log every transformation step

4. Resource Awareness

Bioinformatics commands can consume massive resources:

  • Check file sizes before operations
  • Use streaming when possible (samtools view | ...)
  • Estimate memory needs (BWA: ~6GB for human genome)
  • Warn before operations >10 minutes

5. Reproducibility

Every analysis must be reproducible:

  • Log exact tool versions (samtools --version)
  • Save command parameters
  • Record input file checksums for critical analyses

Common Traps

  • Wrong chromosome namingchr1 vs 1 causes silent failures. Check and convert with sed 's/^chr//'
  • Unsorted BAM — Most tools expect sorted input. Symptoms: errors or wrong results with no warning
  • Index missing — BAM needs .bai, VCF needs .tbi. Commands fail cryptically without them
  • Memory exhaustion — Large BAM operations kill the session. Stream or use --threads wisely
  • Stale indices — After modifying BAM/VCF, regenerate index. Old index = corrupt reads
  • 0-based vs 1-based coordinates — BED is 0-based, VCF/GFF is 1-based. Off-by-one bugs are common

File Formats Quick Reference

Format Purpose Key Tool
FASTA Reference sequences samtools faidx
FASTQ Raw reads + quality seqtk, fastp
SAM/BAM Aligned reads samtools
VCF/BCF Variants bcftools
BED Genomic intervals bedtools
GFF/GTF Gene annotations gffread
BigWig Coverage tracks deepTools

Essential Commands

Quality Control

# FASTQ quality report
fastqc sample.fastq.gz -o qc_reports/

# Trim adapters + low quality
fastp -i R1.fq.gz -I R2.fq.gz -o R1.clean.fq.gz -O R2.clean.fq.gz

# BAM statistics
samtools flagstat aligned.bam
samtools stats aligned.bam > stats.txt

Alignment

# Index reference (once)
bwa index reference.fa

# Align paired-end reads
bwa mem -t 8 reference.fa R1.fq.gz R2.fq.gz | \
  samtools sort -o aligned.bam -

# Index BAM
samtools index aligned.bam

Variant Calling

# Call variants
bcftools mpileup -Ou -f reference.fa aligned.bam | \
  bcftools call -mv -Oz -o variants.vcf.gz

# Index VCF
bcftools index variants.vcf.gz

# Filter variants
bcftools filter -s LowQual -e 'QUAL\x3C20' variants.vcf.gz

Data Manipulation

# Extract region
samtools view -b aligned.bam chr1:1000000-2000000 > region.bam

# Convert BAM to FASTQ
samtools fastq -1 R1.fq.gz -2 R2.fq.gz aligned.bam

# Merge BAMs
samtools merge merged.bam sample1.bam sample2.bam

# Subset VCF by region
bcftools view -r chr1:1000-2000 variants.vcf.gz

Security & Privacy

Data access:

  • Only reads files user explicitly provides as input
  • Writes outputs to directories user specifies
  • Stores preferences in ~/bioinformatics/ (with consent)

Data that stays local:

  • All sequence data processed locally
  • No external API calls for analysis
  • Pipeline configs in ~/bioinformatics/

This skill does NOT:

  • Upload sequence data anywhere
  • Access files without explicit user instruction
  • Infer or collect data beyond explicit inputs
  • Make network requests during analysis

Note: Installing tools (conda, brew) and downloading reference genomes requires internet access. These are user-initiated actions.

Related Skills

Install with clawhub install \x3Cslug> if user confirms:

  • data-analysis — statistical interpretation
  • statistics — hypothesis testing
  • science — research methodology

Feedback

  • If useful: clawhub star bioinformatics
  • Stay updated: clawhub sync
安全使用建议
This skill appears coherent for local bioinformatics work, but review and consider the following before installing: - You (or the agent) must install and trust the listed binaries (samtools, bcftools, bedtools, bwa, fastqc, fastp); install them from official channels (bioconda, Homebrew) to avoid malicious packages. - The skill will ask to create ~/bioinformatics/ to store project context; only allow this if you’re comfortable the agent will save project metadata locally. It states it will ask for consent first. - Genomic data is highly sensitive: avoid uploading sequence files to external services and double-check that any downloads (reference genomes, tool installers) are user-initiated and from trusted sources. - The skill has no code files (instruction-only), so its behavior depends on the agent following the provided commands; review any commands you run and the outputs before sharing them. - If you plan to let the agent run pipelines autonomously, be aware those commands can consume large CPU, memory, and disk; verify resource availability and confirm long-running or large downloads before allowing them. If you want a deeper check, provide the agent's runtime environment details (what binaries are already installed, whether the agent has network access, and expected storage location contents) so you can validate where files will be written and what external downloads might occur.
功能分析
Type: OpenClaw Skill Name: bioinformatics Version: 1.0.0 The OpenClaw Bioinformatics skill bundle is benign. It provides comprehensive instructions and code snippets for standard bioinformatics pipelines using well-known tools like `samtools`, `bwa`, `bcftools`, `STAR`, and `GATK`. The `SKILL.md` and `setup.md` files explicitly guide the AI agent to obtain user consent for creating local workspace `~/bioinformatics/`, to only save explicitly provided information, and to avoid uploading sequence data or making unauthorized network requests. All commands are local tool invocations, and network access is limited to user-initiated tool installations via `conda`/`brew` or reference genome/database downloads, which are necessary for the stated purpose. There is no evidence of data exfiltration, malicious execution, persistence mechanisms, or prompt injection attempts designed to subvert the agent for harmful purposes.
能力评估
Purpose & Capability
Name/description (sequence alignment, variant calling, expression analysis) match the declared required binaries (samtools, bcftools, bedtools, bwa, fastqc, fastp) and the config path (~/bioinformatics/) used to store project context; nothing requested appears unrelated to the stated purpose.
Instruction Scope
SKILL.md is an instruction-only runtime guide that stays within scope: it instructs QC, alignment, variant calling, and local file operations, asks for user consent before creating ~/bioinformatics/, and explicitly states it will only read files the user supplies and will not upload data. It does not instruct reading unrelated system files or contacting hidden endpoints.
Install Mechanism
There is no install spec (instruction-only), and the included tool-install guidance uses standard channels (conda/bioconda, Homebrew). No downloads from untrusted URLs or extract/install steps are present in the skill files.
Credentials
The skill requests no environment variables or external credentials and only a config path in the user's home for optional persistence. That storage request is proportionate to maintaining project context for pipelines.
Persistence & Privilege
always is false and the skill explicitly asks for user consent before creating ~/bioinformatics/ or saving memory; it does not request elevated/system-wide privileges or modify other skills. Autonomous invocation is allowed (platform default) but not combined with other red flags.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install bioinformatics
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /bioinformatics 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release
元数据
Slug bioinformatics
版本 1.0.0
许可证
累计安装 10
当前安装数 10
历史版本数 1
常见问题

Bioinformatics 是什么?

Analyze DNA, RNA, and protein sequences with alignment, variant calling, and expression analysis pipelines. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 1125 次。

如何安装 Bioinformatics?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install bioinformatics」即可一键安装,无需额外配置。

Bioinformatics 是免费的吗?

是的,Bioinformatics 完全免费(开源免费),可自由下载、安装和使用。

Bioinformatics 支持哪些平台?

Bioinformatics 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(linux, darwin)。

谁开发了 Bioinformatics?

由 Iván(@ivangdavila)开发并维护,当前版本 v1.0.0。

💬 留言讨论