← Back to Skills Marketplace
biociao

bioinfo-style

by Fang, Chao · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
35
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install bioinfo-style
Description
标准化生物信息学分析流程。当用户提出以下任务时激活:(1) 测序数据分析(RNA-seq、ChIP-seq、WGS等),(2) 差异表达分析,(3) 通路/富集分析,(4) 组学数据可视化,(5) 统计检验,(6) 任何需要调用成熟生信工具而非编写Python/R算法代码的分析任务。核心原则:优先调用工具,记录s...
README (SKILL.md)

Bioinformatics Analysis - 标准化生信分析

核心原则

工具 > 代码:优先使用成熟的生信工具,绝不自己写算法代码。 过程记录:每一步都写 shell 脚本并执行,而非在对话中描述。 可重复性:所有分析必须可以通过脚本重现。

工作流程

1. 环境准备

# 读取 ~/.bashrc 中的环境配置
source ~/.bashrc

# 激活或创建 mamba 环境
mamba activate \x3Cenv_name>      # 已有环境
mamba create -n \x3Cenv_name> -c \x3Cchannels> \x3Cpackages>  # 创建新环境

# 常用生信环境示例
mamba create -n bioinfo -c conda-forge -c bioconda \
  fastqc trimmomatic star featurecounts deseq2 \
  samtools bedtools homer meme

2. 原始数据质量控制

# FastQC 质控
fastqc -o \x3Coutput_dir> -f fastq \x3Craw_data>/*.fastq.gz

# 批量处理脚本模板
cat > 01_qc.sh \x3C\x3C 'EOF'
#!/bin/bash
set -e
RAW_DIR="raw_data"
QC_DIR="qc_results"
mkdir -p $QC_DIR

for fq in $RAW_DIR/*.fastq.gz; do
    echo "Processing: $fq"
    fastqc -o $QC_DIR "$fq"
done
echo "QC completed"
EOF
bash 01_qc.sh

3. 序列比对/定量

# RNA-seq: STAR 比对 + featureCounts 定量
cat > 02_alignment.sh \x3C\x3C 'EOF'
#!/bin/bash
set -e
REF="genome/GRCh38"
SAMPLE="sample1"
GTF="annotation/gencode.v38.annotation.gtf"

mkdir -p alignment/$SAMPLE

star --genomeDir $REF \
     --readFilesIn reads/$SAMPLE.fastq.gz \
     --readFilesCommand zcat \
     --outSAMtype BAM SortedByCoordinate \
     --outFileNamePrefix alignment/$SAMPLE/

featureCounts -T 4 -t exon -g gene_id \
  -a $GTF \
  -o counts/$SAMPLE.counts \
  alignment/$SAMPLE/Aligned.sortedByCoord.out.bam
EOF
bash 02_alignment.sh

4. 差异分析(Rmarkdown)

cat > 03_differential_analysis.Rmd \x3C\x3C 'EOF'
---
title: "差异表达分析报告"
author: "Bioinformatics Pipeline"
date: "`r Sys.Date()`"
output:
  html_document:
    toc: true
    theme: united
---

```{r setup, message=FALSE}
library(DESeq2)
library(ggplot2)
library(pheatmap)
library(clusterProfiler)
# 读取计数矩阵
count_matrix \x3C- read.table("counts/matrix.txt", header=TRUE, row.names=1)
coldata \x3C- read.table("metadata.txt", header=TRUE, row.names=1)
dds \x3C- DESeqDataSetFromMatrix(countData=count_matrix,
                               colData=coldata,
                               design=~condition)
dds \x3C- DESeq(dds)
results \x3C- results(dds)
# MA plot
plotMA(results)
# 热图
top_genes \x3C- rownames(head(results[order(results$padj),], 20))
pheatmap(assay(vst(dds))[top_genes,])

EOF Rscript -e "rmarkdown::render('03_differential_analysis.Rmd')"


### 5. 富集分析

```bash
cat > 04_enrichment.sh \x3C\x3C 'EOF'
#!/bin/bash
set -e
GENELIST="differential_genes.txt"
OUTPUT="enrichment_results"

# GO富集
 enrichment.sh $GENELIST BP CC MF BP
clusterProfiler --dotplot --gseGO

# KEGG通路
enrichment.sh $GENELIST KEGG

# Reactome
enrichment.sh $GENELIST Reactome
EOF
bash 04_enrichment.sh

工具优先级

  1. QC: FastQC, MultiQC, Trimmomatic, Cutadapt
  2. 比对: STAR, BWA, Bowtie2, HISAT2
  3. 定量: featureCounts, htseq-count, Salmon, kallisto
  4. 差异分析: DESeq2, edgeR, limma(仅用已有R包,不写算法)
  5. 富集分析: clusterProfiler, GSEA, Enrichr
  6. 可视化: ggplot2, pheatmap, Gviz, IGV
  7. 基因组操作: BEDTools, SAMtools, BCFtools

关键规则

  • 绝不写Python脚本做已有的生信工具能做的事
  • 每一步都写 .sh 脚本并执行,记录到 logs/ 目录
  • 原始数据不修改,所有处理脚本化
  • 输出结构标准化
    project/
    ├── raw_data/        # 原始数据(只读)
    ├── qc_results/      # 质控结果
    ├── alignment/       # 比对结果
    ├── counts/          # 表达矩阵
    ├── results/         # 差异分析结果
    ├── figures/         # 图片
    ├── scripts/         # 所有分析脚本
    └── logs/            # 执行日志
    
  • Rmarkdown 报告包含:方法描述、统计结果、可视化图表

常用工具速查

详见 references/tools.md

分析脚本模板

详见 references/templates.md

Usage Guidance
Install this only if you expect the agent to run local bioinformatics commands, create scripts/logs/reports, and possibly create or update Conda/Mamba environments. Review generated commands before running them on valuable data or shared systems, and run analyses in an isolated project directory or environment when possible.
Capability Assessment
Purpose & Capability
The skill's capabilities match its stated purpose: sequencing QC, alignment, quantification, differential expression, enrichment analysis, visualization, and reproducible reporting.
Instruction Scope
The activation text is broad within bioinformatics and tells the agent to script and execute each step, but that behavior is explicit and central to the advertised workflow rather than hidden or unrelated.
Install Mechanism
The skill includes Mamba/Conda environment creation and package installation examples, which can change local tool environments and use network/storage, but these are disclosed as setup steps for standard bioinformatics tools.
Credentials
Local shell scripts, R Markdown reports, logs, and project output directories are proportionate for reproducible bioinformatics analysis; no credential access, cloud upload, or unrelated data collection was found.
Persistence & Privilege
Persistence is limited to generated scripts, logs, analysis outputs, reports, and optional local tool environments; no background service, privilege escalation, or hidden long-running persistence was evident.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install bioinfo-style
  3. After installation, invoke the skill by name or use /bioinfo-style
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of standardized bioinformatics analysis workflow. - Provides a reproducible pipeline for sequencing data analysis (QC, alignment, quantification, DE analysis, enrichment, visualization). - Emphasizes use of established bioinformatics tools over custom code. - Every step is scripted (shell/bash), with logs and organized output directories. - Results and visualization are generated through Rmarkdown reports. - Ensures raw data is unmodified and all processing is fully documented and repeatable. - Templates and tool references included for easy adaptation.
Metadata
Slug bioinfo-style
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is bioinfo-style?

标准化生物信息学分析流程。当用户提出以下任务时激活:(1) 测序数据分析(RNA-seq、ChIP-seq、WGS等),(2) 差异表达分析,(3) 通路/富集分析,(4) 组学数据可视化,(5) 统计检验,(6) 任何需要调用成熟生信工具而非编写Python/R算法代码的分析任务。核心原则:优先调用工具,记录s... It is an AI Agent Skill for Claude Code / OpenClaw, with 35 downloads so far.

How do I install bioinfo-style?

Run "/install bioinfo-style" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is bioinfo-style free?

Yes, bioinfo-style is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does bioinfo-style support?

bioinfo-style is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created bioinfo-style?

It is built and maintained by Fang, Chao (@biociao); the current version is v1.0.0.

💬 Comments