← Back to Skills Marketplace
billwanttobetop

Protein QC Strict

by Billwanttobetop · GitHub ↗ · v4.0.0 · MIT-0
cross-platform ⚠ suspicious
69
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install protein-qc-strict
Description
Strictest protein sequence analysis quality control workflow (3365→456 sequences). Includes literature validation, CD-HIT redundancy removal, complexity chec...
Usage Guidance
This workflow appears to be a legitimate local protein QC pipeline, but review and preparation are required before running it: - Manual literature validation: The documentation emphasizes a literature-validation stage, but the provided script does not perform it. Ensure your input.fasta has already been curated/validated if you require that step. - Install missing Python packages: The script uses embedded Python that requires Biopython and numpy (and possibly other scientific packages). Install them (for example, conda install -c conda-forge biopython numpy) or the inline Python will fail. - Versioning and resource flags: cd-hit is invoked with -M 0 (no memory limit) and -T 0 (use all threads). These can consume all memory/CPUs on the host; adjust -M and -T to match your environment. Ensure MAFFT and trimAl versions are compatible with the commands used. - Input/output behavior: The script cds into the output_dir and writes files there, overwriting files with those names if present. Run in an isolated directory and back up important data. - Data sensitivity: The skill performs local processing only (no network calls in the included files), but your sequence data may be proprietary—do not run on sensitive data in untrusted environments. If you want this to be more robust: ask the author (or modify) to add explicit dependency installation for Python packages, either automated checks at runtime or documented requirements, and clarify whether/how literature validation should be performed (manual vs automated).
Capability Analysis
Type: OpenClaw Skill Name: protein-qc-strict Version: 4.0.0 The skill bundle provides a legitimate and well-documented workflow for protein sequence quality control, specifically tailored for the IRED enzyme family. The shell script `protein_qc_workflow.sh` and the instructions in `SKILL.md` correctly implement standard bioinformatics procedures using reputable tools like CD-HIT, MAFFT, and trimAl. There are no indicators of malicious intent, data exfiltration, or prompt injection; the embedded Python code is focused entirely on sequence filtering, entropy calculation, and alignment quality assessment.
Capability Assessment
Purpose & Capability
The declared required binaries (cd-hit, mafft, trimal) match the described workflow (redundancy removal, MSA, trimming). The overall purpose (protein QC for sequence families) aligns with the provided shell script and examples.
Instruction Scope
SKILL.md and README claim a Stage 1 'literature validation' (3365→840) to ensure experimental support for sequences, but the included script does not implement any automated literature or database checks—it assumes input is already validated. The runtime code embeds Python snippets that rely on Biopython and numpy, yet no Python package dependencies are declared. The instructions also recommend 'original tools' and set aggressive cd-hit flags (-M 0, -T 0) that imply using all memory/CPUs; these are operational choices that could impact a host but are coherent with the goal.
Install Mechanism
SKILL.md contains an 'install' block that recommends conda/bioconda packages for cd-hit, mafft and trimal (reasonable). However the registry metadata said 'No install spec' which is inconsistent with SKILL.md. There are no downloads from arbitrary URLs. The script and docs do not declare installation of required Python packages (Biopython, numpy), which are necessary to run the embedded Python code.
Credentials
The skill requests no environment variables, credentials, or config paths—appropriate for a purely local QC workflow that operates on user-provided FASTA files.
Persistence & Privilege
The skill is not always-enabled, does not request elevated or persistent privileges, and does not modify other skills or system-wide config. It runs locally and writes output files into the user-specified output directory.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install protein-qc-strict
  3. After installation, invoke the skill by name or use /protein-qc-strict
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v4.0.0
Complete workflow: 3365→456 sequences with literature validation, CD-HIT, complexity check, motif verification, MSA quality assessment. Battle-tested on IRED enzyme family.
Metadata
Slug protein-qc-strict
Version 4.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Protein QC Strict?

Strictest protein sequence analysis quality control workflow (3365→456 sequences). Includes literature validation, CD-HIT redundancy removal, complexity chec... It is an AI Agent Skill for Claude Code / OpenClaw, with 69 downloads so far.

How do I install Protein QC Strict?

Run "/install protein-qc-strict" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Protein QC Strict free?

Yes, Protein QC Strict is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Protein QC Strict support?

Protein QC Strict is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Protein QC Strict?

It is built and maintained by Billwanttobetop (@billwanttobetop); the current version is v4.0.0.

💬 Comments