Description

Humanize murine antibody sequences using CDR grafting and framework optimization to reduce immunogenicity while preserving antigen binding. Predicts optimal...

README (SKILL.md)

Antibody Humanizer

Name: Antibody Humanizer
Author: aipoch-ai

Overview

Bioinformatics platform for converting murine antibodies into humanized variants by grafting complementarity-determining regions (CDRs) onto human framework templates while preserving antigen-binding affinity and reducing immunogenicity risk.

Key Capabilities:

CDR Identification: Automatic CDR boundary detection (Kabat/Chothia/IMGT schemes)
Framework Matching: Database search for optimal human germline templates
Humanization Scoring: Multi-parameter immunogenicity risk assessment
Back-Mutation Prediction: Identify critical framework residues for retention
Batch Processing: Humanize multiple antibody candidates efficiently
Immunogenicity Assessment: T-cell epitope and humanness scoring

When to Use

✅ Use this skill when:

Converting murine hybridoma antibodies to therapeutic candidates
Reducing immunogenicity risk of rodent-derived antibodies
Selecting human framework templates for CDR grafting
Identifying critical framework residues for antigen binding
Comparing multiple humanization strategies for lead optimization
Preparing antibody sequences for patent filings
Teaching antibody engineering principles

❌ Do NOT use when:

Fully human antibody generation from phage display → Use phage-display-library
De novo antibody design → Use antibody-design-ai
Affinity maturation → Use affinity-maturation-predictor
ADCC/CDC optimization → Use fc-engineering-toolkit
Final therapeutic candidate selection → Requires experimental validation

Integration:

Upstream: antibody-sequencer (VH/VL sequence determination), cdr-grafting-validator (structural assessment)
Downstream: protein-struct-viz (3D visualization), immunogenicity-predictor (T-cell epitope analysis)

Core Capabilities

1. CDR Region Identification

Parse antibody sequences and identify CDR boundaries:

from scripts.humanizer import AntibodyHumanizer

humanizer = AntibodyHumanizer()

# Analyze antibody sequence
analysis = humanizer.analyze_sequence(
    vh_sequence="QVQLQQSGPELVKPGASVKISCKASGYTFTDYYMHWVKQSHGKSLEWIGYINPSTGYTEYNQKFKDKATLTVDKSSSTAYMQLSSLTSEDSAVYYCAR...",
    vl_sequence="DIQMTQSPSSLSASVGDRVTITCRASQGISSWLAWYQQKPGKAPKLLIYKASSLESGVPSRFSGSGSGTDFTLTISSLQPEDFATYYCQQYSSYPYT...",
    scheme="chothia"  # Options: kabat, chothia, imgt
)

# Output CDR locations
print(analysis.cdr_regions)
# {
#   "VH_CDR1": {"start": 26, "end": 32, "seq": "GYTFTDY"},
#   "VH_CDR2": {"start": 52, "end": 58, "seq": "INPSTGY"},
#   ...
# }

Numbering Schemes:

Scheme	VH CDR1	VH CDR2	VH CDR3	Best For
Chothia	26-32	52-56	95-102	Structural analysis
Kabat	31-35	50-65	95-102	Sequence-based work
IMGT	27-38	56-65	105-117	Standardized analysis

2. Human Framework Matching

Identify optimal human germline templates:

# Match against human germline database
matches = humanizer.find_human_frameworks(
    vh_framework=analysis.vh_frameworks,
    vl_framework=analysis.vl_frameworks,
    top_n=5,
    criteria=["homology", "canonical_structure", "vernier_similarity"]
)

# Evaluate each candidate
for match in matches:
    print(f"Template: {match.germline_genes}")
    print(f"Homology: {match.homology:.2%}")
    print(f"Vernier Score: {match.vernier_score:.1f}")
    print(f"Risk Level: {match.immunogenicity_risk}")

Matching Criteria:

Sequence Homology: Percent identity to human germline
Canonical Structure: Loop conformation compatibility
Vernier Region: Framework residues contacting CDRs
Interface Residues: Packing interactions with CDRs

3. Humanization Scoring

Assess immunogenicity risk of candidates:

# Score humanization candidates
scores = humanizer.score_candidates(
    murine_antibody=analysis,
    human_templates=matches,
    scoring_methods=["t20", "h_score", "germline_deviation", "paratope_diversity"]
)

# Rank by overall score
ranked = scores.rank_by_composite_score(
    weights={"humanness": 0.4, "binding_retention": 0.4, "developability": 0.2}
)

Scoring Methods:

Method	Description	Target
T20 Score	20-mer peptide humanization	>80% human
H-Score	Hummerblind germline distance	\x3C15 mutations
Paratope Diversity	CDR germline gene diversity	Low diversity
Developability	Aggregation/pH stability prediction	High score

4. Back-Mutation Prediction

Identify critical residues to retain from murine framework:

# Predict back-mutations
back_mutations = humanizer.predict_back_mutations(
    murine_vh=analysis.vh_sequence,
    human_vh=matches[0].human_template,
    cdr_regions=analysis.cdr_regions,
    rationale_required=True
)

# Output shows position-specific recommendations
for mutation in back_mutations:
    print(f"Position {mutation.position}: {mutation.human_aa} → {mutation.murine_aa}")
    print(f"Rationale: {mutation.reason}")  # e.g., "Vernier region contact"
    print(f"Priority: {mutation.priority}")  # Critical/Important/Optional

Critical Residue Classes:

Vernier Positions: Framework residues contacting CDRs (VH 24, 71, 94)
Interface Packs: Residue packing between VH and VL
Canonical Anchors: Cysteines and conserved framework positions
** Buried Positions**: Core packing residues affecting stability

Common Patterns

Pattern 1: Standard Therapeutic Humanization

Scenario: Convert murine anti-tumor antibody to therapeutic candidate.

# Humanize single antibody
python scripts/main.py \
  --vh "QVQLQQSGPELVKPGASVKISCKAS..." \
  --vl "DIQMTQSPSSLSASVGDRVTITCRAS..." \
  --name "Anti-HER2-Murine-1" \
  --scheme chothia \
  --top-n 3 \
  --output humanization_report.json

# Review top candidates
cat humanization_report.json | jq '.candidates[0]'

Workflow:

Input murine VH/VL sequences
Identify CDRs using Chothia scheme
Match to human germline database
Score top 3 candidates
Identify required back-mutations
Output humanized sequences

Pattern 2: Batch Humanization Screening

Scenario: Screen multiple murine clones from hybridoma campaign.

# Process multiple antibodies
antibodies = [
    {"name": "Clone-A", "vh": "...", "vl": "..."},
    {"name": "Clone-B", "vh": "...", "vl": "..."},
    {"name": "Clone-C", "vh": "...", "vl": "..."}
]

results = humanizer.batch_humanize(
    antibodies=antibodies,
    ranking_criteria="composite_score",
    min_humanness=0.85
)

# Rank by developability
ranked = results.rank_by(criteria=["humanness", "binding_retention", "stability"])

Selection Criteria:

Highest humanness score (>85%)
Fewest back-mutations required (\x3C6)
Low immunogenicity risk
Good developability profile

Pattern 3: Framework Template Comparison

Scenario: Compare different humanization strategies for lead candidate.

# Test multiple framework combinations
strategies = [
    {"vh": "IGHV1-2*02", "vl": "IGKV1-12*01", "name": "Template-A"},
    {"vh": "IGHV3-23*01", "vl": "IGKV3-20*01", "name": "Template-B"},
    {"vh": "IGHV4-34*01", "vl": "IGKV1-5*01", "name": "Template-C"}
]

comparison = humanizer.compare_strategies(
    murine_antibody=analysis,
    strategies=strategies,
    metrics=["homology", "back_mutations", "immunogenicity", "paratope_structure"]
)

comparison.generate_report("framework_comparison.pdf")

Comparison Metrics:

Sequence identity to human germline
Number and location of back-mutations
Predicted immunogenicity risk
CDR conformation preservation

Pattern 4: Intellectual Property Analysis

Scenario: Assess humanization for patent landscape analysis.

# Generate humanized variants
python scripts/main.py \
  --input murine_lead.json \
  --generate-variants 10 \
  --include-back-mutations \
  --output variants_for_ip.json

# Check novelty against patent databases
python scripts/patent_check.py \
  --sequences variants_for_ip.json \
  --databases [USPTO, EPO, WIPO] \
  --output novelty_report.pdf

IP Considerations:

Human framework combinations may be patented
CDR sequences determine antigen specificity
Back-mutation positions may be prior art
Document humanization rationale for filings

Complete Workflow Example

From murine hybridoma to therapeutic candidate:

# Step 1: Sequence analysis and CDR identification
python scripts/main.py \
  --vh $VH_SEQUENCE \
  --vl $VL_SEQUENCE \
  --scheme chothia \
  --output step1_analysis.json

# Step 2: Find best human frameworks
python scripts/main.py \
  --input step1_analysis.json \
  --find-frameworks \
  --top-n 5 \
  --output step2_frameworks.json

# Step 3: Score and rank candidates
python scripts/main.py \
  --input step2_frameworks.json \
  --score-candidates \
  --include-immunogenicity \
  --output step3_scored.json

# Step 4: Predict back-mutations
python scripts/main.py \
  --input step3_scored.json \
  --predict-back-mutations \
  --rationale \
  --output step4_backmutations.json

# Step 5: Generate final humanized sequences
python scripts/main.py \
  --input step4_backmutations.json \
  --generate-sequences \
  --format fasta \
  --output humanized_antibody.fasta

Python API:

from scripts.humanizer import AntibodyHumanizer
from scripts.scoring import HumanizationScorer
from scripts.backmutation import BackMutationPredictor

# Initialize pipeline
humanizer = AntibodyHumanizer()
scorer = HumanizationScorer()
bm_predictor = BackMutationPredictor()

# Step 1: Parse and analyze
antibody = humanizer.analyze_sequence(
    vh_sequence=murine_vh,
    vl_sequence=murine_vl,
    scheme="chothia"
)

# Step 2: Find human frameworks
candidates = humanizer.find_human_frameworks(
    antibody,
    top_n=5
)

# Step 3: Score candidates
for candidate in candidates:
    scores = scorer.calculate_scores(
        murine=antibody,
        humanized=candidate
    )
    candidate.composite_score = scores.weighted_score()

# Step 4: Select best and predict back-mutations
best = max(candidates, key=lambda x: x.composite_score)
back_mutations = bm_predictor.predict(
    murine=antibody,
    human_template=best
)

# Step 5: Generate final sequence
final_sequence = humanizer.generate_humanized_sequence(
    template=best,
    back_mutations=back_mutations,
    cdrs=antibody.cdr_regions
)

print(f"Humanized antibody generated:")
print(f"- Humanness: {best.humanness:.1%}")
print(f"- Back-mutations: {len(back_mutations)}")
print(f"- Risk level: {best.immunogenicity_risk}")

Quality Checklist

Input Quality:

VH and VL sequences complete (110-130 aa typical)
No ambiguous residues (B, Z, X)
Signal peptide removed
Constant region removed (variable region only)

Humanization Assessment:

CDR boundaries correctly identified
Human framework homology >80%
T20 score >75 (high humanness)
Vernier positions analyzed for back-mutations
Interface residues checked for packing

Output Validation:

Humanized sequence valid (no stop codons)
CDRs preserved exactly
Framework length conserved
Back-mutations documented with rationale
CRITICAL: Immunogenicity risk assessed

Before Experimental Work:

CRITICAL: Top 2-3 candidates selected for expression
Binding affinity to be tested (ELISA/Biacore)
Stability assessed (thermal/aggregation)
Immunogenicity in vitro assays planned

Common Pitfalls

Sequence Issues:

❌ Incomplete sequences → Missing framework regions
- ✅ Ensure full VH/VL variable domains provided
❌ Wrong numbering scheme → CDR boundaries incorrect
- ✅ Verify scheme matches experimental data source
❌ Non-standard residues → Unusual amino acids
- ✅ Clean sequences; remove signal peptides

Design Issues:

❌ Over-humanization → Losing antigen binding
- ✅ Don't exceed 85-90% humanness; retain critical residues
❌ Ignoring back-mutations → Assuming 100% human framework works
- ✅ Always predict and test back-mutations
❌ Single candidate only → No backup options
- ✅ Generate 2-3 candidates with different frameworks

Experimental Issues:

❌ Skipping binding validation → Assuming in silico = in vivo
- ✅ Always confirm antigen binding experimentally
❌ Ignoring developability → Aggregation or instability
- ✅ Check for problematic residues (unpaired cysteines, hydrophobic patches)

References

Available in references/ directory:

imgt_germline_database.md - Human germline gene reference sequences
cdr_numbering_schemes.md - Kabat, Chothia, IMGT comparison
humanization_case_studies.md - Successful therapeutic examples
vernier_positions_guide.md - Critical framework residues
immunogenicity_assessment.md - T-cell epitope prediction methods
patent_landscape.md - Humanization IP considerations

Scripts

Located in scripts/ directory:

main.py - CLI interface for humanization
humanizer.py - Core humanization engine
cdr_parser.py - CDR identification and numbering
framework_matcher.py - Human germline database search
scoring.py - Humanization quality assessment
backmutation.py - Critical residue prediction
batch_processor.py - Multiple antibody screening
structure_predictor.py - CDR conformation analysis

Limitations

Binding Prediction: Cannot accurately predict impact on antigen affinity
Developability: Limited prediction of aggregation or stability issues
Immunogenicity: In silico T-cell epitope prediction has false positives
Non-Standard Antibodies: May not handle camelid, shark, or engineered scaffolds
Experimental Validation Required: All predictions must be confirmed in vitro/vivo
Intellectual Property: Does not check for existing patent claims on sequences

Parameters

Parameter	Type	Default	Required	Description
`--vh`	string	-	No	Murine VH sequence (amino acids)
`--vl`	string	-	No	Murine VL sequence (amino acids)
`--input`, `-i`	string	-	No	Input JSON file path
`--name`, `-n`	string	""	No	Antibody name
`--output`, `-o`	string	-	No	Output file path
`--format`, `-f`	string	json	No	Output format (json, fasta, csv)
`--scheme`, `-s`	string	chothia	No	Numbering scheme (kabat, chothia, imgt)
`--top-n`	int	3	No	Number of best candidates to return

Usage

Basic Usage

# Humanize with direct sequence input
python scripts/main.py --vh "QVQLQQSGPELVKPGASVKMSCKAS..." --vl "DIQMTQSPSSLSASVGDRVTITC..." --name "MyAntibody"

# Use JSON input file
python scripts/main.py --input antibody.json --output results.json

# Use IMGT numbering scheme
python scripts/main.py --vh "SEQUENCE" --vl "SEQUENCE" --scheme imgt

Input JSON Format

{
  "vh_sequence": "QVQLQQSGPELVKPGASVKMSCKAS...",
  "vl_sequence": "DIQMTQSPSSLSASVGDRVTITC...",
  "name": "MyAntibody",
  "scheme": "chothia"
}

Risk Assessment

Risk Indicator	Assessment	Level
Code Execution	Python script executed locally	Medium
Network Access	No external API calls	Low
File System Access	Read input files, write output files	Low
Instruction Tampering	Standard prompt guidelines	Low
Data Exposure	Output may contain proprietary sequences	Medium

Security Checklist

No hardcoded credentials or API keys
No unauthorized file system access (../)
Input validation for sequences
Prompt injection protections in place
Error messages sanitized
Output directory restricted to workspace
Script execution in sandboxed environment

Prerequisites

# Python 3.7+
# No external packages required (uses standard library)

Evaluation Criteria

Success Metrics

Successfully parses antibody sequences
Identifies CDR regions correctly
Matches human germline frameworks
Predicts back-mutations
Generates valid humanized sequences

Test Cases

Basic Functionality: Humanize valid VH/VL sequences → Returns candidates
Edge Case: Invalid sequence characters → Graceful error message
File Input: Process JSON input → Correctly parses and outputs

Lifecycle Status

Current Stage: Draft
Next Review Date: 2026-03-06
Known Issues: None
Planned Improvements:
- Add T20 score database integration
- Support for camelid and shark antibodies
- Structure-based CDR prediction

🔬 Critical Note: Computational humanization is a design tool, not a substitute for experimental validation. Always express and test humanized candidates for binding affinity, specificity, stability, and immunogenicity before therapeutic development.

Usage Guidance

This skill appears internally consistent and runs local sequence-processing logic only, but take these precautions before installing or using it: 1) Review the full scripts/main.py yourself (or have a trusted developer do so) before running, especially if you will provide confidential sequences. 2) Run the tool in an isolated environment (container/VM) if you are unsure about provenance—the registry metadata shows no homepage and the source is 'unknown'. 3) Do not treat outputs as experimentally validated: humanization recommendations require laboratory confirmation and regulatory review before therapeutic use. 4) Ensure using the tool complies with your institutional/ethical guidelines for handling potentially sensitive biological sequence data. 5) Note the requirements.txt entries (dataclasses, enum) are standard library modules for Python 3.7+—no external packages appear required.

Capability Analysis

Type: OpenClaw Skill Name: antibody-humanizer Version: 0.1.0 The antibody-humanizer skill is a specialized bioinformatics tool designed for humanizing murine antibody sequences. The core logic in scripts/main.py implements standard sequence analysis techniques, including CDR extraction and framework matching against a hardcoded human germline database using Kabat, Chothia, and IMGT numbering schemes. The code is self-contained, relies only on the Python standard library, and lacks any high-risk behaviors such as network access, shell execution, or sensitive data exfiltration. The SKILL.md documentation provides clear, task-aligned instructions for the AI agent without any evidence of prompt injection or malicious intent.

Capability Assessment

✓ Purpose & Capability

The name/description (humanize murine antibodies via CDR grafting and framework optimization) matches the included instructions and the Python implementation: sequence validation, CDR extraction, framework matching, scoring, and back-mutation prediction. No surprising binaries or unrelated capabilities are requested.

✓ Instruction Scope

SKILL.md instructs local analysis of sequences and example API usage consistent with the stated purpose. The instructions do not direct the agent to read system credentials, configuration files, or exfiltrate data to external endpoints. The allowed-tools list includes Read/Write/Bash/Edit (expected for a local processing skill) but the runtime instructions themselves stay within the domain of sequence processing.

✓ Install Mechanism

No install spec is provided (instruction-only + included script), so nothing is automatically downloaded or executed during install. A single Python script is included; it uses only standard libraries and local data structures. This is low-install risk. Note: requirements.txt lists dataclasses and enum which are part of modern Python stdlib and not evidence of third-party download.

✓ Credentials

The skill declares no required environment variables, credentials, or config paths. The code does not import networking libraries or attempt to read environment variables—requested privileges are minimal and proportionate to processing sequences locally.

✓ Persistence & Privilege

Skill is not marked always:true and does not request persistent system presence or modify other skill/config files. Autonomous invocation is allowed (platform default) but not combined with other elevated privileges.

Version History

v0.1.0

antibody-humanizer v0.1.0 - Initial release with core antibody humanization functionalities. - Supports CDR identification using Kabat, Chothia, and IMGT schemes. - Matches murine antibody frameworks to optimal human germline templates. - Predicts essential back-mutations for antigen binding retention. - Provides multi-factor humanization scoring and immunogenicity assessment. - Enables batch processing of multiple antibody candidates.

Metadata

Slug antibody-humanizer

Version 0.1.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Antibody Humanizer?

Humanize murine antibody sequences using CDR grafting and framework optimization to reduce immunogenicity while preserving antigen binding. Predicts optimal... It is an AI Agent Skill for Claude Code / OpenClaw, with 167 downloads so far.

How do I install Antibody Humanizer?

Run "/install antibody-humanizer" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Antibody Humanizer free?

Yes, Antibody Humanizer is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Antibody Humanizer support?

Antibody Humanizer is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Antibody Humanizer?

It is built and maintained by AIpoch (@aipoch-ai); the current version is v0.1.0.

More Skills

Antibody Humanizer

Antibody Humanizer

Overview

When to Use

Core Capabilities

1. CDR Region Identification

2. Human Framework Matching

3. Humanization Scoring

4. Back-Mutation Prediction

Common Patterns

Pattern 1: Standard Therapeutic Humanization

Pattern 2: Batch Humanization Screening

Pattern 3: Framework Template Comparison

Pattern 4: Intellectual Property Analysis

Complete Workflow Example

Quality Checklist

Common Pitfalls

References

Scripts

Limitations

Parameters

Usage

Basic Usage

Input JSON Format

Risk Assessment

Security Checklist

Prerequisites

Evaluation Criteria

Success Metrics

Test Cases

Lifecycle Status

What is Antibody Humanizer?

How do I install Antibody Humanizer?

Is Antibody Humanizer free?

Which platforms does Antibody Humanizer support?

Who created Antibody Humanizer?

💬 Comments