Description

Query KEGG database for drug information, pathway analysis, and disease-drug-target discovery. Use this skill when: (1) Looking up drug information including...

README (SKILL.md)

KEGG Query

Name: KEGG Query
Author: hollyya

Query the KEGG (Kyoto Encyclopedia of Genes and Genomes) database for comprehensive biomedical information.

When to Use

Drug Lookup: Retrieve drug efficacy, targets, metabolism, drug-drug interactions
Pathway Analysis: Get pathway genes, compounds, modules, and related pathways
Disease Discovery: Find disease-associated drugs, genes, and therapeutic targets

Workflow

Use Case 1: Drug Information Lookup

Fetch comprehensive drug information from KEGG DRUG database.

from scripts.kegg_api import kegg_find, kegg_get, parse_drug_entry

# Step 1: Search for drug by name
results = kegg_find("drug", "aspirin")
# Returns: [("dr:D00109", "Aspirin (JP18/USP); Acetylsalicylic acid; ...")]

# Step 2: Get full entry
drug_id = "dr:D00109"  # or just "D00109"
entry = kegg_get(drug_id)
drug_info = parse_drug_entry(entry)

Output includes: Names, formula, efficacy, diseases, targets, pathways, metabolism, DDI.

See examples/drug_lookup.py for complete implementation.

Use Case 2: Pathway Analysis

Analyze KEGG pathways to retrieve genes, compounds, and modules.

from scripts.kegg_api import kegg_get, parse_pathway_entry

# Get pathway by ID (e.g., hsa00010 for Glycolysis)
entry = kegg_get("hsa00010")
pathway = parse_pathway_entry(entry)

# Access parsed data
print(f"Genes: {len(pathway['genes'])}")      # 50+ genes
print(f"Compounds: {len(pathway['compounds'])}")  # 30+ compounds

Output includes: Description, genes with KO/EC annotations, compounds, modules, related pathways.

See examples/pathway_analysis.py for complete implementation.

Use Case 3: Disease-Drug-Target Discovery

Discover therapeutic targets and drugs for diseases.

from scripts.kegg_api import kegg_find, kegg_get, parse_disease_entry

# Step 1: Search for disease
results = kegg_find("disease", "diabetes")
# Returns multiple matches including Type 2 diabetes (H00409)

# Step 2: Get disease details
entry = kegg_get("ds:H00409")
disease = parse_disease_entry(entry)

# Access drugs and targets
print(f"Drugs: {len(disease['drugs'])}")    # 60+ drugs
print(f"Genes: {len(disease['genes'])}")    # 20+ genes

Output includes: Description, category, associated genes, pathways, approved drugs.

See examples/disease_discovery.py for complete implementation.

Expected Outputs

Drug Entry (JSON)

{
  "id": "D00109",
  "names": ["Aspirin", "Acetylsalicylic acid"],
  "formula": "C9H8O4",
  "efficacy": ["Analgesic", "Anti-inflammatory", "Antipyretic", "COX inhibitor"],
  "targets": [
    {"gene": "PTGS1", "uniprot": "P23219", "ko": "K00509"},
    {"gene": "PTGS2", "uniprot": "P35354", "ko": "K11987"}
  ],
  "pathways": ["hsa00590", "hsa04611"],
  "diseases": ["Myocardial infarction", "Unstable angina"]
}

Pathway Entry (JSON)

{
  "id": "hsa00010",
  "name": "Glycolysis / Gluconeogenesis",
  "organism": "Homo sapiens",
  "description": "Glycolysis is the process...",
  "genes": [
    {"id": "10327", "symbol": "AKR1A1", "ko": "K00002", "ec": "1.1.1.2"},
    {"id": "3939", "symbol": "LDHA", "ko": "K00016", "ec": "1.1.1.27"}
  ],
  "compounds": [
    {"id": "C00031", "name": "D-Glucose"},
    {"id": "C00022", "name": "Pyruvate"}
  ],
  "modules": ["hsa_M00001", "hsa_M00002", "hsa_M00003"]
}

Disease Entry (JSON)

{
  "id": "H00409",
  "name": "Type 2 diabetes mellitus",
  "category": "Endocrine and metabolic disease",
  "description": "T2DM is characterized by chronic hyperglycemia...",
  "genes": [
    {"symbol": "CAPN10", "ko": "K08579"},
    {"symbol": "TCF7L2", "ko": "K04491"}
  ],
  "drugs": [
    {"id": "D00944", "name": "Metformin hydrochloride"},
    {"id": "D06404", "name": "Liraglutide"}
  ],
  "pathways": ["hsa04930", "hsa04911"]
}

KEGG API Reference

Operation	URL Pattern	Description
`info`	`/info/{database}`	Database statistics
`list`	`/list/{database}`	List all entries
`find`	`/find/{database}/{query}`	Search by keyword
`get`	`/get/{entry_id}`	Retrieve entry
`link`	`/link/{target}/{source}`	Cross-references
`conv`	`/conv/{target}/{source}`	ID conversion

Key Databases: pathway, compound, drug, disease, genes, enzyme, ko

Entry ID Formats:

Drug: D00009 or dr:D00009
Compound: C00031 or cpd:C00031
Pathway: hsa00010 (organism-specific) or map00010 (reference)
Disease: H00409 or ds:H00409
Gene: hsa:5742 (organism:gene_id)

Error Handling

Error	Solution
Entry not found	Verify ID format (e.g., D00109, not aspirin)
Multiple matches	Use `kegg_find` first to get exact ID
Timeout	Reduce query complexity, retry with delay
Rate limited	KEGG allows ~10 requests/second; add delays

Integration with OpenBioMed

from open_biomed.data import Molecule, Protein
from open_biomed.tools.tool_registry import TOOLS

# Convert KEGG compound to Molecule
compound_entry = kegg_get("cpd:C00031")  # Glucose
mol_file = kegg_get("C00031", option="mol")  # Get MOL format
# molecule = Molecule.from_mol_file(mol_file)

# Get protein from KEGG gene
gene_entry = kegg_get("hsa:5742")  # PTGS1
# Use UniProt ID to fetch protein
protein_tool = TOOLS["protein_uniprot_request"]
proteins, _ = protein_tool.run(accession="P23219")

References

references/kegg_databases.md - Complete database listing and ID formats
references/kegg_api_operations.md - Detailed API operation reference
KEGG API Documentation: https://www.kegg.jp/kegg/rest/keggapi.html

Usage Guidance

This skill appears coherent and does what it claims: example Python code calls the public KEGG REST API and parses results. There are no requested secrets or risky install steps. Consider: (1) queries are sent to rest.kegg.jp — avoid sending sensitive or personally identifiable data to any external service, (2) respect KEGG rate limits for bulk queries (examples mention batching/delays), and (3) review example code (uses the 'requests' library) before running in a production environment. If you require offline or private datasets, this skill will not provide that.

Capability Analysis

Type: OpenClaw Skill Name: kegg-query Version: 1.0.0 The kegg-query skill bundle provides a legitimate interface for querying the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The included Python scripts (e.g., drug_lookup.py, pathway_analysis.py) use the official KEGG REST API (rest.kegg.jp) to retrieve and parse biomedical data. No evidence of data exfiltration, malicious execution, or harmful prompt injection was found; the code and instructions are strictly aligned with the stated purpose of bioinformatics research.

Capability Assessment

✓ Purpose & Capability

Name/description (KEGG Query) match the provided code and SKILL.md: all examples implement searches, GETs, and parsing of KEGG entries via the official REST API (https://rest.kegg.jp). There are no unrelated credentials, binaries, or config paths required.

✓ Instruction Scope

SKILL.md and the example scripts only instruct network calls to the KEGG REST endpoints and local parsing/formatting. They do not read local secrets, scan unrelated files, or send data to endpoints other than rest.kegg.jp. The only external integration mentioned (OpenBioMed) is optional/example usage and does not introduce extra required permissions.

✓ Install Mechanism

Instruction-only skill with example Python scripts; there is no install specification and no downloads of third-party code. The examples use the standard 'requests' library, which is expected for HTTP access.

✓ Credentials

No required environment variables, credentials, or config paths are declared or used. The skill does perform outbound HTTP requests to KEGG, which is proportional and necessary for its function.

✓ Persistence & Privilege

Skill is not marked always:true and does not request persistent system presence or modify other skills. Default autonomous invocation is allowed (platform default) but this skill's capabilities are limited to KEGG queries.

Version History

v1.0.0

Initial release of kegg-query: - Query KEGG for drug information, pathway analysis, and disease-drug-target discovery. - Supports drug lookup (efficacy, targets, metabolism, interactions), pathway gene/compound analysis, and disease associations. - Returns structured, rich JSON outputs for drugs, pathways, and diseases. - Includes robust workflow examples and API usage details. - Provides integration guidelines with OpenBioMed tools. - Includes KEGG API reference and troubleshooting for common errors.

Metadata

Slug kegg-query

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is KEGG Query?

Query KEGG database for drug information, pathway analysis, and disease-drug-target discovery. Use this skill when: (1) Looking up drug information including... It is an AI Agent Skill for Claude Code / OpenClaw, with 101 downloads so far.

How do I install KEGG Query?

Run "/install kegg-query" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is KEGG Query free?

Yes, KEGG Query is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does KEGG Query support?

KEGG Query is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created KEGG Query?

It is built and maintained by HollyYa (@hollyya); the current version is v1.0.0.

More Skills

KEGG Query

KEGG Query

When to Use

Workflow

Use Case 1: Drug Information Lookup

Use Case 2: Pathway Analysis

Use Case 3: Disease-Drug-Target Discovery

Expected Outputs

Drug Entry (JSON)

Pathway Entry (JSON)

Disease Entry (JSON)

KEGG API Reference

Error Handling

Integration with OpenBioMed

References

What is KEGG Query?

How do I install KEGG Query?

Is KEGG Query free?

Which platforms does KEGG Query support?

Who created KEGG Query?

💬 Comments