← 返回 Skills 市场

Alibabacloud Polardbx Sql

Name: Alibabacloud Polardbx Sql
Author: sdk-team

作者 alibabacloud-skills-team · GitHub ↗ · v0.0.2 · MIT-0

cross-platform ✓ 安全检测通过

132

总下载

当前安装

版本数

在 OpenClaw 中安装

/install alibabacloud-polardbx-sql

功能描述

Design partition schemes, select partition keys, create GSI, and write SQL for PolarDB-X 2.0 Enterprise Edition AUTO mode databases, handling PolarDB-X vs My...

使用说明 (SKILL.md)

PolarDB-X SQL (MySQL Compatibility Focus)

Write, review, and adapt SQL for PolarDB-X 2.0 Enterprise Edition (Distributed Edition) AUTO mode databases, avoiding the "runs on MySQL but fails on PolarDB-X" problem.

Architecture: PolarDB-X 2.0 Enterprise Edition (CN compute nodes + DN storage nodes + GMS metadata service + CDC log nodes) + AUTO mode database

Scope:

PolarDB-X 2.0 Enterprise Edition (also known as Distributed Edition) + AUTO mode database

Not applicable to:

PolarDB-X 1.0 (DRDS 1.0)
PolarDB-X 2.0 Standard Edition
PolarDB-X 2.0 Enterprise Edition DRDS mode databases

Key difference between AUTO mode and DRDS mode: AUTO mode uses MySQL-compatible PARTITION BY syntax to define partitions, while DRDS mode uses the legacy dbpartition/tbpartition syntax. Verify the database mode with:

SHOW CREATE DATABASE db_name;
-- Output containing MODE = 'auto' indicates AUTO mode

Installation

Connect to a PolarDB-X instance via a MySQL-compatible client:

mysql -h \x3Chost> -P \x3Cport> -u \x3Cuser> -p\x3Cpassword> -D \x3Cdatabase>

Supported clients: MySQL CLI, MySQL Workbench, DBeaver, Navicat, or any MySQL-compatible client.

Parameter Confirmation

IMPORTANT: Parameter Confirmation — Before executing any command or API call, ALL user-customizable parameters (e.g., RegionId, instance names, CIDR blocks, passwords, domain names, resource specifications, etc.) MUST be confirmed with the user. Do NOT assume or use default values without explicit user approval.

Configurable parameters for this skill:

Parameter Name	Required/Optional	Description	Default Value
host	Required	PolarDB-X instance connection address	None
port	Required	PolarDB-X instance port	3306
user	Required	Database username	None
password	Required	Database password	None
database	Required	Target database name	None

Core Workflow (Follow each time)

Confirm the target engine and version:
- Run SELECT VERSION(); to determine the instance type:
  - Result contains TDDL with version > 5.4.12 (e.g., 5.7.25-TDDL-5.4.19-20251031) -> 2.0 Enterprise Edition (Distributed Edition), this skill applies. Parse the Enterprise Edition version number (e.g., 5.4.19).
  - Result contains TDDL with version \x3C= 5.4.12 (e.g., 5.6.29-TDDL-5.4.12-16327949) -> DRDS 1.0. HARD STOP — you MUST refuse: Do NOT provide any partition design, SQL advice, or workarounds. Respond only with: "This skill covers PolarDB-X 2.0 Enterprise Edition AUTO mode only. Your instance is DRDS 1.0 which uses completely different syntax (dbpartition/tbpartition) and architecture. Please consult DRDS 1.0 documentation or upgrade to PolarDB-X 2.0." Then stop. Do NOT continue even if the user insists.
  - Result contains X-Cluster (e.g., 8.0.32-X-Cluster-8.4.20-20251017) -> 2.0 Standard Edition. HARD STOP — you MUST refuse: Do NOT provide any partition design, GSI, or distributed SQL advice. Respond only with: "Your instance is PolarDB-X 2.0 Standard Edition (100% MySQL compatible, no distributed partitioning). Please use the polardbx-standard skill instead." Then stop. Do NOT continue even if the user insists.
- After confirming 2.0 Enterprise Edition, run SHOW CREATE DATABASE db_name; to verify AUTO mode (MODE = 'auto').
- The version number affects feature availability (e.g., NEW SEQUENCE requires 5.4.14+, CCI requires a newer version).
Determine the table type:
- Small or dictionary tables that are frequently joined with partitioned tables -> Broadcast table BROADCAST (fully replicated to every DN, enables local JOIN pushdown). This is the recommended choice when JOINs are involved.
- Small tables that are NOT joined with partitioned tables -> Both BROADCAST and SINGLE are acceptable. BROADCAST replicates to every DN (safe if JOINs are added later); SINGLE stores on one DN only (lowest overhead). Either is fine — do NOT insist on one over the other.
- Otherwise -> Partitioned table (default), choose appropriate partition key and strategy.
Partition scheme design (for partitioned tables):
- Collect SQL access pattern data (prerequisite — always recommend collecting data before making the final partition key decision): prefer SQL Insight (most accurate); when unavailable, use slow query logs + application code analysis, or have the business team provide SQL patterns as alternatives. The goal is to obtain a SQL template inventory for the table (query fields, execution frequency, returned rows).
- Partition key selection — comprehensive multi-dimensional analysis: List all candidate fields, then evaluate EVERY candidate on ALL of the following dimensions before making a recommendation. Do NOT recommend based on a single dimension alone:
  - Equality query ratio: proportion of SQL templates where this field appears as an equality condition.
  - Cardinality: number of distinct values; higher means more even data distribution across partitions.
  - Hotspot risk: whether a few values dominate a large portion of data (e.g., in an order table, some buyer_ids may account for millions of rows while others have few).
  - Primary key / unique key status: PKs/UKs inherently have the highest cardinality and zero hotspot risk.
  - Semantic analysis: Infer query patterns from table type and field meaning. For example, order_id in an order table is certainly queried frequently (order detail lookups, status checks, payment callbacks), even if the user only mentions buyer_id queries. The best partition key is the candidate that scores well across all dimensions combined. High-frequency queries on non-partition-key fields can be optimized by creating a GSI. Classic example: order table → order_id (PK, highest cardinality, zero hotspot, semantically high query frequency) as partition key + GSI on buyer_id (high buyer-dimension query ratio, but has potential skew risk as some buyers generate far more orders).
- GSI selection: Decide strategy based on write volume — tables with low write volume can freely create GSIs; create GSIs for high-frequency non-partition-key query fields; fields with low cardinality and time fields are unsuitable for GSI; fields that always appear combined with other fields and never appear alone don't need standalone GSIs. GSI types: regular GSI for few returned rows, Clustered GSI for one-to-many, UGSI for unique constraints. GSI syntax must include PARTITION BY KEY(...) PARTITIONS N — see gsi.md for full syntax.
- Partition algorithm: ~90% of workloads use single-level HASH/KEY; order-type multi-dimensional queries use CO_HASH; time-based data cleanup uses HASH+RANGE; multi-tenant uses LIST+HASH. For single column, HASH and KEY are equivalent.
- Partition count: 256 suits the vast majority of workloads; should be several times the number of DN nodes; keep single partition under 100 million rows.
- Migration workflow (three-step method for single table to partitioned table): (1) First convert to a partitioned table with 1 partition (preserving uniqueness) -> (2) Create required GSI/UGSI -> (3) Change to the target partition count. See partition-design-best-practice.md for details.
Use PolarDB-X safe defaults when generating SQL:
- Avoid unsupported MySQL features (stored procedures/triggers/EVENTs/SPATIAL, etc.).
- Use KEY or HASH partitioning instead of MySQL's AUTO_INCREMENT primary key write hotspot.
- When non-partition-key queries are needed, consider creating Global Secondary Indexes (GSI).
If the user provides MySQL SQL, perform compatibility checks:
- Replace unsupported features and provide PolarDB-X alternatives.
- Clearly mark behavioral differences and version requirements.
When SQL is slow or errors occur, use PolarDB-X diagnostic tools:
- EXPLAIN to view the logical execution plan.
- EXPLAIN EXECUTE to view the physical execution plan pushed down to DN.
- EXPLAIN SHARDING to view shard scan details and check for full-shard scans.
- EXPLAIN ANALYZE to actually execute and collect runtime statistics.

Key Differences Quick Reference

Three table types: Single table (SINGLE), Broadcast table (BROADCAST), Partitioned table (default); choose based on data volume and access patterns.
Partitioned tables: Support KEY/HASH/RANGE/LIST/RANGE COLUMNS/LIST COLUMNS/CO_HASH + secondary partitions (49 combinations).
Primary keys and unique keys: Classified as Global (globally unique) or Local (unique within partition); single/broadcast/auto-partitioned tables are always Global; manual partitioned tables are Global when partition columns are a subset of PK/UK columns, otherwise Local (risk of data duplication and DDL failure). Key principle: prefer choosing partition keys FROM existing PK/UK columns to naturally guarantee global uniqueness — do NOT modify the user's existing primary key definition to add partition columns.
Global Secondary Index GSI: Solves full-shard scan issues for non-partition-key queries, supports GSI / UGSI / Clustered GSI types. CRITICAL: GSI must specify its own PARTITION BY clause — it is an independently partitioned table, not a regular MySQL index. Correct syntax:
```
-- ✅ Correct: GSI with PARTITION BY clause
GLOBAL INDEX g_i_seller(seller_id) PARTITION BY KEY(seller_id) PARTITIONS 16
CLUSTERED INDEX cg_i_buyer(buyer_id) PARTITION BY KEY(buyer_id) PARTITIONS 16
-- ❌ Wrong: Missing PARTITION BY (this is NOT MySQL INDEX syntax)
GLOBAL INDEX gsi_seller(seller_id)
```
Classic partition design — order table: Candidates are order_id (PK) and buyer_id. Comprehensive analysis: order_id has the highest cardinality (unique per row), zero hotspot risk, PK status, and semantically high query frequency (order detail/status/payment lookups); buyer_id has high buyer-dimension query ratio but potential distribution skew (some buyers generate far more orders). Conclusion: order_id as partition key + Clustered GSI on buyer_id.
Clustered Columnar Index CCI: Row-column hybrid storage, accelerates OLAP analytical queries via CLUSTERED COLUMNAR INDEX.
Sequence: Globally unique sequence, default type is NEW SEQUENCE (5.4.14+), distributed alternative to AUTO_INCREMENT.
Distributed transactions: Based on TSO global clock + MVCC + 2PC, strong consistency by default; single-shard transactions automatically optimized to local transactions.
Table groups: Tables with the same partition rules bound to the same table group, ensuring JOIN computation pushdown to avoid cross-shard data shuffling.
TTL tables: Automatic expiration and cleanup of cold data based on time columns, can work with CCI for hot/cold data separation.
Unsupported MySQL features: Stored procedures/triggers/EVENTs/SPATIAL/GEOMETRY/LOAD XML/HANDLER, etc.
STRAIGHT_JOIN / NATURAL JOIN not supported: Use standard JOIN syntax instead.
:= assignment operator not supported: Move logic to the application layer.
Subqueries not supported in HAVING/JOIN ON clauses: Rewrite subqueries as JOINs or CTEs.

Best Practices

Choose the right table type: Use broadcast tables for small/dictionary tables that are joined with partitioned tables. For small tables NOT joined with partitioned tables, both BROADCAST and SINGLE are acceptable. Use partitioned tables for everything else.
Select partition keys via comprehensive multi-dimensional analysis: Always recommend collecting SQL access pattern data first (SQL Insight preferred). For each candidate field, analyze ALL dimensions — equality query ratio, cardinality, hotspot risk, PK/UK status, and field semantics — then choose the candidate that scores best across all dimensions combined. Never decide based on a single dimension alone. Remember to infer query patterns from table/field semantics (e.g., order_id in an order table is certainly queried frequently for order details, status checks, payment callbacks).
Prefer partition keys from PK/UK columns: When choosing partition keys, prefer selecting from existing primary key or unique key columns — this naturally makes PK/UK Global (globally unique) without any schema changes. Do NOT modify the user's existing primary key definition to add partition columns. When PK columns are not suitable as partition keys (e.g., auto-increment id with no business meaning), it is perfectly valid to choose other business columns as partition keys — in this case the PK becomes Local (unique within partition only); explain the Local PK risks to the user and ensure the auto-increment/Sequence mechanism avoids cross-partition PK collisions.
Create GSIs wisely: Decide GSI strategy based on write volume; use regular GSI for few returned rows, Clustered GSI for one-to-many, UGSI for unique constraints; don't create GSIs for low-ratio SQL; use INSPECT INDEX to periodically clean up redundant GSIs. Every GSI must have its own PARTITION BY KEY(...) PARTITIONS N clause; never write bare GLOBAL INDEX idx(col) without PARTITION BY.
Use 256 partitions: 256 partitions suit the vast majority of workloads, should be several times the number of DN nodes.
Use the three-step method for single table to partitioned table: First convert to 1 partition (preserving uniqueness) -> Create GSI/UGSI -> Change to target partition count, avoiding uniqueness constraint gaps.
Don't force partition key hits for low-ratio SQL: Partition design is pragmatic work; low-QPS cross-shard queries have limited total cost, don't create GSIs for every query field.
Use table groups to optimize JOINs: Bind frequently joined tables to the same table group using the same partition rules.
Avoid unsupported MySQL syntax: Don't use stored procedures, triggers, EVENTs, SPATIAL, NATURAL JOIN, :=, etc.
Avoid subqueries in HAVING/JOIN ON: Rewrite as JOINs or CTEs.
Use EXPLAIN commands for diagnosis: For SQL performance issues, prefer EXPLAIN SHARDING and EXPLAIN ANALYZE.
Check long transactions before Online DDL: Check for long transactions before executing DDL to avoid MDL lock waits.
Use TTL tables to manage cold data: For large tables with time attributes, use TTL tables to automatically clean up expired data.
Use Keyset pagination for efficient paging: Avoid LIMIT M, N deep pagination (cost O(M+N), even larger in distributed systems); record the sort value of the last row in each batch as the WHERE condition for the next batch; when sort columns may have duplicates, use (sort_column, id) tuple comparison; ensure appropriate composite indexes on sort columns.
Use auto-add partitions for Range partitioned tables: PolarDB-X uses a proprietary ALTER TABLE ... MODIFY TTL SET syntax (with multiple parameters like TTL_EXPR, TTL_PART_INTERVAL, ARCHIVE_TYPE, ARCHIVE_TABLE_PRE_ALLOCATE, etc.) to configure automatic partition pre-creation. This syntax is NOT standard SQL and cannot be guessed — you MUST read references/auto-add-range-parts.md for the exact SQL syntax before generating any auto-add partition configuration. Requires version 5.4.20+.

Reference Links

Reference	Description
references/create-table.md	CREATE TABLE syntax, table types (single/broadcast/partitioned), partition strategies, secondary partitions, partition management
references/partition-design-best-practice.md	Partition design best practices: partition key/GSI/algorithm/count selection, three-step migration, complete examples
references/primary-key-unique-key.md	Primary key and unique key Global/Local classification, rules, risks, and recommendations
references/gsi.md	Global Secondary Index GSI/UGSI/Clustered GSI creation, querying, and limitations
references/cci.md	Clustered Columnar Index CCI creation, usage, and applicable scenarios
references/sequence.md	Sequence types (NEW/GROUP/SIMPLE/TIME), creation and usage
references/transactions.md	Distributed transaction model, isolation levels, and considerations
references/mysql-compatibility-notes.md	MySQL vs PolarDB-X compatibility differences and development limitations
references/explain.md	EXPLAIN command variants and execution plan diagnostics
references/ttl-table.md	TTL table definition, cold data archiving, and cleanup scheduling
references/online-ddl.md	Online DDL assessment, lock-free execution strategy, long transaction checks, DMS lock-free changes
references/pagination-best-practice.md	Efficient pagination: Keyset pagination, per-shard traversal, index requirements, Java examples
references/auto-add-range-parts.md	Range partition auto-add: TTL-based partition pre-creation, first/second level configuration, management commands
references/cli-installation-guide.md	Alibaba Cloud CLI installation guide

安全使用建议

This skill is an instruction-only PolarDB‑X SQL/operational guide and appears internally consistent. Before using it: (1) Do not paste production credentials into chat; use the agent's secure credential mechanism if available. (2) The skill expects you to connect to your DB and may suggest DDL — it mandates explicit user confirmation before executing DDL, but you should still review any statements, run EXPLAIN ONLINE_DDL, check long transactions, and take backups or test on a staging instance first. (3) The included Aliyun CLI reference is informational; if you install or configure cloud CLI tools, follow least-privilege and rotation best practices. (4) If you need the agent to execute statements, verify the agent’s DB user has only necessary privileges and consider using read-only or restricted accounts for diagnostics.

能力标签

cryptorequires-walletcan-make-purchasesrequires-sensitive-credentials

能力评估

✓ Purpose & Capability

The name/description (PolarDB-X partition/GSI/CCI/DDL guidance) matches the included SKILL.md and reference files (partition design, GSI, CCI, EXPLAIN, online DDL, TTL, etc.). No unrelated env vars, binaries, or config paths are required.

✓ Instruction Scope

SKILL.md focuses on inspecting the DB (SELECT VERSION(), SHOW CREATE DATABASE), collecting query patterns, designing partition keys, and DDL workflows. It emphasizes explicit user confirmation before executing DDL and contains hard-stops for unsupported instance types. It does not instruct reading arbitrary local files or exfiltrating data to external endpoints.

✓ Install Mechanism

No install spec and no code files to execute; all content is instruction/documentation. Lowest-risk modality — nothing will be downloaded or written by an installer.

✓ Credentials

The skill declares no required environment variables or credentials. Reference docs include an Aliyun CLI installation/configuration guide (helpful for admin tasks) but the skill does not require Alibaba Cloud credentials to produce SQL guidance. No disproportionate credential requests are present.

✓ Persistence & Privilege

always:false and normal invocation settings. The skill does not request persistent system presence or modify other skills' configurations; it merely provides runtime instructions for database interaction and decision points requiring user confirmation.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install alibabacloud-polardbx-sql
安装完成后，直接呼叫该 Skill 的名称或使用 /alibabacloud-polardbx-sql 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v0.0.2

alibabacloud-polardbx-sql 0.0.2 Changelog - Updated partition scheme and table type design guidance for PolarDB-X 2.0 Enterprise Edition AUTO mode, with stricter multi-dimensional analysis of partition key selection. - Clarified strict applicability: refuse all requests for DRDS 1.0 or Standard Edition with specific response and hard stop. - Expanded and refined table type selection logic, especially for broadcast vs single tables and join behavior. - Enhanced GSI best practices, clearly indicating the required syntax and write workload considerations. - Improved partition design workflow to emphasize SQL access pattern data gathering as a prerequisite. - Documentation and trigger phrase updates for greater accuracy and clarity.

v0.0.1

alibabacloud-polardbx-sql 0.0.1 — Initial Release - Provides design guidance for partition schemes, partition key selection, and GSI/CCI creation for PolarDB-X 2.0 Enterprise Edition AUTO mode. - Focuses on MySQL compatibility, highlighting syntax and behavioral differences between PolarDB-X and MySQL. - Includes step-by-step workflow for table design, partitioning, SQL migration, and query diagnostics. - Outlines key verification steps to ensure skill is applied on supported engine and database mode. - Details parameter confirmation requirements to avoid accidental defaults. - Offers quick-reference tables for configuration and feature differences.

元数据

Slug alibabacloud-polardbx-sql

版本 0.0.2

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 2

常见问题