← Back to Skills Marketplace
errsr

全球旅游资源解析入库技能 V2.0

by eRrsr · GitHub ↗ · v2.0.0 · MIT-0
cross-platform ⚠ suspicious
29
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install yc-resource-import
Description
全球旅游资源全自动解析入库技能。 支持本地文件 / Google Drive / 百度网盘自动拉取合同; 支持 PDF / Word / 图片PDF / 扫描件 / 拍照图片全格式解析; 自动识别8大资源类型 → 按锁死标准字段抽取 → 公开信息自动补齐 → 敏感信息脱敏 → 合规校验 → 输出标准CSV并追加到...
README (SKILL.md)

\r \r

全球旅游资源解析入库技能 V2.0(最终锁字段版)\r

【全局强约束:字段永久锁死,禁止任何改动】\r

你必须严格绑定以下8类资源固定字段体系,字段名、数量、顺序永久固定,不允许:新增字段、删除字段、改名字段、调整顺序、合并字段、编造内容。\r 识别资源类型后,只加载对应字段列表,按顺序填空,缺则留空。\r \r -------------------------------------------------------------------------------\r

【1】酒店 固定字段(42个,顺序锁死)\r

contract_id,hotelnameen,hotelnamecn,star_rating,region_en,region_cn,address,phone,email,website,gps_longitude,gps_latitude,Zone ID,Hotel Number,Company Name,room_type,area_sqm,max_occupancy,bed_type,room_amenities,facilities,unique_features,priceidrlow_season,priceidrhigh_season,priceidrpeak_season,pricecnylow_season,pricecnyhigh_season,pricecnypeak_season,breakfast_policy,children_policy,extrabedpolicy,cancellation_policy,checkintime,checkouttime,deposit_policy,payment_terms,validity_start,validity_end,data_source,data_quality,last_updated,notes,tags_audience,tags_style,tags_feature\r \r

【2】车辆 固定字段(36个,顺序锁死)\r

vehiclemodelen,vehiclemodelcn,vehicle_type,seats,capacity,luggage_capacity,air_condition,driver_language,services_included,driver_experience,vehicle_condition,Company Name,priceidrhalf_day,priceidrdaily13,priceidrdaily4,Priceidrdaily5,Priceidrdaily6,Priceidrdaily7,Priceidrdaily8,pricecnyhalf_day,pricecnydaily13,pricecnydaily47,Pricecnydaily5,Pricecnydaily6,Pricecnydaily7,Pricecnydaily8,service_hours,overtimefeeidr,overtimefeecny,outofareafeeidr,fuel_policy,toll_policy,parking_policy,validity_start,validity_end,data_source,data_quality,last_updated,notes,tags_audience,tags_style,tags_feature\r \r

【3】景点 固定字段(40个,顺序锁死)\r

Attraction ID,attractionnameen,attractionnamecn,attraction_type,category,region_en,region_cn,address,website,gps_longitude,gps_latitude,Zone ID,opening_hours,closing_days,duration_required,bestvisittime,priceidradult,priceidrchild,priceidrinfant,pricecnyadult,pricecnychild,pricecnyinfant,special_requirements,booking_info,transport_info,description,highlights,photos_allowed,guided_tour,wheelchair_access,parking_available,restaurantonsite,restroom_available,souvenir_shop,validity_start,validity_end,data_source,data_quality,last_updated,notes,tags_audience,tags_style,tags_feature\r \r

【4】活动 固定字段(42个,顺序锁死)\r

activity_id,activitynameen,activitynamecn,activity_type,category,region_en,region_cn,gps_longitude,gps_latitude,Zone ID,Company Name,location,address,phone,email,website,duration_hours,difficulty_level,age_restriction,health_requirements,physical_demand,besttimeof_day,Opening Hours,Recommended Time,Best Time to Visit,seasonal_availability,groupsizemin,groupsizemax,priceidradult,priceidrchild,priceidrinfant,pricecnyadult,pricecnychild,pricecnyinfant,inclusions,instructor_language,equipment_provided,insurance_included,weather_dependency,booking_required,cancellation_policy,description,highlights,safety_notes,whattobring,photos_allowed,videoserviceavailable,validity_start,validity_end,data_source,data_quality,last_updated,notes,tags_audience,tags_style,tags_feature\r \r

【5】SPA 固定字段(33个,顺序锁死)\r

spa_id,spanameen,spanamecn,spa_type,category,region_en,region_cn,address,phone,email,website,gps_longitude,gps_latitude,Zone ID,SPA Shop Name,opening_hours,closing_days,duration_hours,Treatments info,priceidr60min,priceidr90min,priceidr120min,pricecny60min,pricecny90min,pricecny120min,inclusions,room_type,products_used,booking_required,cancellation_policy,shower_facilities,locker_rooms,validity_start,validity_end,data_source,data_quality,last_updated,notes,tags_audience,tags_style,tags_feature\r \r

【6】俱乐部 固定字段(35个,顺序锁死)\r

club_id,clubnameen,clubnamecn,club_type,category,region_en,region_cn,address,Zone ID,Company Name,phone,email,website,gps_longitude,gps_latitude,opening_hours,closing_days,age_restriction,dress_code,entry_requirements,coverchargeidr_adult,coverchargeidr_female,coverchargeidr_couple,coverchargecny_adult,coverchargecny_female,coverchargecny_couple,tablebookingrequired,minimumspendingidr,minimumspendingcny,features,music_style,dj_schedule,special_events,parking_available,vip_rooms,bottle_service,food_available,description,highlights,safety_notes,validity_start,validity_end,data_source,data_quality,last_updated,notes,tags_audience,tags_style,tags_feature\r \r

【7】餐厅 固定字段(38个,顺序锁死)\r

restaurant_id,restaurantnameen,restaurantnamecn,restaurant_type,category,region_en,region_cn,address,Zone ID,Company Name,phone,email,website,gps_longitude,gps_latitude,opening_hours,closing_days,cuisine_type,seating_capacity,outdoor_seating,beach_front,sunset_view,signature_dishes,special_offers,private_dining,Package1,Menu 1,averagepriceidrperperson,averagepricecnyperperson,Package2,Menu 2,reservation_required,advancebookingdays,cancellation_policy,free_parking,wheelchair_accessible,children_menu,halal_food,payment_methods,service_charge,tax,description,highlights,special_notes,validity_start,validity_end,data_source,data_quality,last_updated,notes,tags_audience,tags_style,tags_feature\r \r

【8】下午茶 固定字段(32个,顺序锁死)\r

afternoonteaid,afternoonteaname_en,afternoonteaname_cn,afternoonteatype,category,region_en,region_cn,address,Zone ID,Company Name,phone,email,website,gps_longitude,gps_latitude,opening_hours,closing_days,teatimehours,capacity,menu_type,tea_selection,cake_selection,special_combinations,priceidrsetfortwo,priceidrper_person,pricecnysetfortwo,pricecnyper_person,reservations_required,advancebookingdays,cancellation_policy,beach_view,sunset_view,pool_access,signature_treats,special_offers,private_area,free_parking,wheelchair_accessible,children_friendly,payment_methods,service_charge,tax,description,highlights,special_notes,validity_start,validity_end,data_source,data_quality,last_updated,notes,tags_audience,tags_style,tags_feature\r -------------------------------------------------------------------------------\r \r

一、触发识别与路径提取\r

从用户输入动态提取以下变量,禁止硬编码:\r

  1. 云盘URL:Google Drive / 百度网盘\r
  2. 本地文件路径\r
  3. 模型文件路径(生产CSV所在目录)\r
  4. 目标输出路径(解析结果存放目录)\r 未指定路径时,必须提示用户补充。\r \r

二、依赖检查\r

执行前必须检查:\r

  • google-drive、Bbaidu-netdisk-storage 技能\r
  • pdfplumber、python-docx、PyMuPDF\r 缺失时返回安装指引,终止执行。\r \r

三、文件拉取与预处理\r

  1. 云盘文件:自动下载 PDF/Word/图片类合同,过滤无关文件\r
  2. 本地文件:直接读取\r
  3. 图片PDF/扫描件/拍照图:自动OCR转文本,notes标注「来源=OCR识别」\r
  4. 云端文件先下载到本地临时目录,再解析\r
  5. 支持批量10+文件,任务队列串行处理,不崩溃\r \r

四、资源类型自动识别(无需人工指定)\r

按以下维度判定,输出固定8类:\r 酒店、车辆、景点、活动、SPA、俱乐部、餐厅、下午茶\r 判定依据:\r

  • 文件名/标题关键词\r
  • 正文特征词(房型、门票、租车、按摩、冲浪等)\r
  • 价格结构(按晚/按天/按人次/按分钟)\r
  • 条款模式(入住退房、超时费、时长、套餐)\r \r

五、三层标签体系(强制使用官方标准)\r

人群标签(所有资源通用)\r

家庭、亲子、情侣、商务、蜜月、年轻人、老年人、单身\r

风格标签(所有资源通用)\r

豪华、经济型、浪漫、休闲、刺激、度假村、商务、自然、文化\r

特色标签(严格按8类官方标签库生成,不编造)\r

必须从《巴厘岛旅游资源_8类特色标签快速查阅表_V4.0》中选取,只选匹配项,用英文逗号分隔。\r \r

六、Zone ID 自动匹配(官方规则)\r

按地区自动映射:\r

  • Zone-S1:努沙杜瓦、金巴兰、乌鲁瓦图\r
  • Zone-S2:库塔、水明漾、长谷、萨努尔\r
  • Zone-C1:乌布、德格拉朗、嘉利维\r
  • Zone-W1:贝都古、汉达拉、贾蒂卢维\r
  • Zone-E1:帕当拜、阿曼、天堂之门\r
  • Zone-N1:京打马尼、罗威纳、塞库普\r
  • Zone-OFF:佩妮达、蓝梦岛、吉利群岛\r 无法匹配时填:未知\r \r

七、字段抽取与补齐规则(核心)\r

  1. 按锁死字段列表按顺序抽取,不增、不减、不改、不乱序\r
  2. 优先级:合同原文 > 公开网络查询 > 留空\r
  3. 【自动补齐白名单】仅公开信息可联网查询补齐:\r address、region_en、region_cn、gps_longitude、gps_latitude、website、phone、email、opening_hours\r
  4. 价格、政策、合同条款、有效期、标签:只使用合同内容,禁止补齐\r
  5. 补齐记录必须在notes标注:[自动补齐]xxx / [缺失]xxx\r
  6. 无内容必须留空,禁止填写:无、NULL、未提供、合同未提供\r \r

八、多房型解析规则(酒店专用)\r

  1. 每个房型单独一行\r
  2. 基础信息全部复制\r
  3. 房型、面积、价格、床型单独填写\r
  4. 政策、标签全行一致\r \r

九、敏感信息自动脱敏(强制执行)\r

  • 手机号:138****1234\r
  • 身份证号:1101********1234\r
  • 银行卡/对公账户:6222****1234\r
  • 其他隐私信息一律打码,不显示原文\r \r

十、合规与质量校验(官方评分体系)\r

  1. 合同有效期校验:\r
    • 已过期 → notes:合同已过期\r
    • 30天内到期 → notes:合同即将到期\r
  2. 必备字段缺失 → notes:核心信息缺失,需人工复核\r
  3. 价格异常/冲突 → notes:条款异常,请人工核对\r
  4. 数据质量自动评分(官方标准):\r
    • 合约价:90分\r
    • 官网价:70分\r
    • 未知来源:0分\r 完整+时效+审核通过:+10~15分\r \r

十一、CSV输出路径与格式\r

用户可指定任意输出路径,自动按资源类型分文件夹:\r /output/酒店、/output/车辆、/output/景点 ...\r 编码:\r

  • 酒店:utf-8-sig\r
  • 其他7类:gbk\r \r

十二、数据库导入规则\r

  1. 8类资源对应8张表\r
  2. 字段与CSV完全一致\r
  3. 追加写入,不覆盖\r
  4. 重复ID自动跳过\r
  5. 支持 MySQL / PostgreSQL\r \r

十三、依赖缺失处理\r

  • 缺云盘技能:提示执行 openclaw skills install xxx\r
  • 缺Python库:提示 pip install pdfplumber python-docx PyMuPDF\r
  • 路径不存在:自动创建\r \r

十四、执行完成标准回复(必须原样返回)\r

✅ 资源解析入库完成\r ━━━━━━━━━━━━━━━━━━━━\r 解析文件:xxx.pdf、xxx.docx\r 成功解析:N 条数据\r 资源类型:酒店×N | 车辆×N | 景点×N | 活动×N | SPA×N | 俱乐部×N | 餐厅×N | 下午茶×N\r 输出CSV路径:/xxx/xxx/xxx/\r 数据库入库状态:已完成 / 未开启\r 清理状态:原始文件已归档,临时文件已删除\r \r

十五、文件处理规则\r

  1. 云盘下载原始合同 → 移至 archive 目录归档,不删除\r
  2. 中间临时文件 → 自动删除\r
  3. 最终CSV → 永久保留在用户指定目录\r \r

十六、绝对禁止行为\r

  • 不编造任何合同信息\r
  • 不脑补价格、政策、条款、标签\r
  • 不新增/删除/改名字段\r
  • 不打乱字段顺序\r
  • 不输出解释、表格、代码块,只输出纯CSV\r
  • 不修改合同真实内容
Usage Guidance
Review this skill before installing or running it against production data. It appears purpose-aligned for tourism contract parsing, but use a staging copy first, confirm every production CSV path manually, and restrict Google Drive/Baidu Netdisk access to only the needed contract files.
Capability Assessment
Purpose & Capability
The stated purpose is coherent: parse tourism contracts, normalize fields, mask sensitive data, and output/import CSVs. However, that purpose includes sensitive contract processing and production data mutation.
Instruction Scope
The artifacts describe automatic import into a production CSV/database, but the visible instructions and helper script do not show a clear user confirmation step, rollback plan, backup strategy, or path allowlist before modifying production data.
Install Mechanism
Registry requirements show no install spec, binaries, credentials, or env vars, while _meta.json declares Python dependencies and required cloud-storage skills. This looks purpose-aligned but under-declared.
Credentials
Access to local files, Google Drive, and Baidu Netdisk contracts is expected for this skill, but those sources can contain sensitive business and personal data.
Persistence & Privilege
The helper script rewrites the target production CSV in place, creating a material risk of accidental production data corruption if the wrong path or malformed input is used.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install yc-resource-import
  3. After installation, invoke the skill by name or use /yc-resource-import
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v2.0.0
初始发布:8大资源类型锁死字段体系,Zone映射,数据质量评分,脱敏,OCR支持,兼容google-drive/Bbaidu-netdisk-storage云盘技能
Metadata
Slug yc-resource-import
Version 2.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is 全球旅游资源解析入库技能 V2.0?

全球旅游资源全自动解析入库技能。 支持本地文件 / Google Drive / 百度网盘自动拉取合同; 支持 PDF / Word / 图片PDF / 扫描件 / 拍照图片全格式解析; 自动识别8大资源类型 → 按锁死标准字段抽取 → 公开信息自动补齐 → 敏感信息脱敏 → 合规校验 → 输出标准CSV并追加到... It is an AI Agent Skill for Claude Code / OpenClaw, with 29 downloads so far.

How do I install 全球旅游资源解析入库技能 V2.0?

Run "/install yc-resource-import" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is 全球旅游资源解析入库技能 V2.0 free?

Yes, 全球旅游资源解析入库技能 V2.0 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does 全球旅游资源解析入库技能 V2.0 support?

全球旅游资源解析入库技能 V2.0 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created 全球旅游资源解析入库技能 V2.0?

It is built and maintained by eRrsr (@errsr); the current version is v2.0.0.

💬 Comments