← 返回 Skills 市场
aitowerofbabel-lang

Bosszp

作者 aitowerofbabel-lang · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
103
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install bosszp
功能描述
BOSS直聘岗位数据爬虫 + 可视化分析工具。使用 Scrapy 爬取职位数据,存入 MySQL,用 Flask + Highcharts 生成可视化报告。当用户需要批量采集 BOSS直聘 职位数据、分析薪资分布、公司分布、岗位要求,或需要生成可视化报告时触发。
使用说明 (SKILL.md)

BOSS直聘 数据爬虫分析 (bosszp)

环境要求

  • Python 3.x
  • MySQL
  • Scrapy
  • Pandas + SQLAlchemy
  • Flask + Highcharts

安装依赖

pip install scrapy
pip install flask pandas sqlalchemy pymysql
pip install flash

数据采集

1. 创建 Scrapy 项目

scrapy startproject bosszp
cd bosszp
scrapy genspider boss zhipin.com

2. 配置 Cookie 和请求头

编辑 settings.py

COOKIES_ENABLED = True
USER_AGENTS = [
    'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36...',
    ...
]

# 爬取延迟(避免被封)
DOWNLOAD_DELAY = 2

3. 运行爬虫

scrapy crawl boss -o output.csv

数据导出

CSV 格式

岗位名称,岗位地区,薪资,公司名称,公司类型,公司规模,融资阶段,工作年限,学历,福利标签

导入 MySQL

python clean_data.py   # 数据清洗 + 导入 MySQL

可视化看板

cd web
export FLASK_APP=run.py
flask run
# 访问 http://127.0.0.1:8080/

看板包含:

  • 饼图: 公司融资阶段分布
  • 柱状图: 薪资分布 Top10
  • 词云: 岗位福利标签词云
  • 表格: 公司排名 Top10(按岗位数量)

字段说明

字段 说明
job_name 岗位名称
job_area 工作地区
job_salary 薪资范围
com_name 公司名称
com_type 公司类型(国企/民营等)
com_size 公司规模
finance_stage 融资阶段
work_year 工作年限要求
education 学历要求
job_benefits 福利标签
安全使用建议
This skill appears to be an instructions-only guide to build a Scrapy->MySQL->Flask pipeline for scraping BOSS直聘. Before you proceed: 1) Do not run pip install commands blindly — verify packages and versions. The SKILL.md has 'pip install flash' which is likely a typo for 'flask' and could install an unrelated PyPI package; correct and verify before installing. 2) The repository contains no spider code, no clean_data.py and no web/run.py — you'll need to implement or inspect those scripts; do not run third-party scripts you didn't review. 3) The workflow requires MySQL credentials; decide how to provide them securely (use least-privilege DB user, avoid embedding credentials in public files). 4) Web scraping external sites can violate terms of service and legal/regulatory rules; check robots.txt, site terms, and rate-limit to avoid blocking. 5) Run in an isolated environment (virtualenv/container) and audit installed packages before use. If you want a safer evaluation, request the actual spider and data-cleaning scripts so they can be reviewed for unwanted network/file/credential access.
功能分析
Type: OpenClaw Skill Name: bosszp Version: 1.0.0 The skill bundle provides instructions and metadata for a job data scraper and visualization tool (BOSS Zhipin). While the bundle references external scripts like 'clean_data.py' and 'run.py' that are not included in the provided files, the instructions in SKILL.md describe standard Scrapy and Flask workflows. There are no signs of data exfiltration, malicious execution, or prompt injection.
能力评估
Purpose & Capability
The SKILL.md describes a Scrapy -> MySQL -> Flask pipeline which is coherent with the named purpose (BOSS直聘 数据爬虫 + 可视化). However, the skill package contains no code files (spiders, clean_data.py, web/run.py) yet the instructions reference them; that makes the runtime expectations incomplete. Overall capabilities align with the stated purpose but are under-specified.
Instruction Scope
Instructions tell the user to create/run a Scrapy project, run a spider against zhipin.com, and run external scripts (clean_data.py, run.py) that are not bundled. The steps instruct installing packages and running network I/O (scraping) but do not describe how database credentials are provided or what clean_data.py does. This is open-ended and could lead users to run or create code without clear safety or credential-handling guidance.
Install Mechanism
Although there is no formal install spec, the SKILL.md includes pip install commands. Most are expected (scrapy, flask, pandas, sqlalchemy, pymysql), but there is a likely-typo or unexpected command: 'pip install flash' (instead of 'flask'). Installing an unexpected PyPI package named 'flash' could pull an unrelated or malicious package. Any instructions that blindly pip install third-party packages without verifying versions/source are a risk.
Credentials
The skill requests no environment variables in metadata, yet the workflow requires MySQL access for importing data. The SKILL.md does not specify how DB credentials should be provided (env vars, config file, interactive input). Requiring database access in practice is proportional to the purpose, but the omission of explicit credential handling is a gap and could lead users to supply credentials insecurely.
Persistence & Privilege
The skill is instruction-only and has no install spec, does not request always:true, and does not claim to modify other skills or persist in the agent. No elevated persistence or privileges are requested.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install bosszp
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /bosszp 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of bosszp: a BOSS直聘 job data crawler and visualization tool. - Scrapes job listings from BOSS直聘 using Scrapy and exports data as CSV. - Cleans and imports job data into MySQL for structured analysis. - Provides a Flask + Highcharts web dashboard with salary, company, and job requirement visualizations. - Includes features like financing phase pie charts, Top10 salary bar charts, benefit tag word clouds, and company rankings. - Supports batch data collection and report generation for comprehensive job market analysis.
元数据
Slug bosszp
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Bosszp 是什么?

BOSS直聘岗位数据爬虫 + 可视化分析工具。使用 Scrapy 爬取职位数据,存入 MySQL,用 Flask + Highcharts 生成可视化报告。当用户需要批量采集 BOSS直聘 职位数据、分析薪资分布、公司分布、岗位要求,或需要生成可视化报告时触发。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 103 次。

如何安装 Bosszp?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install bosszp」即可一键安装,无需额外配置。

Bosszp 是免费的吗?

是的,Bosszp 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Bosszp 支持哪些平台?

Bosszp 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Bosszp?

由 aitowerofbabel-lang(@aitowerofbabel-lang)开发并维护,当前版本 v1.0.0。

💬 留言讨论