dataify-server

Dataify Glassdoor Company By Url

Author: dataify-server · GitHub · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 45
Favorites: 0
Current installs: 0
Versions: 1
Install in OpenClaw
/install dataify-glassdoor-company-by-url
Description
Prepare Dataify builder requests for the glassdoor.com scraper family rooted at glassdoor_company_by-url. Use when needs to work with the successful Dataify...
Usage instructions (SKILL.md)

Dataify Builder Skill

Use this skill to prepare Dataify builder requests for the scraper family rooted at glassdoor_company_by-url on glassdoor.com.

Workflow

  1. Check whether DATAIFY_API_TOKEN exists in the environment.
  2. If the token is missing, stop and tell the user to sign in at the Dataify Dashboard (https://dataify.com/dashboard) to obtain one.
  3. Ask the user to choose exactly one tool from the following Chinese list:
  • 通过URL采集 (glassdoor_company_by-url)
  • 通过过滤器采集 (glassdoor_company_by-inputfilter)
  • 通过关键词采集 (glassdoor_company_by-keywords)
  • 通过搜索网址采集 (glassdoor_company_by-listurl)
  • 通过URL采集 (glassdoor_joblistings_by-url)
  • 通过关键词采集 (glassdoor_joblistings_by-keywords)
  • 通过搜索网址采集 (glassdoor_joblistings_by-listurl)
  4. Read references/tool-params.json and find the chosen tool by tool_sign or Chinese tool name.
  5. For each parameter in the chosen tool:
    • If input_mode is user_input, ask the user for the value.
    • If input_mode is select, present the saved options to the user.
  6. Use scripts/build-dataify-request.py as the default cross-platform helper.
  7. Use scripts/build-dataify-request.ps1 as the Windows PowerShell helper when needed.
  8. When a selectable parameter has a human-readable Chinese label, keep that label in spider_parameters. Do not replace it with a code such as HK unless the user explicitly asks for the coded value.
  9. Build spider_parameters as a JSON array.
  10. If every parameter has only one final value, build one object such as [{"searchurl":"...","country":"Hong Kong"}].
  11. If one or more parameters have multiple aligned values, zip them by index and build one object per row. Example: [{"search_url":"url1","page_turning":"1","max_num":"15"},{"search_url":"url2","page_turning":"1","max_num":"15"}].
  12. If a parameter has one value while another parameter has multiple values, reuse the single value across every generated row.
  13. Set spider_name to glassdoor.com.
  14. Set spider_id to the selected tool's tool_sign.
  15. Always include spider_errors=true and file_name={{TasksID}}.
  16. Return a curl command for https://scraperapi.dataify.com/builder.
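
The row-building rules in the workflow above (one object when every parameter is single-valued, zip by index when values are aligned, reuse a single value across rows) can be sketched in Python as follows. This is only an illustration of the stated rules, not the code of scripts/build-dataify-request.py; the function name build_spider_parameters and its input shape are assumptions made for the example.

```python
from typing import Dict, List, Union

Value = Union[str, List[str]]


def build_spider_parameters(params: Dict[str, Value]) -> List[Dict[str, str]]:
    """Zip per-parameter values into one object per row (hypothetical helper).

    A list with several entries contributes one value per row; a single
    string (or a one-element list) is reused on every row.
    """
    for name, value in params.items():
        if isinstance(value, list) and not value:
            raise ValueError(f"parameter {name!r} has no values")

    # Only lists with more than one entry decide how many rows are built.
    lengths = {len(v) for v in params.values() if isinstance(v, list) and len(v) > 1}
    if len(lengths) > 1:
        raise ValueError(f"multi-value parameters are not aligned: {sorted(lengths)}")
    row_count = lengths.pop() if lengths else 1

    rows = []
    for i in range(row_count):
        row = {}
        for name, value in params.items():
            if isinstance(value, list):
                row[name] = value[i] if len(value) > 1 else value[0]
            else:
                row[name] = value
        rows.append(row)
    return rows


rows = build_spider_parameters(
    {"search_url": ["url1", "url2"], "page_turning": "1", "max_num": "15"}
)
```

With the input shown, rows holds two objects that share page_turning and max_num and differ only in search_url, matching the multi-value example in the workflow. Raising an error on misaligned lists is a design choice of this sketch; the source does not say how the real helper handles that case.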

Set DATAIFY_API_TOKEN

Prefer a permanent environment-variable setup instead of setting the token only for the current terminal session.

Windows PowerShell, permanent for the current user:

[Environment]::SetEnvironmentVariable("DATAIFY_API_TOKEN", "your_token_here", "User")

Then reopen PowerShell. If the current session also needs the token immediately, run:

$env:DATAIFY_API_TOKEN = "your_token_here"

macOS or Linux, permanent for bash:

echo 'export DATAIFY_API_TOKEN="your_token_here"' >> ~/.bashrc
source ~/.bashrc

macOS or Linux, permanent for zsh:

echo 'export DATAIFY_API_TOKEN="your_token_here"' >> ~/.zshrc
source ~/.zshrc
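
Once the variable is set, the first workflow step (checking that DATAIFY_API_TOKEN exists) could look roughly like the sketch below. The function name token_is_set is hypothetical, and the check deliberately never prints the token value.

```python
import os
from typing import Mapping


def token_is_set(env: Mapping[str, str] = os.environ) -> bool:
    """Return True if DATAIFY_API_TOKEN is present and not blank."""
    return bool(env.get("DATAIFY_API_TOKEN", "").strip())


def token_status(env: Mapping[str, str] = os.environ) -> str:
    """Describe the token state without revealing its value."""
    if token_is_set(env):
        return "DATAIFY_API_TOKEN is set."
    return (
        "DATAIFY_API_TOKEN is missing. Sign in at "
        "https://dataify.com/dashboard to obtain one."
    )
```

Passing the environment as a parameter keeps the check easy to exercise with a plain dictionary, for example token_status({}) for the missing-token message.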

Script usage

Python:

python scripts/build-dataify-request.py --tool-sign <selected_tool_sign> --values-file values.json

PowerShell:

& ".\scripts\build-dataify-request.ps1" -ToolSign "<selected_tool_sign>" -ValuesFile ".\values.json"

The values.json file should contain either one object or an array of objects. Example:

[{"searchurl":"https://www.airbnb.com/s/Greece/homes?...","country":"Hong Kong"}]
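
Because values.json may hold either one object or an array of objects, a reader of that file has to normalize both shapes to a list. A minimal sketch of that contract is shown below; the names normalize_values and load_values are assumptions for illustration and may not match the real helper script.

```python
import json
from pathlib import Path
from typing import Any, Dict, List


def normalize_values(data: Any) -> List[Dict[str, Any]]:
    """Accept one object or an array of objects; always return a list."""
    if isinstance(data, dict):
        return [data]
    if isinstance(data, list) and all(isinstance(item, dict) for item in data):
        return data
    raise ValueError("values.json must contain one object or an array of objects")


def load_values(path: str) -> List[Dict[str, Any]]:
    """Read a values file from disk and normalize its content."""
    return normalize_values(json.loads(Path(path).read_text(encoding="utf-8")))
```

For example, normalize_values({"country": "Hong Kong"}) wraps the single object in a one-element list, while an array of objects is returned unchanged.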

Required output shape

Generate a curl command in this form:

curl -X POST 'https://scraperapi.dataify.com/builder' \
  -H "Authorization: Bearer $DATAIFY_API_TOKEN" \
  -H 'Content-Type: application/x-www-form-urlencoded' \
  -d 'spider_name=glassdoor.com' \
  -d 'spider_id=<selected_tool_sign>' \
  -d 'spider_parameters=[{"param":"value"}]' \
  -d 'spider_errors=true' \
  -d 'file_name={{TasksID}}'
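
One way to assemble a command of this shape in Python is sketched below. It keeps the literal $DATAIFY_API_TOKEN placeholder in the output so the token is expanded by the shell at run time and never appears in the generated text. The function name build_curl and the url parameter in the usage line are hypothetical; shlex.quote is the standard-library shell quoting function.

```python
import json
import shlex
from typing import Dict, List


def build_curl(tool_sign: str, rows: List[Dict[str, str]]) -> str:
    """Return a multi-line curl command for the Dataify builder endpoint."""
    fields = [
        ("spider_name", "glassdoor.com"),
        ("spider_id", tool_sign),
        # Compact JSON; ensure_ascii=False keeps Chinese labels readable.
        ("spider_parameters", json.dumps(rows, ensure_ascii=False, separators=(",", ":"))),
        ("spider_errors", "true"),
        ("file_name", "{{TasksID}}"),
    ]
    lines = [
        "curl -X POST 'https://scraperapi.dataify.com/builder'",
        # Double quotes let the shell expand the variable when the command runs.
        '  -H "Authorization: Bearer $DATAIFY_API_TOKEN"',
        "  -H 'Content-Type: application/x-www-form-urlencoded'",
    ]
    lines += [f"  -d {shlex.quote(f'{key}={value}')}" for key, value in fields]
    return " \\\n".join(lines)


command = build_curl("glassdoor_company_by-url", [{"url": "https://example.com"}])
```

shlex.quote only adds single quotes where the shell needs them, so simple fields such as spider_errors=true come out unquoted. Note that curl's -d sends each value as written; if a parameter value contains characters such as & or +, --data-urlencode may be the safer flag, though the source does not say which form the endpoint expects.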

Reference usage

  • references/tool-params.json stores the full saved parameter catalog for every available tool in this scraper family.
  • scripts/build-dataify-request.py is the portable implementation and should be preferred.
  • scripts/build-dataify-request.ps1 mirrors the same behavior for Windows users.
  • If a parameter has no options, the user must provide the value.
  • If a parameter has options, present those options back to the user before building the final request.
  • Do not assume spider_parameters always contains exactly one object. Multi-value tools may require multiple objects zipped by index.
  • Use the saved url_example only as a reference example. Do not assume the user wants the example values unless they explicitly confirm them.
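
The lookup and prompting rules above can be sketched as follows. The catalog layout used here (a list of tools, each with tool_sign, tool_name, and a params list carrying name, input_mode, and options) is purely an assumption for illustration; the real references/tool-params.json may be structured differently, and the parameter names url and country are placeholders.

```python
from typing import Any, Dict, List

# Hypothetical catalog shape; not taken from the real tool-params.json.
CATALOG: List[Dict[str, Any]] = [
    {
        "tool_sign": "glassdoor_company_by-url",
        "tool_name": "通过URL采集",
        "params": [
            {"name": "url", "input_mode": "user_input"},
            {"name": "country", "input_mode": "select", "options": ["Hong Kong"]},
        ],
    },
    {
        "tool_sign": "glassdoor_joblistings_by-url",
        "tool_name": "通过URL采集",
        "params": [{"name": "url", "input_mode": "user_input"}],
    },
]


def find_tool(catalog: List[Dict[str, Any]], key: str) -> Dict[str, Any]:
    """Find a tool by tool_sign or Chinese name; reject ambiguous names."""
    matches = [t for t in catalog if key in (t.get("tool_sign"), t.get("tool_name"))]
    if len(matches) != 1:
        raise KeyError(f"{key!r} matched {len(matches)} tools; use the tool_sign")
    return matches[0]


def plan_prompts(tool: Dict[str, Any]) -> List[str]:
    """List what to ask the user for each parameter of the chosen tool."""
    prompts = []
    for param in tool.get("params", []):
        options = param.get("options") or []
        if param.get("input_mode") == "select" and options:
            prompts.append(f"{param['name']}: choose one of {options}")
        else:
            prompts.append(f"{param['name']}: enter a value")
    return prompts
```

The ambiguity check matters because the tool list in the workflow reuses Chinese names across the company and job-listing tools (通过URL采集 appears twice), so tool_sign is the only unambiguous key.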
Security recommendations
Install only if you are comfortable with this skill handling a Dataify API token. Prefer a session-only environment variable or a secrets manager, avoid running the helper in CI/shared terminals/logged sessions, and rotate the Dataify token if it appears in terminal history, logs, screenshots, or shared curl output. Also verify the missing referenced parameter catalog before relying on the skill.
Capability assessment
Purpose & Capability
The stated purpose is coherent: build Dataify Glassdoor scraper builder curl requests. However, the artifact references references/tool-params.json and a PowerShell helper that are not included, so the workflow may not work as documented.
Instruction Scope
The skill asks the agent/user to use DATAIFY_API_TOKEN and the Python helper prints a curl command containing the resolved bearer token instead of an environment-variable placeholder, which can leak the token through terminal output or logs.
Install Mechanism
No package installer, dependency install, background service, or automatic startup mechanism is present in the artifact files reviewed.
Credentials
Using a Dataify API token is expected for an authenticated Dataify request builder, but the documented default favors persistent shell or user environment storage without corresponding handling warnings.
Persistence & Privilege
The skill recommends permanent token storage in shell profile files or user environment variables. This is user-directed rather than hidden, but it is long-lived credential persistence without enough caution about local exposure, dotfile sync, logs, or rotation.
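How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install dataify-glassdoor-company-by-url
  3. Once installed, invoke the skill by name or trigger it with /dataify-glassdoor-company-by-url
  4. Provide the inputs required by the skill's parameter descriptions to get structured output.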
Version history
v1.0.0
Initial release supporting the Dataify builder workflow for Glassdoor scraper tools:
  • Lets users select one Glassdoor scraping tool from a Chinese-language list and input/choose required parameters.
  • Reads full tool parameter definitions from references/tool-params.json.
  • Builds aligned JSON parameter arrays for multi-value tools, supporting user input and selectable options.
  • Produces ready-to-run curl requests for scraperapi.dataify.com/builder with all required arguments and the API token.
  • Includes detailed setup instructions for the DATAIFY_API_TOKEN environment variable on all platforms.
Metadata
Slug: dataify-glassdoor-company-by-url
Version: 1.0.0
License: MIT-0
Total installs: 0
Current installs: 0
Versions: 1
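FAQ

What is Dataify Glassdoor Company By Url?

Prepare Dataify builder requests for the glassdoor.com scraper family rooted at glassdoor_company_by-url. Use when needs to work with the successful Dataify... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 45 total downloads to date.

How do I install Dataify Glassdoor Company By Url?

Run the command "/install dataify-glassdoor-company-by-url" in the OpenClaw or Claude Code chat box for a one-step install. The install itself needs no extra configuration, but the skill requires DATAIFY_API_TOKEN to be set before use.

Is Dataify Glassdoor Company By Url free?

Yes. Dataify Glassdoor Company By Url is completely free under the MIT-0 license and can be freely downloaded, installed, and used.

Which platforms does Dataify Glassdoor Company By Url support?

Dataify Glassdoor Company By Url is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed Dataify Glassdoor Company By Url?

It is developed and maintained by dataify-server (@dataify-server). The current version is v1.0.0.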
