Dataify Github Repository By Repo Url
/install dataify-github-repository-by-repo-url
Dataify Builder Skill
Use this skill to prepare Dataify builder requests for the scraper family rooted at github_repository_by-repo-url on github.com.
Workflow
- Check whether
DATAIFY_API_TOKENexists in the environment. - If the token is missing, stop and tell the user to sign in at Dataify Dashboard](https://dataify.com/dashboard) to obtain it.
- Ask the user to choose exactly one tool from the following Chinese list:
- 通过仓库URL采集 (github_repository_by-repo-url)
- 通过搜索URL采集 (github_repository_by-search-url)
- 通过URL采集 (github_repository_by-url)
- Read
references/tool-params.jsonand find the chosen tool bytool_signor Chinese tool name. - For each parameter in the chosen tool:
- If
input_modeisuser_input, ask the user for the value. - If
input_modeisselect, present the saved options to the user.
- If
- Use
scripts/build-dataify-request.pyas the default cross-platform helper. - Use
scripts/build-dataify-request.ps1as the Windows PowerShell helper when needed. - When a selectable parameter has a human-readable Chinese label, keep that label in
spider_parameters. Do not replace it with a code such asHKunless the user explicitly asks for the coded value. - Build
spider_parametersas a JSON array. - If every parameter has only one final value, build one object such as
[{"searchurl":"...","country":"Hong Kong"}]. - If one or more parameters have multiple aligned values, zip them by index and build one object per row. Example:
[{"search_url":"url1","page_turning":"1","max_num":"15"},{"search_url":"url2","page_turning":"1","max_num":"15"}]. - If a parameter has one value while another parameter has multiple values, reuse the single value across every generated row.
- Set
spider_nametogithub.com. - Set
spider_idto the selected tool'stool_sign. - Always include
spider_errors=trueandfile_name={{TasksID}}. - Return a curl command for
https://scraperapi.dataify.com/builder.
Set DATAIFY_API_TOKEN
Prefer a permanent environment-variable setup instead of setting the token only for the current terminal session.
Windows PowerShell, permanent for the current user:
[Environment]::SetEnvironmentVariable("DATAIFY_API_TOKEN", "your_token_here", "User")
Then reopen PowerShell. If the current session also needs the token immediately, run:
$env:DATAIFY_API_TOKEN = "your_token_here"
macOS or Linux, permanent for bash:
echo 'export DATAIFY_API_TOKEN="your_token_here"' >> ~/.bashrc
source ~/.bashrc
macOS or Linux, permanent for zsh:
echo 'export DATAIFY_API_TOKEN="your_token_here"' >> ~/.zshrc
source ~/.zshrc
Script usage
Python:
python scripts/build-dataify-request.py --tool-sign \x3Cselected_tool_sign> --values-file values.json
PowerShell:
& ".\scripts\build-dataify-request.ps1" -ToolSign "\x3Cselected_tool_sign>" -ValuesFile ".\values.json"
The values.json file should contain either one object or an array of objects. Example:
[{"searchurl":"https://www.airbnb.com/s/Greece/homes?...","country":"Hong Kong"}]
Required output shape
Generate a curl command in this form:
curl -X POST 'https://scraperapi.dataify.com/builder' \
-H "Authorization: Bearer $DATAIFY_API_TOKEN" \
-H 'Content-Type: application/x-www-form-urlencoded' \
-d 'spider_name=github.com' \
-d 'spider_id=\x3Cselected_tool_sign>' \
-d 'spider_parameters=[{"param":"value"}]' \
-d 'spider_errors=true' \
-d 'file_name={{TasksID}}'
Reference usage
references/tool-params.jsonstores the full saved parameter catalog for every available tool in this scraper family.scripts/build-dataify-request.pyis the portable implementation and should be preferred.scripts/build-dataify-request.ps1mirrors the same behavior for Windows users.- If a parameter has no options, the user must provide the value.
- If a parameter has options, present those options back to the user before building the final request.
- Do not assume
spider_parametersalways contains exactly one object. Multi-value tools may require multiple objects zipped by index. - Use the saved
url_exampleonly as a reference example. Do not assume the user wants the example values unless they explicitly confirm them.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install dataify-github-repository-by-repo-url - 安装完成后,直接呼叫该 Skill 的名称或使用
/dataify-github-repository-by-repo-url触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Dataify Github Repository By Repo Url 是什么?
Prepare Dataify builder requests for the github.com scraper family rooted at github_repository_by-repo-url. Use when needs to work with the successful Dataif... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 40 次。
如何安装 Dataify Github Repository By Repo Url?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install dataify-github-repository-by-repo-url」即可一键安装,无需额外配置。
Dataify Github Repository By Repo Url 是免费的吗?
是的,Dataify Github Repository By Repo Url 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Dataify Github Repository By Repo Url 支持哪些平台?
Dataify Github Repository By Repo Url 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Dataify Github Repository By Repo Url?
由 dataify-server(@dataify-server)开发并维护,当前版本 v1.0.0。