
Dataify Youtube Transcript By Id

by dataify-server · GitHub ↗ · v1.0.0 · MIT-0
cross-platform · ✓ Security Clean · 51 downloads · 0 stars · 0 active installs · 1 version
Install in OpenClaw
/install dataify-youtube-transcript-by-id
Description
Submit Dataify YouTube Transcript by Video ID Builder tasks for the YouTube subtitle file collection tool. Use when the user wants to collect, scrape, crawl,...
README (SKILL.md)

Dataify YouTube Transcript By ID

Submit YouTube subtitle/transcript collection jobs through Dataify Builder by video ID. After a successful submission, give the user the task_id, the returned or inferred status, and tell them to visit https://dataify.com/dashboard/ to view results.

API TOKEN Handling

Use DATAIFY_API_TOKEN as the long-term saved token name.

  • If the user provides a token in the request, use it for this run.
  • If no token is provided, first check whether DATAIFY_API_TOKEN is already saved locally in the environment.
  • If DATAIFY_API_TOKEN is saved locally, use it without asking the user to re-enter the token.
  • If no token is available locally, tell the user they need to provide a Dataify API TOKEN.
  • If the user does not have an API TOKEN, tell them they can register or log in at https://dataify.com/login to get one.
  • If the user already has an API TOKEN, tell them it is available in the top-right area of https://dataify.com/dashboard/.
  • If the user provides an API TOKEN and no local DATAIFY_API_TOKEN is saved, ask whether they want to save it locally as DATAIFY_API_TOKEN for future use.
  • If the user wants to save it, give the appropriate command for their shell and ask them to run it; do not silently persist tokens without confirmation.
  • Do not call the Builder endpoint without a token.
  • Always call it API TOKEN in user-facing instructions. Prefer the environment variable name DATAIFY_API_TOKEN for saved local use.

PowerShell example for setting the token for the current session only:

$env:DATAIFY_API_TOKEN = "YOUR_DATAIFY_API_TOKEN"

For a persistent user-level variable on Windows:

[Environment]::SetEnvironmentVariable("DATAIFY_API_TOKEN", "YOUR_DATAIFY_API_TOKEN", "User")
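
The token resolution order described in this section can be sketched in Python as follows. This is a minimal illustration, not the helper script's actual code: the function name resolve_api_token and its env parameter are hypothetical, and only the variable name DATAIFY_API_TOKEN comes from this document.

```python
import os
from typing import Mapping, Optional

TOKEN_ENV_VAR = "DATAIFY_API_TOKEN"


def resolve_api_token(explicit_token: Optional[str] = None,
                      env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Return the token to use for this run, or None if the user must supply one.

    Hypothetical helper: `env` exists only so the lookup can be tested
    without touching the real process environment.
    """
    env = os.environ if env is None else env
    # A token provided in the request wins for this run.
    if explicit_token and explicit_token.strip():
        return explicit_token.strip()
    # Otherwise fall back to the locally saved environment variable.
    saved = env.get(TOKEN_ENV_VAR, "").strip()
    return saved or None
```

A None result corresponds to the case where the user has to be asked for an API TOKEN before any Builder call is made.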

Core Workflow

  1. Before submitting, show the user the required values, shared values, optional values, and defaults listed in the Parameter Checklist.
  2. For dropdown fields, show all allowed options as Markdown tables with both Label and Value columns. Use scripts/submit_dataify_youtube_transcript_by_id.py --list-options to print the full dropdown tables.
  3. Ask whether the user wants to change any value before running the task.
  4. Ask whether the user wants to collect multiple YouTube transcript groups. If yes, ask for multiple video_id values.
  5. Normalize the final video_id values into a list of spider_parameters objects.
  6. Normalize subtitles_language, subtitles_type, and selected_only into one shared spider_universal object.
  7. Resolve the Dataify token from explicit input or saved DATAIFY_API_TOKEN.
  8. If no token is available, ask the user to enter their API TOKEN and ask whether to save it as DATAIFY_API_TOKEN.
  9. Validate each video ID, dropdown value, and file name.
  10. Submit a Builder request to create the task.
  11. Read data.task_id from the Builder response and read data.status or status when present.
  12. Stop after Builder succeeds.
  13. Tell the user to visit https://dataify.com/dashboard/ to view or manage results.
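
Step 11 above can be sketched as a small parsing helper. The only response fields taken from this document are data.task_id, data.status, and a top-level status; the function name read_task_info and the rest of the response shape are assumptions.

```python
from typing import Any, Dict, Optional, Tuple


def read_task_info(response: Dict[str, Any]) -> Tuple[Optional[str], Optional[str]]:
    """Extract (task_id, status) from an already-decoded Builder JSON response.

    Hypothetical helper: reads only data.task_id, data.status, and the
    top-level status, and invents nothing when a field is absent.
    """
    data = response.get("data")
    if not isinstance(data, dict):
        data = {}
    task_id = data.get("task_id")
    # Prefer data.status, then fall back to a top-level status when present.
    status = data.get("status")
    if status is None:
        status = response.get("status")
    return (None if task_id is None else str(task_id),
            None if status is None else str(status))
```

When task_id comes back as None, the Troubleshooting entry for a missing task_id applies.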

Parameter Checklist

When the user invokes this skill, first show them the values that will be used. Always display the parameters to be submitted as a Markdown table; do not use a plain sentence or bullet list for the parameter confirmation.

| Field | Required | Default | Location | Notes |
| --- | --- | --- | --- | --- |
| video_id | Yes | 8RePenzQH80 | spider_parameters | Unique YouTube video ID used to identify the video whose subtitles should be collected. |
| subtitles_language | No | ab | spider_universal | Dropdown-style shared parameter. Subtitle language. |
| subtitles_type | No | auto_generated | spider_universal | Dropdown-style shared parameter. Subtitle type. |
| selected_only | No | false | spider_universal | Dropdown-style shared parameter. Whether to use only selected specifications. |
| file_name | No | {{TasksID}} | Builder form field | Use the default when the user does not change it. |

Then ask: "Do you want to change any of these values before I submit the task?"

Also ask: "Do you want to collect multiple YouTube transcript groups? If yes, provide multiple video_id values."

If the user has already provided some values, show those values in place of the defaults and only ask whether the remaining/defaulted values should be changed.
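
One way to build the confirmation table, with user-provided values shown in place of defaults, is sketched below. The function name confirmation_table and the Source column are illustrative assumptions; the field names and defaults are the ones listed in this checklist.

```python
from typing import Dict

# Defaults as listed in the Parameter Checklist.
DEFAULTS = {
    "video_id": "8RePenzQH80",
    "subtitles_language": "ab",
    "subtitles_type": "auto_generated",
    "selected_only": "false",
    "file_name": "{{TasksID}}",
}


def confirmation_table(user_values: Dict[str, str]) -> str:
    """Render the values to be submitted as a Markdown table (hypothetical helper)."""
    rows = ["| Field | Value | Source |", "| --- | --- | --- |"]
    for field, default in DEFAULTS.items():
        if field in user_values:
            # Values the user already supplied replace the defaults.
            rows.append("| {} | {} | user |".format(field, user_values[field]))
        else:
            rows.append("| {} | {} | default |".format(field, default))
    return "\n".join(rows)


if __name__ == "__main__":
    print(confirmation_table({"video_id": "dQw4w9WgXcQ"}))
```

Marking each row as user or default makes it easy to ask only about the remaining defaulted values.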

Dropdown Options

Before asking the user to choose dropdown values, show all allowed options as Markdown tables with both Label and Value columns.

Use this command to print the complete tables:

python3 ".\scripts\submit_dataify_youtube_transcript_by_id.py" --list-options

If python3 is not available, use the local Python 3 command for that machine, such as python.

The script prints:

  • subtitles_language options, using the supplied cn value as Label and typeValue as Value.
  • subtitles_type options.
  • selected_only options.
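
As a rough sketch of how option entries with cn and typeValue keys could be turned into a Label/Value table: the function name options_table and the sample entry are hypothetical, and the label text below is a placeholder, not a real Dataify label. The authoritative tables come from the --list-options command.

```python
from typing import Dict, Iterable


def options_table(options: Iterable[Dict[str, str]]) -> str:
    """Render dropdown options as a Markdown table with Label and Value columns.

    Assumes each entry carries a `cn` key (used as Label) and a
    `typeValue` key (used as Value), as described for subtitles_language.
    """
    rows = ["| Label | Value |", "| --- | --- |"]
    for option in options:
        rows.append("| {} | {} |".format(option["cn"], option["typeValue"]))
    return "\n".join(rows)


if __name__ == "__main__":
    # Placeholder entry for illustration only; the label is not from Dataify.
    sample = [{"cn": "Example label", "typeValue": "ab"}]
    print(options_table(sample))
```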

Parameter Handling

  • video_id is required. If the user does not provide it, use the default 8RePenzQH80 only after showing it in the parameter confirmation table.
  • Trim leading and trailing whitespace from video_id.
  • video_id cannot be empty.
  • For multiple collection groups, add one spider_parameters object per video_id; no other field is repeated per group.
  • subtitles_language, subtitles_type, and selected_only are shared parameters. Send them in spider_universal, not inside each spider_parameters object.
  • subtitles_language defaults to ab.
  • subtitles_type defaults to auto_generated.
  • selected_only defaults to false.
  • file_name defaults to {{TasksID}}. If the user changes it, submit the user-provided value.
  • file_name cannot be empty.

Single-group example:

spider_parameters=[{"video_id":"8RePenzQH80"}]
spider_universal={"subtitles_language":"ab","subtitles_type":"auto_generated","selected_only":"false"}

Multi-group example:

spider_parameters=[{"video_id":"8RePenzQH80"},{"video_id":"dQw4w9WgXcQ"}]
spider_universal={"subtitles_language":"ab","subtitles_type":"auto_generated","selected_only":"false"}
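
The normalization rules above can be sketched as a single helper that produces both JSON strings. The function name build_spider_fields is hypothetical; the field names, defaults, and the video_id error message come from this document.

```python
import json
from typing import Dict, Iterable


def build_spider_fields(video_ids: Iterable[str],
                        subtitles_language: str = "ab",
                        subtitles_type: str = "auto_generated",
                        selected_only: str = "false") -> Dict[str, str]:
    """Return spider_parameters and spider_universal as compact JSON strings."""
    # Trim leading and trailing whitespace from every video ID.
    cleaned = [video_id.strip() for video_id in video_ids]
    if not cleaned or any(not video_id for video_id in cleaned):
        raise ValueError("video_id cannot be empty")
    # One object per video ID; shared settings never go in here.
    spider_parameters = [{"video_id": video_id} for video_id in cleaned]
    # Shared settings are sent once, in spider_universal.
    spider_universal = {
        "subtitles_language": subtitles_language,
        "subtitles_type": subtitles_type,
        "selected_only": selected_only,
    }
    return {
        "spider_parameters": json.dumps(spider_parameters, separators=(",", ":")),
        "spider_universal": json.dumps(spider_universal, separators=(",", ":")),
    }
```

Calling build_spider_fields(["8RePenzQH80", "dQw4w9WgXcQ"]) yields the same two strings as the multi-group example.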

Dataify Builder Request

Use form fields rather than hand-built URL-encoded strings.

  • URL: https://scraperapi.dataify.com/builder?platform=1
  • Method: POST
  • Authorization header: Bearer followed by the API TOKEN value (for example, the value saved in DATAIFY_API_TOKEN), not the literal variable name
  • Content type: application/x-www-form-urlencoded
  • Fixed fields:
    • spider_name=youtube.com
    • spider_id=youtube_transcript_by-id
    • spider_errors=true
  • Default field:
    • file_name={{TasksID}}
  • Dynamic fields:
    • spider_parameters must be a JSON string array of video ID objects.
    • spider_universal must be a JSON string object containing shared subtitle settings.
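
A minimal sketch of assembling this request with the Python standard library is shown below. It only builds the request object and does not send it. The function name build_builder_request is hypothetical, and the helper script may construct the request differently; the URL, method, headers, and field names are those listed above.

```python
import json
import urllib.parse
import urllib.request
from typing import Dict, List

BUILDER_URL = "https://scraperapi.dataify.com/builder?platform=1"


def build_builder_request(api_token: str,
                          spider_parameters: List[Dict[str, str]],
                          spider_universal: Dict[str, str],
                          file_name: str = "{{TasksID}}") -> urllib.request.Request:
    """Build (but do not send) the Builder POST request from form fields."""
    if not api_token:
        # The Builder endpoint must never be called without a token.
        raise ValueError("Missing Dataify API TOKEN")
    form = {
        "spider_name": "youtube.com",
        "spider_id": "youtube_transcript_by-id",
        "spider_errors": "true",
        "file_name": file_name,
        # Both dynamic fields are sent as JSON strings.
        "spider_parameters": json.dumps(spider_parameters),
        "spider_universal": json.dumps(spider_universal),
    }
    # urlencode handles the escaping, so no hand-built query strings are needed.
    body = urllib.parse.urlencode(form).encode("utf-8")
    return urllib.request.Request(
        BUILDER_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": "Bearer " + api_token,
            "Content-Type": "application/x-www-form-urlencoded",
        },
    )
```

The returned object could then be passed to urllib.request.urlopen to submit the task. Keeping construction separate from sending makes the form fields easy to inspect before anything leaves the machine.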

Script

For stable execution, prefer scripts/submit_dataify_youtube_transcript_by_id.py with Python 3.6 or newer instead of rewriting the Builder flow.

python3 ".\scripts\submit_dataify_youtube_transcript_by_id.py" --video-id "8RePenzQH80"

To override the saved environment token or default shared parameters for one run:

python3 ".\scripts\submit_dataify_youtube_transcript_by_id.py" --api-token "YOUR_DATAIFY_API_TOKEN" --video-id "8RePenzQH80" --subtitles-language "ab" --subtitles-type "auto_generated" --selected-only "false" --file-name "{{TasksID}}"

To submit multiple video IDs:

python3 ".\scripts\submit_dataify_youtube_transcript_by_id.py" --params-json '[{"video_id":"8RePenzQH80"},{"video_id":"dQw4w9WgXcQ"}]'

The script prints a JSON summary with task_id, status, parameters, spider_universal, file_name, dashboard_url, and message.
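
When the script is launched from Python rather than typed into a shell, the command can be assembled as an argument list, which sidesteps shell quoting of the JSON value and avoids hard-coding a Python path. This is a sketch under assumptions: build_command is a hypothetical name, and only the --video-id and --params-json flags shown above are used.

```python
import json
import sys
from typing import List, Optional

SCRIPT_PATH = "scripts/submit_dataify_youtube_transcript_by_id.py"


def build_command(video_ids: List[str],
                  python_executable: Optional[str] = None) -> List[str]:
    """Return an argv list for the helper script (hypothetical wrapper)."""
    if not video_ids:
        raise ValueError("video_id cannot be empty")
    # Default to the interpreter that is already running this code.
    command = [python_executable or sys.executable, SCRIPT_PATH]
    if len(video_ids) == 1:
        command += ["--video-id", video_ids[0]]
    else:
        # Multiple IDs are passed as one JSON array of video ID objects.
        params = [{"video_id": video_id} for video_id in video_ids]
        command += ["--params-json", json.dumps(params)]
    return command
```

The resulting list can be handed to subprocess.run, and the JSON summary the script prints can be read from its standard output with json.loads.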

Troubleshooting

Missing Dataify API TOKEN means no explicit token was passed and DATAIFY_API_TOKEN is not saved locally. Ask the user to provide their Dataify API TOKEN and whether they want to save it as DATAIFY_API_TOKEN. If they do not have a token, tell them they can register or log in at https://dataify.com/login to get one. If they already have a token, tell them it is in the top-right area of https://dataify.com/dashboard/.

video_id cannot be empty means the required YouTube video ID is missing.

Unsupported subtitles_language means the value must be one of the allowed subtitle language codes.

Unsupported subtitles_type means the value must be auto_generated or uploader_provided.

Unsupported selected_only means the value must be false or true.

File name cannot be empty means no usable file_name was provided.

Necessary parameters is empty! usually means the Builder request was not submitted as form fields, spider_parameters was not a JSON string array, spider_universal was not a JSON string object, or one spider_parameters object is missing video_id.

Missing task_id usually means the authorization header, token, spider_name, spider_id, spider_parameters, or spider_universal is wrong.
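
The local validation messages above could be produced by checks along the following lines. This is an illustrative sketch rather than the script's real validation: validate_inputs is a hypothetical name, and because the full list of subtitle language codes is not reproduced in this document, the allowed languages are passed in by the caller.

```python
from typing import Iterable, List, Optional

ALLOWED_SUBTITLES_TYPE = ("auto_generated", "uploader_provided")
ALLOWED_SELECTED_ONLY = ("false", "true")


def validate_inputs(video_id: str,
                    subtitles_type: str,
                    selected_only: str,
                    file_name: str,
                    subtitles_language: Optional[str] = None,
                    allowed_languages: Optional[Iterable[str]] = None) -> List[str]:
    """Return the list of validation messages; an empty list means the input passed."""
    errors = []
    if not video_id.strip():
        errors.append("video_id cannot be empty")
    # The language check runs only when the caller supplies the allowed codes.
    if allowed_languages is not None and subtitles_language not in set(allowed_languages):
        errors.append("Unsupported subtitles_language")
    if subtitles_type not in ALLOWED_SUBTITLES_TYPE:
        errors.append("Unsupported subtitles_type")
    if selected_only not in ALLOWED_SELECTED_ONLY:
        errors.append("Unsupported selected_only")
    if not file_name.strip():
        errors.append("File name cannot be empty")
    return errors
```

Collecting every message instead of stopping at the first lets all problems be reported to the user in one pass before any Builder request is made.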

Guardrails

  • Do not put subtitles_language, subtitles_type, or selected_only inside spider_parameters.
  • Do not omit spider_universal.
  • Do not poll for results after Builder succeeds.
  • Use only API TOKEN and DATAIFY_API_TOKEN when referring to authentication.
  • Do not hard-code local Python paths.
  • Do not claim the Builder response contains YouTube transcript files.
  • Do not invent result fields.
  • Always direct the user to https://dataify.com/dashboard/ after successful task creation.
Usage Guidance
Install only if you intend to use Dataify for YouTube transcript jobs. Be aware that it may run for broadly worded YouTube transcript requests, and task submissions will use your Dataify API TOKEN and send the chosen video IDs and subtitle options to Dataify.
Capability Assessment
Purpose & Capability
The stated purpose, documentation, and Python script all match: collect YouTube subtitles/transcripts by video ID through Dataify Builder and return task status metadata.
Instruction Scope
The trigger wording and implicit invocation are broad for YouTube transcript-related requests, but the workflow instructs the agent to show parameters and ask about changes before submitting, and the behavior remains purpose-aligned.
Install Mechanism
Artifacts are limited to a skill file, one helper script, icons, and agent metadata; no package install, dependency install, hidden setup, or self-modifying behavior is present.
Credentials
The script reads only DATAIFY_API_TOKEN from the environment, validates user parameters, and sends the token and selected YouTube video IDs to the documented Dataify endpoint.
Persistence & Privilege
The skill recommends saving DATAIFY_API_TOKEN for future use, but explicitly requires user confirmation and tells the user to run the persistence command themselves; the script does not write credentials or create background persistence.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install dataify-youtube-transcript-by-id
  3. After installation, invoke the skill by name or use /dataify-youtube-transcript-by-id
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of the Dataify YouTube Transcript By ID skill.

  • Allows users to submit YouTube transcript/subtitle collection tasks by video ID to Dataify Builder.
  • Automatically handles API TOKEN management with clear instructions for saving and usage.
  • Guides users through required and optional parameters, displaying them in Markdown tables, and prompts for any changes before submitting.
  • Supports batch submission for multiple video ID groups and normalizes shared parameters.
  • Validates all input values and provides links for token registration and result management.
  • Offers localized Chinese phrase triggers and integrates dropdown value selection guidance.
Metadata
Slug dataify-youtube-transcript-by-id
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Dataify Youtube Transcript By Id?

Submit Dataify YouTube Transcript by Video ID Builder tasks for the YouTube subtitle file collection tool. Use when the user wants to collect, scrape, crawl,... It is an AI Agent Skill for Claude Code / OpenClaw, with 51 downloads so far.

How do I install Dataify Youtube Transcript By Id?

Run "/install dataify-youtube-transcript-by-id" in the OpenClaw or Claude Code chat to install it in one step. Submitting tasks additionally requires a Dataify API TOKEN.

Is Dataify Youtube Transcript By Id free?

Yes, Dataify Youtube Transcript By Id is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Dataify Youtube Transcript By Id support?

Dataify Youtube Transcript By Id is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Dataify Youtube Transcript By Id?

It is built and maintained by dataify-server (@dataify-server); the current version is v1.0.0.
