← 返回 Skills 市场
yossigolan

Companies & Contacts enrichment - Explorium AgentSource

作者 yossigolan · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
412
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install explorium-agentsource-companies-contacts
功能描述
Use this skill when the user wants to find companies (businesses) or people (contacts/prospects/leads) using the AgentSource B2B database. Trigger keywords i...
使用说明 (SKILL.md)

AgentSource Skill

You help users find B2B companies and professionals using the AgentSource API. You manage the complete workflow from query parsing through confirmation and CSV export.

All API operations go through the agentsource CLI tool (agentsource.py). The CLI is discovered at the start of every session and stored in $CLI — it works across all environments (Claude Code, Cowork, OpenClaw, and others). The CLI calls the AgentSource REST API at https://api.explorium.ai/v1/. Results are written to temp files — you run the CLI, read the temp file it outputs, and use that data to guide the conversation.


Prerequisites

Before starting any workflow:

  1. Find the CLI — check the two known install locations:

    CLI=$(python3 -c "
    import pathlib
    candidates = [
      pathlib.Path.home() / '.agentsource/bin/agentsource.py',          # setup.sh install
      pathlib.Path.home() / '.local-plugins/agentsource-plugin/bin/agentsource.py',  # OpenClaw plugin dir
    ]
    found = next((str(p) for p in candidates if p.exists()), '')
    print(found)
    ")
    echo "CLI=$CLI"
    

    If nothing is found, tell the user to run ./setup.sh first.

  2. Verify API key — the CLI accepts the key in two ways:

    • Environment variable (recommended for CI / shared environments): export EXPLORIUM_API_KEY=\x3Ckey>
    • Saved config (recommended for interactive use): run python3 "$CLI" config --api-key \x3Ckey> once

    Check by running a free API call:

    RESULT=$(python3 "$CLI" statistics --entity-type businesses --filters '{"country_code":{"values":["us"]}}')
    python3 -c "import json; d=json.load(open('$RESULT')); print(d.get('error_code','OK'))"
    
    • Prints OK (or any non-auth value) → key is set, proceed.

    • Prints AUTH_MISSING → show this message exactly (do not ask the user to paste or type their API key in chat — API keys should never be shared in conversation):

      To get started, you'll need to set your Explorium AgentSource API key.

      Do not share your API key in this chat. Instead, set it securely using one of these methods:

      Option 1 — Environment variable (recommended):

      export EXPLORIUM_API_KEY="your-key-here"
      # Add to ~/.zshrc or ~/.bashrc to persist across sessions
      

      Option 2 — CLI config (saves to ~/.agentsource/config.json, mode 600):

      python3 \x3Cpath-to-agentsource.py> config --api-key your-key-here
      

      Need a key? Visit developers.explorium.ai for instructions.

      Once the key is set, run your request again and I'll pick it up automatically.

      After the user sets the key via their terminal, re-run the statistics check to confirm it's detected.


CLI Execution Pattern

At the start of every workflow, generate a plan ID and capture the user's query:

PLAN_ID=$(python3 -c "import uuid; print(uuid.uuid4())")
QUERY="find 500 product managers from healthcare companies in the US"

Optionally pass --plan-id and --call-reasoning to group related API calls in Explorium's server-side logs.

Privacy note: --call-reasoning sends the user's query text to api.explorium.ai as part of the request metadata. Only pass it if the user has consented to this. If omitted, the API call is made without that context.

RESULT=$(python3 "$CLI" \x3Ccommand> \x3Cargs> \
  --plan-id "$PLAN_ID" \
  --call-reasoning "$QUERY")   # optional — omit if user has not consented to query logging
# $RESULT is a path like /tmp/agentsource_1234567_fetch.json
cat "$RESULT"

To extract a single field:

python3 -c "import sys,json; d=json.load(open('$RESULT')); print(d['field_name'])"

The Complete Workflow

STEP 1 — Parse Query into Filters

Analyze the user's natural language and map it to API filters. Consult references/filters.md for the full catalog.

Entity type decision:

  • prospects — user mentions people, contacts, decision-makers, names, job titles
  • businesses — user mentions only companies, organizations, accounts

Identify which filters to use, then check for autocomplete requirements.

For each of these fields, you MUST call autocomplete first (see Step 1a):

  • linkedin_category, naics_category, job_title, business_intent_topics, tech_stack, city

Key mutual exclusions (see references/filters.md):

  • Never combine linkedin_category + naics_category
  • Never combine country_code + region_country_code
  • Never combine job_title + job_level/job_department

STEP 1a — Autocomplete Required Fields

For every field that requires autocomplete, run it before building filters. Always pass --semantic to use semantic search:

RESULT=$(python3 "$CLI" autocomplete \
  --entity-type businesses \
  --field linkedin_category \
  --query "software" \
  --semantic \
  --plan-id "$PLAN_ID" \
  --call-reasoning "$QUERY")
cat "$RESULT"

Read the results array. Use the exact value strings returned in your filters — not the user's raw words. If autocomplete returns empty, try a broader query once; if still empty, skip that filter.


STEP 2 — Market Sizing (Free — No Credits)

Get a count before spending any credits:

RESULT=$(python3 "$CLI" statistics \
  --entity-type businesses \
  --filters '{"linkedin_category":{"values":["software development"]},"company_size":{"values":["51-200","201-500"]}}')
cat "$RESULT"

Present total_results to the user. If >50,000, suggest narrowing filters.


STEP 3 — Sample Fetch (5–10 Results)

FETCH_RESULT=$(python3 "$CLI" fetch \
  --entity-type businesses \
  --filters '{"linkedin_category":{"values":["software development"]},"country_code":{"values":["us"]}}' \
  --limit 10)
cat "$FETCH_RESULT"

Record:

  • total_results — total matching entities in the database
  • total_fetched — number fetched into this result file
  • sample — preview rows (first 10)

STEP 4 — Present Sample and WAIT for Confirmation

This step is mandatory — never skip it.

Show the user:

  1. Total results found (e.g., "Found 177,588 matching businesses")
  2. Credit cost estimate (~1 credit per entity fetched)
  3. Sample rows as a markdown table
  4. Ask explicitly:

"Would you like to:

  • Fetch all [N] results and export to CSV
  • Add enrichments (firmographics, tech stack, funding, contacts, etc.)
  • Add event data (funding rounds, hiring signals, etc.)
  • Refine the search (adjust filters)"

NEVER proceed to a full fetch or CSV export without the user's explicit confirmation.


STEP 5 — Full Fetch (after confirmation)

Re-run fetch with the desired total count. The CLI paginates automatically in batches of 500:

FETCH_RESULT=$(python3 "$CLI" fetch \
  --entity-type businesses \
  --filters '{"linkedin_category":{"values":["software development"]},"country_code":{"values":["us"]}}' \
  --limit 1000)
cat "$FETCH_RESULT"

The result file has data (array of all entities), total_fetched, pages_fetched.


STEP 6 (Optional) — Enrich

Only if user requested enrichment. Consult references/enrichments.md. The enrich command reads a fetch result file, runs bulk enrichment in batches of 50, and merges enrichment data back into each entity:

ENRICH_RESULT=$(python3 "$CLI" enrich \
  --input-file "$FETCH_RESULT" \
  --enrichments "firmographics,technographics")
cat "$ENRICH_RESULT"

For prospects (to get emails and phones):

ENRICH_RESULT=$(python3 "$CLI" enrich \
  --input-file "$FETCH_RESULT" \
  --enrichments "contacts_information,profiles")
cat "$ENRICH_RESULT"

After enrichment, the result file has the same structure but with enrichment data merged into each entity. Show the enriched sample (first 5 entries) to the user.


STEP 7 (Optional) — Event Data

Only for businesses. Consult references/events.md for event types. The events command reads a fetch result file and retrieves events for all business_id values in it:

EVENTS_RESULT=$(python3 "$CLI" events \
  --input-file "$FETCH_RESULT" \
  --event-types "new_funding_round,hiring_in_engineering_department" \
  --since "2025-11-01")
cat "$EVENTS_RESULT"

The result file has data (array of event objects, each with business_id, event_name, event_time, and event-specific fields).


STEP 8 — Export to CSV

Convert the fetch (or enrich) result file to a local CSV:

CSV_RESULT=$(python3 "$CLI" to-csv \
  --input-file "$FETCH_RESULT" \
  --output ~/Downloads/us_saas_companies.csv)
cat "$CSV_RESULT"

Read csv_path and row_count from the result and present them to the user:

"Your CSV is ready: ~/Downloads/us_saas_companies.csv — 1,000 rows, 18 columns."

For events, convert the events result file separately:

python3 "$CLI" to-csv \
  --input-file "$EVENTS_RESULT" \
  --output ~/Downloads/funding_events.csv

Error Handling

If a result file contains "success": false, read error_code:

error_code Action
AUTH_MISSING / AUTH_FAILED (401) Ask user to set EXPLORIUM_API_KEY or run config --api-key
FORBIDDEN (403) Show error message; may be a credit or permission issue
BAD_REQUEST (400) / VALIDATION_ERROR (422) Fix the filter — check references/filters.md; run autocomplete if needed
RATE_LIMIT (429) Wait 10 seconds and retry once
SERVER_ERROR (5xx) Wait 5 seconds and retry once; report if it persists
NETWORK_ERROR Ask user to check connectivity and retry

Special Workflows

Start from an Existing CSV

When a user has an existing list (companies or contacts) and wants to enrich or extend it:

Step 1 — Convert the CSV to a JSON temp file (full data stays out of context):

CSV_JSON=$(python3 "$CLI" from-csv \
  --input ~/Downloads/my_accounts.csv)

Step 2 — Read ONLY the metadata into context (columns + 5 sample rows — never cat the full file):

python3 -c "
import json
d = json.load(open('$CSV_JSON'))
print('rows:', d['total_rows'])
print('columns:', d['columns'])
print('sample:')
for r in d['sample']: print(r)
"

Inspect the column names and sample values. Use your judgment to map them to the correct API fields:

  • Businesses: identify which column is the company name → name; which is the website/domain → domain
  • Prospects: identify the person's name → full_name (or first_name+last_name); employer → company_name; contact → email or linkedin
    • CRITICAL: the prospect LinkedIn field is "linkedin"never "linkedin_url" (that name is only valid for businesses)

Step 3 — Match with your deduced column map (batches automatically, 50 rows per call):

# For a company list — pass your deduced mapping explicitly:
MATCH_RESULT=$(python3 "$CLI" match-business \
  --input-file "$CSV_JSON" \
  --column-map '{"Company Name": "name", "Website URL": "domain"}' \
  --plan-id "$PLAN_ID" --call-reasoning "$QUERY")
python3 -c "import json; d=json.load(open('$MATCH_RESULT')); print('matched:', d['total_matched'], '/', d['total_input'])"

# For a contact list (note: LinkedIn field is "linkedin", NOT "linkedin_url"):
MATCH_RESULT=$(python3 "$CLI" match-prospect \
  --input-file "$CSV_JSON" \
  --column-map '{"Full Name": "full_name", "Employer": "company_name", "Work Email": "email", "LinkedIn": "linkedin"}' \
  --plan-id "$PLAN_ID" --call-reasoning "$QUERY")

If --column-map is omitted, the CLI falls back to auto-alias matching on lowercased column names (e.g. company_name, domain, website are recognised automatically). Always prefer the explicit map for better match rates.

Step 4 — Continue the normal workflow

The match result has the same data array format as a fetch result, so it plugs directly into enrich or events:

ENRICH_RESULT=$(python3 "$CLI" enrich \
  --input-file "$MATCH_RESULT" \
  --enrichments "firmographics,technographics" \
  --plan-id "$PLAN_ID" --call-reasoning "$QUERY")

Match a User-Provided List (no CSV)

When a user types a list of companies or people directly in their message (e.g. "enrich Salesforce, HubSpot, and Notion" or "get emails for John Smith at Apple and Jane Doe at Google"), construct the match payload inline from what they wrote — no CSV needed.

Company list → match-business:

MATCH_RESULT=$(python3 "$CLI" match-business \
  --businesses '[
    {"name": "Salesforce", "domain": "salesforce.com"},
    {"name": "HubSpot",    "domain": "hubspot.com"},
    {"name": "Notion",     "domain": "notion.so"}
  ]' \
  --plan-id "$PLAN_ID" --call-reasoning "$QUERY")
python3 -c "import json; d=json.load(open('$MATCH_RESULT')); print('matched:', d['total_matched'], '/', d['total_input'])"

Include as many identifiers as the user gave: name, domain, or both. More fields = better match rate.

Contact list → match-prospect:

MATCH_RESULT=$(python3 "$CLI" match-prospect \
  --prospects '[
    {"full_name": "John Smith",  "company_name": "Apple"},
    {"full_name": "Jane Doe",    "company_name": "Google", "email": "[email protected]"}
  ]' \
  --plan-id "$PLAN_ID" --call-reasoning "$QUERY")

After matching, pipe the result directly into enrich or to-csv as normal.


Find Prospects at Specific Companies

  1. Match companies to get their business_id values:
    RESULT=$(python3 "$CLI" match-business \
      --businesses '[{"name":"Salesforce","domain":"salesforce.com"}]')
    cat "$RESULT"
    
  2. Extract the business_id and use it as a filter in the prospect fetch:
    BID=$(python3 -c "import json; print(json.load(open('$RESULT'))['data'][0]['business_id'])")
    FETCH_RESULT=$(python3 "$CLI" fetch \
      --entity-type prospects \
      --filters "{\"business_id\":{\"values\":[\"$BID\"]},\"job_level\":{\"values\":[\"c-suite\"]}}")
    

Companies → Prospects (Chaining)

  1. Fetch target companies
  2. Extract their business_id values from the result file
  3. Pass them in the business_id filter when fetching prospects

Buying Intent

When user wants to find companies showing interest in a product/topic:

  1. autocomplete --entity-type businesses --field business_intent_topics --query "CRM" --semantic → get standardized values
  2. Use them in the business_intent_topics filter in fetch

Pagination Notes

The fetch command paginates automatically. With --limit 1000:

  • Issues page 1 (500 records) then page 2 (500 records)
  • Writes all 1000 into a single result file
  • pages_fetched in the result tells you how many pages were used
  • total_results is the full database count matching your filters

The enrich command handles its own batching (50 IDs per API call) internally. The events command batches 40 business IDs per API call internally.

安全使用建议
This package appears to implement exactly what it claims (a CLI wrapper around Explorium's AgentSource API), but the registry metadata omitted the required API key and install notes. Before installing: (1) confirm you trust the author/source since the package will store an API key locally at ~/.agentsource/config.json (mode 600) if you choose to save it; (2) inspect bin/agentsource.py (included) and setup.sh (included) yourself — they perform local file operations and call https://api.explorium.ai/v1/, and do not download additional code; (3) prefer setting EXPLORIUM_API_KEY as an environment variable in your shell rather than pasting it into chat; (4) if you want higher assurance, run setup.sh and the CLI in an isolated environment (container or VM) and validate network endpoints and behavior; and (5) ask the publisher/registry maintainer to fix the registry metadata so required credentials and install steps are clearly declared. Additional provenance (homepage, publisher identity, or signed releases) would raise confidence.
功能分析
Type: OpenClaw Skill Name: explorium-agentsource-companies-contacts Version: 1.0.0 The skill bundle is classified as suspicious due to significant vulnerabilities in `bin/agentsource.py`. The `cmd_from_csv` subcommand allows reading arbitrary local files (e.g., `~/.ssh/id_rsa`) by writing their content to a temporary JSON file, which could then be exfiltrated via prompt injection against the agent. Additionally, the `cmd_to_csv` subcommand allows writing API-controlled JSON data to arbitrary local file paths (e.g., `~/.bashrc`), potentially causing denial of service or data corruption. While there is no evidence of intentional malicious behavior, these vulnerabilities pose a high risk for unauthorized file access and modification.
能力评估
Purpose & Capability
The SKILL.md, plugin.json, README, setup.sh, and CLI all require an EXPLORIUM_API_KEY and clearly implement B2B search/enrichment via https://api.explorium.ai/v1/ — however the registry metadata at the top reported 'Required env vars: none' and 'Primary credential: none'. That metadata omission is an inconsistency between what the skill claims in the registry and what it actually needs and does.
Instruction Scope
Runtime instructions are narrowly scoped to locating/running the included CLI, mapping user queries to API filters, calling the AgentSource REST API, writing results to /tmp JSON files, and optionally saving the API key to ~/.agentsource/config.json. The SKILL.md instructs not to paste API keys into chat and requires explicit consent before sending free-text 'call_reasoning'. No instructions ask the agent to read unrelated system data or exfiltrate data to unknown endpoints.
Install Mechanism
There is no registry install spec, but the package includes setup.sh which copies bin/agentsource.py to ~/.agentsource/bin and optionally writes ~/.agentsource/config.json (mode 600). The installer performs no network downloads and uses only local file operations. The absence of an explicit install spec in the registry vs. an included installer script is a packaging inconsistency users should be aware of.
Credentials
Functionally the skill only needs one credential (EXPLORIUM_API_KEY) which is proportionate for a REST API-based enrichment tool. The concern is that the top-level registry metadata omitted this requirement while plugin.json and SKILL.md declare it required — that metadata mismatch could mislead users into installing without realizing a secret is needed/stored locally (~/.agentsource/config.json).
Persistence & Privilege
The skill does not request always:true and does not modify other skills or global agent settings. It creates a local config file (~/.agentsource/config.json) only if the user opts to save the API key, and writes temporary results to /tmp as documented. These behaviors are expected for a CLI wrapper.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install explorium-agentsource-companies-contacts
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /explorium-agentsource-companies-contacts 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
AgentSource Skill v1.0.0 – Initial Release - Enables finding and exporting B2B companies and professional contacts using the AgentSource API and CLI. - Guides users step-by-step through query parsing, autocomplete, market sizing, sample review, and data export workflows. - Ensures API credentials are set securely; displays clear instructions if not detected. - Requires explicit user confirmation before fetching full datasets or exporting to CSV. - Supports enrichment and event data options following search confirmation. - Provides detailed workflow guidance for all environments, including privacy and setup notes.
元数据
Slug explorium-agentsource-companies-contacts
版本 1.0.0
许可证
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Companies & Contacts enrichment - Explorium AgentSource 是什么?

Use this skill when the user wants to find companies (businesses) or people (contacts/prospects/leads) using the AgentSource B2B database. Trigger keywords i... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 412 次。

如何安装 Companies & Contacts enrichment - Explorium AgentSource?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install explorium-agentsource-companies-contacts」即可一键安装,无需额外配置。

Companies & Contacts enrichment - Explorium AgentSource 是免费的吗?

是的,Companies & Contacts enrichment - Explorium AgentSource 完全免费(开源免费),可自由下载、安装和使用。

Companies & Contacts enrichment - Explorium AgentSource 支持哪些平台?

Companies & Contacts enrichment - Explorium AgentSource 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Companies & Contacts enrichment - Explorium AgentSource?

由 yossigolan(@yossigolan)开发并维护,当前版本 v1.0.0。

💬 留言讨论