← Back to Skills Marketplace
xxxxxxxxxxxxxxxxxxx20gex

ArXiv Scholar Search

by lyq · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
947
Downloads
0
Stars
5
Active Installs
1
Versions
Install in OpenClaw
/install arxiv-scholar-search
Description
Use the arXiv API for academic paper discovery, relevance screening, and structured output. Suitable for topic-based search, latest-paper discovery, fixed-te...
README (SKILL.md)

Scholar Search Workflow

This skill only uses arXiv as the data source. The workflow is fixed:

  1. Read user input and parse the search intent.
  2. Call the arXiv API (via arxiv_search).
  3. Evaluate result quality and decide whether another search round is needed.
  4. Output results using the required template.

0) File Layout (Overview)

File Path Purpose When to Check
SKILL.md Main entry: task parsing, query syntax, calling conventions, quality control Check first at the beginning of every task

1) Parse User Search Requirements

Extract the following constraints first:

  1. Topic keywords (Chinese or English).
  2. Time preference (for example: "latest", "this year", "last three years").
  3. Number of results (for example: 5, 10, 20).
  4. Output goal (paper discovery, comparison, or brief survey).
  5. Output language (Chinese or English).

Keyword strategy:

  1. Use a broad query in round one (topic terms only).
  2. Add constraint terms in round two (method terms, task terms, category terms).
  3. If results are too few, switch to synonyms or broader terms.

2) arXiv API Guide (Built-in)

2.1 API Endpoint

  1. Base endpoint: https://export.arxiv.org/api/query
  2. Request methods: GET (common), POST (optional when parameters are very long)

2.2 Core Parameters

  1. search_query: query expression
  2. id_list: comma-separated list of arXiv IDs
  3. start: pagination offset (0-based)
  4. max_results: number of returned entries
  5. sortBy: relevance / lastUpdatedDate / submittedDate
  6. sortOrder: ascending / descending

2.3 search_query Syntax

Common field prefixes:

  1. ti (title)
  2. au (author)
  3. abs (abstract)
  4. cat (category)
  5. all (all fields)

Boolean operators:

  1. AND
  2. OR
  3. ANDNOT

Time syntax (submittedDate):

  1. Range: submittedDate:[200701*+TO+200712*]
  2. Specific day: submittedDate:20101225

2.4 Direct URL Examples

  1. Topic query: https://export.arxiv.org/api/query?search_query=all:electron
  2. Combined query: https://export.arxiv.org/api/query?search_query=au:del_maestro+AND+ti:checkerboard
  3. Paginated query: https://export.arxiv.org/api/query?search_query=all:electron&start=0&max_results=10
  4. Sorted query: https://export.arxiv.org/api/query?search_query=ti:%22electron+thermal+conductivity%22&sortBy=lastUpdatedDate&sortOrder=descending

2.5 Rate Limits and Stability (Must Follow)

  1. Keep request frequency at no more than one request every 3 seconds.
  2. Use only one active connection at a time.
  3. Avoid high-frequency repeated queries; reuse existing results whenever possible.

3) Calling Conventions (Must Match Actual Use)

Standard call example:

curl -s "https://export.arxiv.org/api/query?search_query=all:multimodal+AND+cat:cs.CL&start=0&max_results=10&sortBy=submittedDate&sortOrder=descending"

Parameter notes:

  1. search_query: arXiv query expression (supports field prefixes and boolean operators).
  2. start: pagination offset (0-based).
  3. max_results: number of returned entries; recommend 5-20.
  4. sortBy: recommend submittedDate or lastUpdatedDate.
  5. sortOrder: recommend descending.

4) Decide Whether to Continue Searching

Run one more search round if any condition below is met:

  1. Too few papers are returned (for example, \x3C 3).
  2. Results are clearly off-topic.
  3. Key fields are frequently missing (title, link, abstract).

Stop searching when:

  1. Result count reaches the target or an acceptable range.
  2. Topic relevance is high.
  3. Additional search rounds provide low value.

5) Output Requirements

Each paper must use the following structure:

-----------
# {Index}. **{Paper Title}**

**Paper Info**: **Venue/Source**: {journal, conference, or source} | **Publication Date**: {yyyy-mm-dd or unknown} | **Source**: [{source name}]({entry link or paper link}) | **PDF**: [PDF link]({pdf link})

### Research Content
{1-2 objective sentences based on the paper content}

### Main Contributions
- {Contribution 1}
- {Contribution 2}
- {Contribution 3}

Must follow:

  1. Add a standalone line ----------- before every paper title.
  2. Keep the title as a standalone level-1 header line.
  3. Keep "Paper Info" in a single line, separated by |.
  4. Keep the PDF field:
    • If a PDF exists: provide a direct link.
    • If no PDF exists: remove the PDF field.
  5. Content must be based only on source metadata, abstract, or TLDR. No speculation.
  6. Do not add conclusions not explicitly supported by the source.
  7. All links must come from tool output. Do not fabricate or guess links.
  8. Do not fabricate venue, date, or citation count.
  9. Write "Research Content" and "Main Contributions" only from returned fields.

6) Failure Handling

  1. API failure: state the reason and the strategies already attempted.
  2. No results: provide actionable keyword rewrite suggestions.
  3. Missing fields: explicitly mark as "unknown/missing"; do not fill with inferred data.
Usage Guidance
This skill is coherent and focused: it only documents how to query the public arXiv API and how to format results. Before installing, confirm your agent runtime will (1) enforce the stated rate limit (≤1 request per 3 seconds and one active connection) to avoid API blocks, (2) not augment the skill with unrelated network or file access, and (3) actually source links from the API responses rather than fabricating them. Because the skill origin is unknown, consider running it in a restricted environment or reviewing a sample run to ensure the agent's network calls and outputs match expectations.
Capability Analysis
Type: OpenClaw Skill Name: arxiv-scholar-search Version: 1.0.0 The arxiv-scholar-search skill is a legitimate tool designed to search the official arXiv API (export.arxiv.org) for academic papers. The instructions in SKILL.md provide a clear, task-oriented workflow for query construction, rate limiting, and structured reporting without any signs of data exfiltration, malicious execution, or prompt injection attacks.
Capability Assessment
Purpose & Capability
Name and description match the SKILL.md. All required actions and parameters relate directly to querying the arXiv API and formatting results for review.
Instruction Scope
The SKILL.md limits data sources to arXiv, specifies exact query construction, rate limits, and strict rules against fabricating links or metadata. It does reference a generic 'arxiv_search' call/tool; if your agent/platform lacks that helper it will need to perform standard HTTPS GETs (curl/fetch) consistent with the guide. No instructions ask the agent to read unrelated files, env vars, or to exfiltrate data.
Install Mechanism
This is an instruction-only skill with no install spec and no code to write to disk, which minimizes install-time risk.
Credentials
No environment variables, credentials, or config paths are requested. The skill's needs are proportional to an arXiv search workflow.
Persistence & Privilege
always is false and there is no mechanism to persist or elevate privileges. The skill does not request modification of other skills or system-wide settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install arxiv-scholar-search
  3. After installation, invoke the skill by name or use /arxiv-scholar-search
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of arxiv-scholar-search for structured academic paper discovery using the arXiv API. - Enables topic-based paper search, latest-paper discovery, and citation workflows based solely on arXiv data. - Implements a fixed workflow: intent parsing, API querying, automatic quality control, and templated output. - Provides detailed arXiv API syntax guide, including keywords, time, and sorting filters. - Enforces strict output structure with clear separation for title, info, research content, and contributions. - Includes handling for API failures, missing fields, and provides keyword rewriting suggestions for no-result cases. - Ensures all content and metadata are strictly based on retrieved source information, without speculation.
Metadata
Slug arxiv-scholar-search
Version 1.0.0
License MIT-0
All-time Installs 5
Active Installs 5
Total Versions 1
Frequently Asked Questions

What is ArXiv Scholar Search?

Use the arXiv API for academic paper discovery, relevance screening, and structured output. Suitable for topic-based search, latest-paper discovery, fixed-te... It is an AI Agent Skill for Claude Code / OpenClaw, with 947 downloads so far.

How do I install ArXiv Scholar Search?

Run "/install arxiv-scholar-search" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is ArXiv Scholar Search free?

Yes, ArXiv Scholar Search is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does ArXiv Scholar Search support?

ArXiv Scholar Search is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created ArXiv Scholar Search?

It is built and maintained by lyq (@xxxxxxxxxxxxxxxxxxx20gex); the current version is v1.0.0.

💬 Comments