← Back to Skills Marketplace

arXiv Source Reader

Name: arXiv Source Reader
Author: willamhou

by Will.hou · GitHub ↗ · v1.0.5 · MIT-0

cross-platform ✓ Security Clean

580

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install arxiv-source

Description

Read and analyze arXiv papers by fetching LaTeX source, listing sections, or extracting abstracts

README (SKILL.md)

arxiv-reader

Read and analyze arXiv papers by fetching their public LaTeX source. Converts LaTeX into clean text suitable for LLM analysis.

Description

This skill fetches arXiv papers from the public arXiv API (arxiv.org), flattens LaTeX includes, and returns clean text. No local file access is required — all content is fetched over HTTPS from arXiv's public endpoints and cached in memory for the session.

Network access: Only connects to arxiv.org and export.arxiv.org to download publicly available paper source tarballs and metadata. No other network connections are made. No data is sent to external services — this is read-only.

Caching: Results are cached in memory (process-scoped) for fast repeat access within the same session. No files are written to disk.

Usage Examples

"Read the paper 2301.00001 from arXiv"
"What sections does paper 2405.12345 have?"
"Get the abstract of 2312.09876"
"Fetch paper 2301.00001 without the appendix"

Process

Quick look — Use arxiv_abstract to get a paper's abstract before committing to a full read
Survey structure — Use arxiv_sections to understand the paper's outline
Deep read — Use arxiv_fetch to get the full flattened LaTeX for analysis

Tools

arxiv_fetch

Fetch the full flattened LaTeX source of an arXiv paper.

Parameters:

arxiv_id (string, required): arXiv paper ID (e.g. 2301.00001 or 2301.00001v2)
remove_comments (boolean, optional): Strip LaTeX comments (default: true)
remove_appendix (boolean, optional): Remove appendix sections (default: false)
figure_paths (boolean, optional): Replace figures with file paths only (default: false)

Returns: { content: string, arxiv_id: string, cached: boolean }

Example:

{ "arxiv_id": "2301.00001", "remove_appendix": true }

arxiv_sections

List all sections and subsections of an arXiv paper.

Parameters:

arxiv_id (string, required): arXiv paper ID

Returns: { arxiv_id: string, sections: string[] }

Example:

{ "arxiv_id": "2301.00001" }

arxiv_abstract

Extract just the abstract from an arXiv paper.

Parameters:

arxiv_id (string, required): arXiv paper ID

Returns: { arxiv_id: string, abstract: string }

Example:

{ "arxiv_id": "2301.00001" }

Notes

Results are cached in memory — repeat requests within the same session are instant
Paper IDs support version suffixes (e.g. 2301.00001v2)
Very large papers may take 10-30 seconds on first fetch
arxiv_abstract uses the public arXiv Atom API for fast metadata retrieval
No filesystem writes — all caching is in-memory only
Only connects to arxiv.org (read-only, public data)

Usage Guidance

This skill is internally consistent: it only declares read access to public arXiv endpoints and requests no credentials or installs. Caveats: (1) the package is instruction-only with no implementation files or publisher metadata — the actual runtime implementation (which will run when the agent invokes the skill) could behave differently, so inspect or obtain the implementation before granting broad trust; (2) LaTeX content can reference external resources (embedded URLs or remote graphics) — confirm that any implementation does not follow or fetch external URLs beyond arxiv.org/export.arxiv.org if you need strict network isolation; (3) if you plan to run this in an environment with sensitive network access, consider running it in a sandbox or requesting the code from the publisher. If you want higher assurance, ask the publisher for the runtime code or an allowlist of exact network actions the skill will perform.

Capability Analysis

Type: OpenClaw Skill Name: arxiv-source Version: 1.0.5 The arxiv-source skill bundle (v1.0.5) is a legitimate utility designed to fetch and process academic papers from arXiv. The documentation in SKILL.md clearly defines its scope, limiting network access to arxiv.org and export.arxiv.org, and explicitly states that it performs no local filesystem writes and maintains an in-memory cache. No evidence of prompt injection, data exfiltration, or malicious instructions was found in the provided files.

Capability Assessment

✓ Purpose & Capability

Name and description match the declared requirements and behavior: fetching public arXiv source tarballs and metadata from arxiv.org/export.arxiv.org. No unrelated environment variables, binaries, or config paths are requested.

✓ Instruction Scope

SKILL.md restricts actions to reading from arxiv.org and export.arxiv.org, performing in-memory caching, and not writing to disk. It does not ask to read user files, other env vars, or contact other hosts in the written instructions.

✓ Install Mechanism

No install spec and no code files are present, which is the lowest-risk model for installs. There is nothing that will be written to disk or downloaded as part of an install step.

✓ Credentials

The skill requests no credentials, no config paths, and no environment variables — appropriate for a read-only fetcher of public arXiv data.

✓ Persistence & Privilege

always is false and the skill does not request permanent presence or system-level changes. Autonomous invocation is allowed by default on the platform but is not combined with other concerning privileges here.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install arxiv-source
After installation, invoke the skill by name or use /arxiv-source
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.5

Declare runtime and network access, clarify read-only public API access, in-memory cache only

v1.0.4

Remove executable code to resolve security flags

v1.0.3

- Initial release as version 1.0.3 - Added core implementation files: index.ts and manifest.json - Provides tools to fetch arXiv LaTeX papers, extract abstracts, and list sections - Supports both standalone and container execution modes - Results are locally cached for faster subsequent access

v1.0.2

- Removed source files: index.ts and manifest.json - Skill implementation has been deleted; documentation remains unchanged - Functionality is effectively disabled in this version

v1.0.1

Add OpenClaw metadata (emoji, tags, platform requirements)

v1.0.0

Initial release: fetch LaTeX source, list sections, extract abstracts from arXiv papers. Zero-dependency standalone mode.

Metadata

Slug arxiv-source

Version 1.0.5

License MIT-0

All-time Installs 5

Active Installs 5

Total Versions 6

Frequently Asked Questions

What is arXiv Source Reader?

Read and analyze arXiv papers by fetching LaTeX source, listing sections, or extracting abstracts. It is an AI Agent Skill for Claude Code / OpenClaw, with 580 downloads so far.

How do I install arXiv Source Reader?

Run "/install arxiv-source" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is arXiv Source Reader free?

Yes, arXiv Source Reader is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does arXiv Source Reader support?

arXiv Source Reader is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created arXiv Source Reader?

It is built and maintained by Will.hou (@willamhou); the current version is v1.0.5.

More Skills

arXiv Source Reader

arxiv-reader

Description

Usage Examples

Process

Tools

arxiv_fetch

arxiv_sections

arxiv_abstract

Notes

What is arXiv Source Reader?

How do I install arXiv Source Reader?

Is arXiv Source Reader free?

Which platforms does arXiv Source Reader support?

Who created arXiv Source Reader?

💬 Comments