← 返回 Skills 市场

Google Scholar Paper Finder

Name: Google Scholar Paper Finder
Author: figpad

作者 FigPad AI · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ 安全检测通过

总下载

当前安装

版本数

在 OpenClaw 中安装

/install google-scholar-paper-finder

功能描述

Use when the user wants to find more relevant academic papers through real-time Google Scholar retrieval with google-scholar-search-mcp, expand search terms...

使用说明 (SKILL.md)

Google Scholar Paper Finder

Use this Skill to solve two linked problems:

Find a broad but controlled pool of relevant papers through real Google Scholar retrieval using google-scholar-search-mcp, query expansion, seed papers, citations, and related articles.
Keep high-quality papers by scoring venues with local journal/conference quality data and returning a clear table.

Retrieval Contract

Use real retrieval. Do not invent papers, citation counts, authors, venues, DOI values, abstracts, or download links.

Required retrieval order:

Use google-scholar-search-mcp for Google Scholar results.
Retrieve, dedupe, and keep at least 50 Google Scholar candidates whenever Scholar returns enough usable records.
Use cited-by, related-article, seed-author/title, and query-expansion routes inside Google Scholar when the initial results are too narrow.
Use local JSON quality data only after retrieval, never as a discovery source.

Fail closed:

If google-scholar-search-mcp is not available, blocked, rate-limited, or returns no usable data, say so explicitly and stop or ask to install/connect it.
Do not silently replace Google Scholar with PubMed, ordinary Google, generic web search, Crossref, Semantic Scholar, publisher pages, or model memory.
Publisher pages, DOI pages, PDF pages, or repository pages may verify or provide access links, but they must not be presented as Google Scholar search evidence.

Google Scholar does not support true regular expressions. Generate Scholar-compatible search queries with phrases, OR, exclusion terms, and concept combinations. Use regex or keyword patterns only after retrieval to filter titles, snippets, abstracts, and references.

Workflow

1. Clarify The Search Target

Extract or ask for the minimum context needed:

research topic or question;
language preference: English, Chinese, or both;
discipline and methods if known;
desired years if any;
whether to prioritize journal papers, conferences, reviews, empirical studies, or methods;
any seed paper, DOI, title, or author already known.

If the user gives a Chinese topic, generate English academic terms as the default search layer, and keep Chinese terms when Chinese literature is relevant.

2. Generate Search Expansion

Break the topic into 2-4 core concepts:

object/population;
method/technology;
context/domain;
outcome/problem.

For each concept, generate:

exact phrase terms;
synonyms and academic variants;
abbreviations;
narrower and broader terms;
likely negative terms to exclude.

Create multiple Google Scholar MCP queries rather than one giant query:

narrow query for precision;
broad query for recall;
review query;
guideline/consensus query when the discipline uses guidelines;
highly cited/classic query;
method query;
recent-year query;
seed-paper author/title query when available.

See search-workflow.md for query patterns and MCP-specific rules.

3. Retrieve Candidate Papers

Use google-scholar-search-mcp to collect candidates. Default target:

collect 50 deduped Google Scholar papers for normal topic searches;
collect 80-100 candidates first when the topic is broad, then return the best 50;
return fewer only when Google Scholar does not provide enough usable results, and say how many were found.

For each candidate, capture as many fields as possible:

title;
authors;
year;
journal or conference venue;
citation count;
Google Scholar URL or Scholar result identifier;
PDF/download URL if visible;
DOI or publisher URL if available;
snippet/abstract if available;
source database: Google Scholar;
source route: query, cited-by, related-articles, seed-title, seed-author, or term-expansion;
query string that found the paper.

Do not promise every paper has a free PDF. Use "download/access link" and prefer PDF links when visible; otherwise use publisher, DOI, or Scholar links.

4. Expand By Citation Chaining

Use citation chaining before concluding the search is complete:

use "Cited by" for forward chaining;
use "Related articles" for lateral expansion;
search exact titles of high-quality seed papers to recover variants, publisher links, and related clusters;
search core authors from the best seed papers when the field is small;
extract recurring terms from titles/snippets and generate second-round queries;
keep a deduped candidate list.

Stop expanding when 50 deduped candidates are collected, or when new results repeat the same venues/authors/keywords after at least two query rounds.

5. Score Quality

Use optional local venue-quality data after retrieval:

journal_scores.json
ccf_conferences.json
eiiRankingName.json
chinese_journal_tags.json

The default quality-data directory is the skill's data/ folder. Users can also set SCHOLAR_QUALITY_DATA_DIR or pass --data-dir.

Use scripts/score_papers.py whenever the candidate list is available as JSON or CSV. The script enriches papers with impact factor, JCR quartile, CAS zone, CCF rank, EI tag, Chinese core tags, quality score, and recommendation tier. If the quality files are missing, the script must still return a ranked table and mark unmatched venues as unknown venue.

Example:

python3 scripts/score_papers.py candidates.json \
  --markdown papers.md \
  --json enriched.json

If no candidate file exists, score manually using the same rules in quality-scoring.md.

6. Rank By Relevance And Quality

Never recommend a paper only because the venue is prestigious. Use this hierarchy:

high relevance + high-quality venue: core must-read;
high relevance + decent/unknown venue: useful reference;
weak relevance + high-quality venue: optional background;
weak relevance + unknown/low-quality venue: remove or mark cautious.

If relevance cannot be verified from title/snippet/abstract, mark it as "needs manual check" instead of pretending confidence.

Default ranking should combine:

relevance to the user's topic or research question;
source quality from IF, JCR/CAS, CCF, EI, or Chinese core tags;
citation count as a secondary authority signal;
recency when the user asks for current research or clinical guidance.

Final Output

Return a Markdown table by default. Include:

Tier	Source	Title	Authors	Year	Venue	IF	Rank/Tags	Citations	Access	Why keep

Use these recommendation tiers:

Core: highly relevant and high-quality source.
Priority: relevant and quality source.
Reference: relevant but source quality is modest or unknown.
Check: potentially useful but needs manual verification.
Remove: low relevance or weak evidence.

Also include:

search queries used;
tools/sources used, especially whether google-scholar-search-mcp succeeded;
candidate-pool size, dedupe count, and final result count;
expansion terms discovered;
inclusion/exclusion notes;
limitations such as missing abstracts, missing PDFs, or unmatched venues.

安全使用建议

Install this only if you have or trust the Google Scholar MCP server you will connect it to, because research queries and retrieval results flow through that external tool. Treat the generated rankings as literature triage, not authoritative citation validation, since venue metrics may be incomplete or outdated.

能力标签

crypto

能力评估

ℹ Purpose & Capability

The artifacts coherently describe Google Scholar retrieval, query expansion, deduplication, citation expansion, and local venue-quality ranking; the metadata capability tag 'crypto' appears unrelated but is not reflected in the skill behavior.

✓ Instruction Scope

Runtime instructions are scoped to real Google Scholar MCP retrieval, require fail-closed behavior when the MCP is unavailable, and explicitly prohibit inventing papers or substituting other sources silently.

✓ Install Mechanism

Installation is a normal clone into the Codex skills directory and expects a separately configured Google Scholar MCP server; no hidden installer, package execution, or automatic setup script is present.

✓ Credentials

The included Python helper reads user-supplied JSON/CSV candidate files and bundled or user-provided quality JSON files, then writes requested JSON or Markdown outputs, which is proportionate to paper ranking.

✓ Persistence & Privilege

No credential harvesting, local session/profile use, privilege escalation, autorun persistence, background workers, broad local indexing, destructive actions, or undisclosed network code were found.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install google-scholar-paper-finder
安装完成后，直接呼叫该 Skill 的名称或使用 /google-scholar-paper-finder 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

Initial release: real-time Google Scholar search, query expansion, paper deduplication, and venue-quality ranking with bundled JCR/CAS/CCF/EI/CSSCI/core-journal data.

元数据

Slug google-scholar-paper-finder

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题