Description

Universal web clipper for Obsidian Vault. Saves content from X/Twitter, WeChat, Douyin, Xiaohongshu, GitHub, and generic web pages. Triggers when user sends...

README (SKILL.md)

Obsidian Clipper

Name: Clip to Your Vault
Author: zgissing

Universal clipper — saves URLs from any platform to your Obsidian Vault with local media, tags, and wikilinks.

Configuration

On first run, read config.yml from the same directory as this SKILL.md file. If missing, tell the user to copy config.yml.example to config.yml and fill in vault.base_path.

Key paths derived from config:

BASE        = config.vault.base_path
ATTACHMENTS = BASE / config.vault.attachments_dir
X_DIR       = BASE / config.vault.dirs.x
WECHAT_DIR  = BASE / config.vault.dirs.wechat
XHS_DIR     = BASE / config.vault.dirs.xiaohongshu
DOUYIN_DIR  = BASE / config.vault.dirs.douyin
GITHUB_DIR  = BASE / config.vault.dirs.github
WEB_DIR     = BASE / config.vault.dirs.web

URL Router

Match the URL and dispatch to the correct handler:

URL pattern	Handler
`x.com/` or `twitter.com/`	X Handler
`mp.weixin.qq.com/*`	WeChat Handler
`xhslink.com/` or `xiaohongshu.com/`	Xiaohongshu Handler
`v.douyin.com/` or `douyin.com/video/`	Douyin Handler
`github.com/{owner}/{repo}` (no deeper path)	GitHub Handler
Everything else	Web Handler

Shared Rules

These apply to ALL handlers:

File naming

Use the content title as filename
Strip /\:*?"\x3C>| and all emoji / special Unicode
Truncate if >60 characters
If a file with the same name exists, ask the user before overwriting

Media download

Download all images and videos to ATTACHMENTS/
Image naming: {title-slug}-{n}.{ext} (slug ≤20 chars, no emoji)
Video naming: {title-slug}.mp4 or {title-slug}-video-{n}.mp4
Replace remote URLs with Obsidian wikilinks: ![[filename.ext]]

"Why clipped" field

Infer from conversation context if possible
If context is insufficient, ask the user

Cross-platform linking

If the content contains a github.com/{owner}/{repo} link, auto-trigger the GitHub Handler to create a GitHub note, then add bidirectional wikilinks

X Handler

Saves X (Twitter) posts and articles.

Step 1: Fetch post data

curl -s "https://api.fxtwitter.com/{handle}/status/{tweet_id}"

Extract from URL: x.com/{handle}/status/{id} → handle, tweet_id.

Fields:

tweet.author.name → author name
tweet.author.screen_name → @handle
tweet.text → post body (short posts)
tweet.article → long-form article (if present)
tweet.article.title → article title
tweet.article.content.blocks → structured content (Draft.js format)
tweet.created_at → publish date
tweet.likes / tweet.retweets / tweet.views → engagement
tweet.media → short post images
tweet.article.media_entities → article inline images

Step 2: Determine content type

Short post (tweet.article is null): use tweet.text, images from tweet.media.photos.

Article (tweet.article exists): parse tweet.article.content.blocks (Draft.js):

block type	Markdown
`unstyled`	paragraph
`header-one`	`# heading`
`header-two`	`## heading`
`header-three`	`### heading`
`unordered-list-item`	`- item`
`ordered-list-item`	`1. item`
`atomic`	insert image `![[filename]]`
`blockquote`	`> quote`

Inline styles: Bold → **text**, Italic → *text*.

For atomic blocks: entityMap → value.data.mediaItems[].mediaId → match media_entities → media_info.original_img_url.

Step 3: Download images

Download all images to ATTACHMENTS/.

Step 4: Generate file

Write to X_DIR/{title}.md:

Short post:

---
title: "{author name}的推文 - {first 30 chars}"
author: "{author name}"
handle: "@{screen_name}"
source: {original URL}
date: {publish date YYYY-MM-DD}
tags:
  - clipping
  - {auto tags}
---

# {author name}的推文

> X: @{handle} ({name}) | {date} | {likes} likes · {retweets} retweets · {views} views

{body text}

{images ![[filename]]}

Article:

---
title: "{article.title}"
author: "{author name}"
handle: "@{screen_name}"
source: {original URL}
date: {publish date YYYY-MM-DD}
tags:
  - clipping
  - {auto tags}
---

# {article.title}

> X: @{handle} ({name}) | {date} | {likes} likes · {retweets} retweets · {views} views

{parsed Markdown body with ![[local images]]}

Notes

fxtwitter API needs no auth but may rate-limit; try vxtwitter.com as fallback
Title: articles use article.title; short posts use {name}-{first 15 chars of text}

WeChat Handler

Saves WeChat Official Account (公众号) articles.

Step 1: Fetch article data

curl -s "https://down.mptext.top/api/public/v1/download?url={URL-encoded-link}&format=json"

Extract: title, nick_name, create_time, content_noencode, link.

If API returns 204 or fails, fall back to defuddle or WebFetch.

Step 2: Fetch HTML (if rich content needed)

If content has images or complex formatting:

curl -s "https://down.mptext.top/api/public/v1/download?url={URL-encoded-link}&format=html"

Extract image URLs and formatted content from HTML.

Step 3: Download images

Download each image to ATTACHMENTS/
WeChat image URLs: mmbiz.qpic.cn/mmbiz_png/... → .png, mmbiz_jpg/... → .jpg
Replace with ![[filename.jpg]]

Step 4: Generate file

Write to WECHAT_DIR/{title}.md:

---
title: "{article title}"
author: "{公众号 name}"
source: {original link}
date: {publish date YYYY-MM-DD}
tags:
  - clipping
  - {auto tags}
---

# {article title}

> 公众号：{name} | {publish date}

{body with ![[local images]]}

Xiaohongshu Handler

Saves 小红书 notes (images + video).

Step 1: Resolve short link

If URL is xhslink.com, follow redirects to get the real URL with full query parameters (especially xsec_token):

curl -sL "\x3Cshort-link>" -o /dev/null -w "%{url_effective}"

Important: The full query string is required. Bare URLs return empty noteDetailMap.

Step 2: Fetch SSR data

Xiaohongshu is an SPA, but SSR embeds data in window.__INITIAL_STATE__:

curl -sL "\x3Cfull-URL-with-params>" \
  -H "User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36" \
  -H "Accept: text/html" | python3 -c "
import sys, re, json

html = sys.stdin.read()
m = re.search(r'window\.__INITIAL_STATE__\s*=\s*(\{.*?\})\s*\x3C/script>', html, re.DOTALL)
if not m:
    print('NO_DATA')
    sys.exit(1)

raw = m.group(1).replace(':undefined', ':null')
d = json.loads(raw)

note_map = d.get('note', {}).get('noteDetailMap', {})
for nid, detail in note_map.items():
    n = detail.get('note', {})
    result = {
        'title': n.get('title', ''),
        'desc': n.get('desc', ''),
        'type': n.get('type', ''),
        'author': n.get('user', {}).get('nickname', ''),
        'time': n.get('time', ''),
        'images': [],
        'video_url': None,
        'tags': [t.get('name', '') for t in n.get('tagList', [])]
    }
    for img in n.get('imageList', []):
        url = img.get('urlDefault', '') or img.get('urlPre', '') or img.get('url', '')
        if url: result['images'].append(url)
    video = n.get('video', {})
    if video:
        media = video.get('media', {})
        stream = media.get('stream', {})
        for codec in ['h264', 'h265', 'h266', 'av1']:
            streams = stream.get(codec, [])
            if streams:
                result['video_url'] = streams[0].get('masterUrl', '')
                break
    print(json.dumps(result, ensure_ascii=False))
"

Step 3: Download media

Images: curl -L -o each image to ATTACHMENTS/
Video (if present): curl -L -o to ATTACHMENTS/{title-slug}-video.mp4

Step 4: Generate file

Write to XHS_DIR/{title}.md:

---
title: "{title}"
author: "{author}"
source: {real URL}
domain: xiaohongshu.com
date: {today YYYY-MM-DD}
tags:
  - clipping
  - {tags from tagList, 2-4}
---

# {title}

> 来源：小红书 | {author} | {publish date}

![[cover-image.jpg]]

![[video.mp4]]  (if video exists)

{desc body, strip [话题]# markers}

## 标签

{tags joined with /}

Notes

If curl can't reach Xiaohongshu, add --proxy {config.xiaohongshu.proxy}
desc often contains [话题]#tag[话题]# markers — strip them for clean text

Douyin Handler

Saves 抖音 videos. Requires douyin-downloader tool — check config.douyin.enabled first. If disabled, tell the user to install douyin-downloader and enable it in config.

Step 1: Extract and resolve link

From share text like 7.94 复制打开抖音...https://v.douyin.com/xxxxx/ DUL:/..., extract the v.douyin.com short link.

Resolve to full video ID:

curl -sI -L --max-redirs 5 "https://v.douyin.com/xxxxx/" 2>&1 | grep -i location | grep '/video/' | sed 's|.*/video/\([0-9]*\).*|\1|' | head -1

Construct: https://www.douyin.com/video/{aweme_id}

Step 2: Download with douyin-downloader

Edit {config.douyin.tool_path}/config.yml: set link to the full URL

Run:

cd {config.douyin.tool_path} && {config.douyin.python} run.py

Find files in Downloaded/ — .mp4, _cover.jpg, _data.json

Step 3: Extract metadata

From _data.json:

desc → description (title + hashtags)
author.nickname → author
statistics.digg_count → likes
statistics.comment_count → comments
statistics.share_count → shares
create_time → Unix timestamp → convert to YYYY-MM-DD

Step 4: Copy media to vault

Copy video and cover to ATTACHMENTS/.

Step 5: Generate file

Write to DOUYIN_DIR/{title}.md:

---
title: "{video title}"
source: https://www.douyin.com/video/{aweme_id}
author: "{author}"
platform: 抖音
digg: {likes}
comment: {comments}
share: {shares}
created: {publish date YYYY-MM-DD}
date: {today YYYY-MM-DD}
tags:
  - clipping
  - 抖音
  - {auto tags}
---

# {video title}

> {original desc with hashtags}

## 为什么收藏

{inferred or asked}

## 视频

![[{video-file}.mp4]]

## 封面

![[{cover-file}_cover.jpg]]

Notes

Short links MUST be resolved first — douyin-downloader cannot handle them

If download fails (0% success), cookies may be expired — tell user to re-run cookie fetcher:

cd {config.douyin.tool_path} && {config.douyin.python} -m tools.cookie_fetcher --config config.yml

create_time is Unix timestamp: python3 -c "import datetime; print(datetime.datetime.fromtimestamp({ts}).strftime('%Y-%m-%d'))"

GitHub Handler

Saves GitHub repositories.

Step 1: Fetch repo metadata

curl -s "https://api.github.com/repos/{owner}/{repo}"

Extract: name, full_name, description, stargazers_count, language, license.spdx_id, topics, created_at, html_url, homepage.

Step 2: Fetch README

curl -s "https://api.github.com/repos/{owner}/{repo}/readme"

Decode base64 content to get README markdown.

Step 3: Summarize README

Do NOT copy the full README. Extract:

Core features: 3-5 bullet points, one sentence each
Quick start: only the most essential command or API snippet
Notes: limitations, dependencies, special requirements

Step 4: Generate file

Write to GITHUB_DIR/{repo-name}.md:

---
title: "{repo name}"
source: {repo URL}
author: "{owner}"
stars: {star count}
language: "{primary language}"
license: "{license}"
created: {repo created date YYYY-MM-DD}
date: {today YYYY-MM-DD}
tags:
  - clipping
  - github
  - {auto tags from topics}
---

# {repo name}

> {one-line description}

## 为什么收藏

{inferred or asked}

## 核心功能

- {point 1}
- {point 2}
- ...

## 快速开始

{minimal code block}

## 备注

{limitations, dependencies, notes}

Notes

Star count as raw number (no 1.2k formatting — for Dataview sorting)
Use name not full_name for filename
If API returns 403 (rate limit), tell user to wait
Strip emoji from filenames

Web Handler

Saves generic web pages. Fallback for URLs that don't match any platform handler.

Step 1: Extract content with defuddle

defuddle parse \x3Curl> --json

Extract: title, author, content (HTML), description, published, domain.

If defuddle is not installed, run npm install -g defuddle-cli first.

Step 2: Handle empty content (SPA fallback)

If content is empty or just \x3Cbody>\x3C/body> (client-rendered SPA), try fallback paths in order:

Path A: CDP browser (if config.web.cdp_enabled):

Open page: curl -s "{config.web.cdp_url}/new?url=\x3CURL>" → get targetId
Scroll: curl -s "{cdp_url}/scroll?target=\x3CID>&direction=bottom"
Extract title: curl -s -X POST "{cdp_url}/eval?target=\x3CID>" -d 'document.title'
Extract body: curl -s -X POST "{cdp_url}/eval?target=\x3CID>" -d "document.querySelector('article')?.innerHTML || document.querySelector('main')?.innerHTML || document.body.innerHTML"
Extract images: curl -s -X POST "{cdp_url}/eval?target=\x3CID>" -d "[...document.querySelectorAll('img')].map(i=>i.src).filter(s=>s.startsWith('http'))"
Extract videos: curl -s -X POST "{cdp_url}/eval?target=\x3CID>" -d "[...document.querySelectorAll('video source, video')].map(v=>v.src||v.querySelector('source')?.src).filter(Boolean)"
Close tab: curl -s "{cdp_url}/close?target=\x3CID>"

Path B: WebFetch — use the WebFetch tool as fallback.

Path C: Bookmark mode — if URL is a web app/tool/dashboard or all paths fail, create a bookmark note from meta info only.

Step 3: Download media

Download images and videos to ATTACHMENTS/, replace with ![[filename]].

Step 4: Generate file

Write to WEB_DIR/{title}.md:

Content page:

---
title: "{page title}"
author: "{author}"
source: {URL}
domain: "{domain}"
date: {today YYYY-MM-DD}
tags:
  - clipping
  - {auto tags}
---

# {page title}

> 来源：{domain} | {author} | {publish date}

{Markdown body with ![[local media]]}

Bookmark page (tool/app/SPA):

---
title: "{page title}"
author: "{author}"
source: {URL}
domain: "{domain}"
date: {today YYYY-MM-DD}
tags:
  - clipping
  - {auto tags}
---

# {page title}

> 来源：{domain} | {URL}

{description}

## 为什么收藏

{inferred or asked}

## 核心功能

- {points from meta info}

Notes

defuddle does not work on SPAs (React/Vue client-rendered) — use CDP path
Always close CDP tabs after use

Usage Guidance

Before installing, verify the authoritative source: confirm the GitHub repo (https://github.com/Gisg/obsidian-clipper) is the intended upstream and that the registry publisher legitimately links to it. Do NOT run any curl | bash one‑liner unless you trust the repo and have inspected the script. Inspect the GitHub repository contents (especially any additional scripts beyond SKILL.md) for unexpected network calls or data exfiltration. Decide whether you are comfortable that third‑party proxy endpoints (fxtwitter, down.mptext.top, vxtwitter, etc.) will see the URLs/content you clip — if not, avoid those handlers or run the skill in a sandbox. When configuring, set vault.base_path to a dedicated directory and review the files the skill writes. If you need higher assurance, manually clone the repo and inspect code before enabling the skill, or request the publisher to provide a homepage/source that matches the registry owner.

Capability Analysis

Type: OpenClaw Skill Name: clip-to-your-vault Version: 1.0.0 The skill implements a universal web clipper using high-risk capabilities, including arbitrary shell execution (curl, python3), network access to third-party APIs (fxtwitter.com, mptext.top), and broad file system access to manage an Obsidian vault. While these behaviors are aligned with the stated purpose in SKILL.md, the instructions direct the agent to execute complex shell scripts and interact with external tools (douyin-downloader, defuddle) and remote debugging protocols (CDP), which significantly expands the attack surface. No clear evidence of intentional malice was found, but the reliance on unauthenticated third-party APIs and local script execution warrants a suspicious classification.

Capability Assessment

ℹ Purpose & Capability

The skill's name/description (Obsidian web clipper) aligns with the SKILL.md: it downloads web content, media, and writes Markdown into an Obsidian vault. It does require access to the user's filesystem (vault.base_path) which is proportionate. However, the registry entry has no homepage/source while setup.sh/README point to a GitHub repo (Gisg/obsidian-clipper) — an ownership/source mismatch worth verifying.

⚠ Instruction Scope

Runtime instructions perform network fetches (fxtwitter, down.mptext.top, vxtwitter fallback, douyin/downloader, defuddle/CDP) and download media into the user's vault; this is expected for a clipper. Concerns: (1) it relies on third‑party proxy APIs (fxtwitter, down.mptext.top) which will receive the URLs/content you ask to clip and could log them; (2) Douyin handling requires cookie fetching and running Playwright scripts (sensitive); (3) SKILL.md instructs reading config.yml from the skill directory and writing files into the user's vault — ensure config points to the intended directory. The instructions do not attempt to read unrelated system secrets, but they do enable sending content to external services.

⚠ Install Mechanism

There is no formal install spec in the registry, but setup.sh includes a one‑liner comment that pipes a raw GitHub URL to bash and the script performs a git clone from https://github.com/Gisg/obsidian-clipper.git into ~/.claude/skills. Pulling and executing remote code (curl | bash) is high risk. The actual clone target is GitHub (a well‑known host), but the registry owner differs from the GitHub repo author, which is an inconsistency to validate before running any installer.

✓ Credentials

The skill declares no required environment variables or credentials. It does ask for a config.yml with an absolute vault.base_path and optional proxy/tool paths; that is reasonable and proportionate. Be aware optional flows (Douyin cookie fetching, CDP URL, Xiaohongshu proxy) may require additional local tools/config and cookies but the skill does not request unrelated secrets.

✓ Persistence & Privilege

The skill does not request always:true and is user‑invocable only. The installer clones into the user's ~/.claude/skills directory and creates symlinks and a local config file; those are normal for a skill. It does not request to modify other skills or system settings beyond placing files under the user's home.

Version History

v1.0.0

obsidian-clipper 1.0.0 - Initial release: Universal web clipper for Obsidian Vault supporting X/Twitter, WeChat Official Accounts, Douyin, Xiaohongshu, GitHub, and generic web pages. - Automatically routes URLs to platform-specific handlers with media download and formatting. - Saves notes with local images/videos, content-based tags, and Obsidian wikilinks. - Asks for overwrite confirmation on filename conflicts and prompts for a "why clipped" reason if context is unclear. - Requires simple config file setup on first use.

Metadata

Slug clip-to-your-vault

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Clip to Your Vault?

Universal web clipper for Obsidian Vault. Saves content from X/Twitter, WeChat, Douyin, Xiaohongshu, GitHub, and generic web pages. Triggers when user sends... It is an AI Agent Skill for Claude Code / OpenClaw, with 80 downloads so far.

How do I install Clip to Your Vault?

Run "/install clip-to-your-vault" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Clip to Your Vault free?

Yes, Clip to Your Vault is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Clip to Your Vault support?

Clip to Your Vault is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Clip to Your Vault?

It is built and maintained by zGissing (@zgissing); the current version is v1.0.0.

More Skills

Clip to Your Vault

Obsidian Clipper

Configuration

URL Router

Shared Rules

File naming

Tags

Media download

"Why clipped" field

Cross-platform linking

X Handler

Step 1: Fetch post data

Step 2: Determine content type

Step 3: Download images

Step 4: Generate file

Notes

WeChat Handler

Step 1: Fetch article data

Step 2: Fetch HTML (if rich content needed)

Step 3: Download images

Step 4: Generate file

Xiaohongshu Handler

Step 1: Resolve short link

Step 2: Fetch SSR data

Step 3: Download media

Step 4: Generate file

Notes

Douyin Handler

Step 1: Extract and resolve link

Step 2: Download with douyin-downloader

Step 3: Extract metadata

Step 4: Copy media to vault

Step 5: Generate file

Notes

GitHub Handler

Step 1: Fetch repo metadata

Step 2: Fetch README

Step 3: Summarize README

Step 4: Generate file

Notes

Web Handler

Step 1: Extract content with defuddle

Step 2: Handle empty content (SPA fallback)

Step 3: Download media

Step 4: Generate file

Notes

What is Clip to Your Vault?

How do I install Clip to Your Vault?

Is Clip to Your Vault free?

Which platforms does Clip to Your Vault support?

Who created Clip to Your Vault?

💬 Comments