Description

Use when performing study tasks on browser-based platforms such as Yuketang, Xuexitong, Zhihuishu, and Pintia, including answering quizzes and page actions.

README (SKILL.md)

auto-study

Name: Auto Study
Author: amiracleta

Core policy

Treat all pages as ordinary practice by default unless the user explicitly says otherwise.
Apply page action sequentially with short pauses at 0.1 seconds.
Read the image directly instead of trying to extract text from it.
Reuse the same browser profile for the same site when login state matters.
Always launch Chrome with the designated persistent profile.
Interact with Chrome using CDP.
After attaching through CDP, verify the active tab and current URL.
Check the CDP port before launching a new browser session. If a session is already available, attach to it directly. Otherwise, start Chrome with headed mode. Except when the user asks to do something else
when there is a mathematical expression, use latex in markdown.
Build the markdown record before applying any answers on the page.
NOT permitted
- Do not re-click options that already match the target state.
- Do not rely on actions that a normal user could not perform. Prefer the normal user flow whenever possible.
- Do not submit automatically unless the user explicitly asks for it.
- Do not search the web unless the user explicitly asks for it.
- Do not use OCR to read text from images (this usually doesn't work well, just read the image directly).
- Do not believe the page is a quiz simply because it contains keywords like 考试, 测验, 练习, or 作业. Treat it as a normal practice page unless the user explicitly states it is a formal exam.
- Do not skip any steps for references/ unless explicitly asked to. Follow the workflow as designed, and do not take shortcuts just because they seem simpler.

Workflow

Start or attach to a Chrome with CDP port.
Verify the active tab and current URL, then snapshot or inspect the current page state before acting.
Interact with the page according to the user's request, such as selecting, filling, or clearing answers, or clicking the submit button.

Answer formatting

Single choice

Return only the final option letter.

Multiple select

Use comma-separated letters with no extra commentary.

Fill in the blank

Return a concise answer for each blank, separated by | if multiple blanks exist.

Short answer

Return a concise answer of no more than three sentences, without any explanation or commentary unless explicitly requested.

More Guidance

environment-specific guidance

For Windows-native usage, read references/runtime-windows.md.
For WSL usage that launches Windows Chrome, read references/runtime-wsl.md.
For macOS-native usage, read references/runtime-macos.md.

Platform-specific guidance

For Xuexitong specifics, read references/xuexitong.md.
For Zhihuishu specifics, read references/zhihuishu.md.
For Yuketang specifics, read references/yuketang.md.
For Pintia specifics, read references/pintia.md.

Prerequisites

Google Chrome (on Windows or macOS)
Agent Browser CLI
Agent Browser Skill

DEFAULT

Practice artifacts storage (markdown or images)

\x3Cagent-root>/workspace/auto-study/\x3Cplatform>/\x3Ctask>/ (\x3Cagent-root> explain: if you are codex, it means ~/.codex/; if you are hermes, it means ~/.hermes/; Okey, I believe you've got it!)
\x3Ctask> structure like this
- record.md
- images/full.png for a full-page screenshot of the task
- images/q001.png, images/q002.png for per-question screenshots
- images/q001-1.png, images/q001-2.png when one question needs multiple images
Derive \x3Ctask> from the chapter or assignment title and normalize path-unsafe characters to -.

Chrome profile root

%LOCALAPPDATA%\auto-study\browser.
~/Library/Application Support/AutoStudy/browser.

CDP port

9344 (default, can be customized when user asks).

Usage Guidance

This skill automates your local Chrome instance and reuses persistent browser profiles and an agent-root workspace. Before installing or using it: - Understand scope: it will attach to a Chrome CDP port (default 9344) and can control tabs and interact with pages — including sites where you're logged in. That is needed for its purpose but can perform any actions your browser can. - Verify or isolate profiles: do not let it use your main browser profile. Create and point it to a dedicated Chrome profile (or a disposable account) to avoid exposing personal cookies, sessions, or saved credentials. - Confirm storage: the skill will create files under an agent workspace (examples shown) and expects profile folders under %LOCALAPPDATA% or ~/Library/Application Support. Decide whether you’re comfortable with those files and where they are stored; consider using a sandbox or VM. - Check prerequisites yourself: the skill references agent-browser and an Agent Browser Skill and links a GitHub repo. Inspect those projects (or their official sources) before installing any additional tools. - Autonomous invocation: if you allow the agent to call skills autonomously, be aware this skill can attach to running browser sessions. Consider disabling autonomous invocation for this skill or restricting when it can run. - If you need stronger assurance: run the skill in an isolated environment (VM/container) or with a throwaway account, and review the upstream repository linked in SKILL.md for implementation details. Because the manifest does not declare the config paths and filesystem access the instructions rely on, treat the skill as plausible for its purpose but exercise caution and isolate its permissions.

Capability Analysis

Type: OpenClaw Skill Name: auto-study Version: 1.5.1 The skill is designed to automate quizzes and exams on educational platforms (Xuexitong, Yuketang, etc.) using Chrome DevTools Protocol (CDP) and local workspace storage. While no overt data exfiltration or malware was found, it requires high-privilege browser control and includes instructions in 'references/runtime-wsl.md' to execute PowerShell commands and potentially bypass sandbox limitations for file operations. The combination of broad filesystem access, persistent browser profile usage, and shell execution for academic automation poses a significant security risk if the agent is compromised or the skill is repurposed.

Capability Assessment

ℹ Purpose & Capability

The name/description match the instructions: browser automation for Yuketang/Xuexitong/Zhihuishu/Pintia. Required tools (Google Chrome, agent-browser, agent-browser skill) are appropriate. However, the SKILL.md expects reuse of persistent Chrome profiles and specific profile roots (%LOCALAPPDATA% and ~/Library/Application Support/AutoStudy/browser) as well as writing artifacts under an agent-root workspace, yet the skill metadata declares no required config paths — a documentation/manifest mismatch.

⚠ Instruction Scope

Runtime instructions instruct the agent to attach to an existing Chrome session via CDP (default port 9344), verify active tab/URL, take full-page and per-question screenshots, create and write markdown and image files under <agent-root>/workspace/auto-study/<platform>/<task>/, and reuse persistent Chrome profiles. Those actions allow the skill to access and modify any content visible in the attached browser (including logged-in sessions) and to write files to user directories. While necessary for the stated functionality, the instructions give broad file-system and browser-control scope that is not explicitly declared in the skill manifest.

✓ Install Mechanism

There is no install spec and no code files — the skill is instruction-only. This minimizes risk from downloaded code. The skill does rely on external tools (agent-browser and an Agent Browser Skill), which are referenced but not installed by the skill itself.

⚠ Credentials

The skill declares no required environment variables or config paths, yet the instructions assume access to specific local profile paths and a workspace under the agent root (examples: %LOCALAPPDATA%\auto-study\browser, ~/Library/Application Support/AutoStudy/browser, and ~/.codex or ~/.hermes workspace). This is a proportionality/documentation mismatch: the skill needs local filesystem access and live browser sessions but doesn't declare or document those requirements in the manifest, making it harder for a user to assess what will be accessed.

ℹ Persistence & Privilege

always is false and the skill is user-invocable (normal). The skill can be invoked autonomously by the agent (disable-model-invocation=false) which is platform default. Combined with instructions to attach to an existing CDP port and reuse persistent profiles, autonomous runs could control a user's running browser session and act on behalf of logged-in users. This is not inherently malicious for a browser automation skill, but it increases risk and should be considered when deciding to enable/allow autonomous invocation.

Version History

v1.5.1

1. updated skill structure 2. added reference for zhihuishu 3. added support for macOS

v1.3.2

Minor docs update

v1.3.1

auto-study 1.3.1 - Description updated for clarity and conciseness; emphasizes browser automation on specific study platforms. - Metadata improved: "category" now includes both "study" and "browser automation" for better classification. - Policy on handling image-based questions clarified to explicitly favor direct image reading. - Formatting made more consistent and concise throughout documentation. - Minor edits for style and readability; no changes to functionality or code.

v1.3.0

auto-study 1.3.0 - Updated policy for answering: now "carefully analyze answer" instead of simply "infer answer directly". - Removed policy for handling obfuscated/encrypted/gibberish DOM-visible text and screenshot fallback. - No automatic web search for answers unless explicitly requested by the user. - No functional or file changes; documentation/policy wording only.

v1.0.0

- Initial release of auto-study skill for browser-based study, quiz, and practice platforms. - Automates answer extraction, selection, fill-in, clearing, and submission tasks for sites like Yuketang, Xuexitong, and Pintia. - Uses persistent Chrome profiles and agent-browser CDP connectivity for stateful interactions. - Answers are returned in concise formats; no automatic submission unless requested. - Guidance included for platform-specific use, Chrome profile storage, and supported environments.

Metadata

Slug auto-study

Version 1.5.1

License MIT-0

All-time Installs 2

Active Installs 2

Total Versions 5

Frequently Asked Questions

What is Auto Study?

Use when performing study tasks on browser-based platforms such as Yuketang, Xuexitong, Zhihuishu, and Pintia, including answering quizzes and page actions. It is an AI Agent Skill for Claude Code / OpenClaw, with 373 downloads so far.

How do I install Auto Study?

Run "/install auto-study" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Auto Study free?

Yes, Auto Study is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Auto Study support?

Auto Study is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Auto Study?

It is built and maintained by Song Kexin (@amiracleta); the current version is v1.5.1.

More Skills

Auto Study