amiracleta

Auto Study

by Song Kexin · GitHub ↗ · v1.5.1 · MIT-0
cross-platform ⚠ suspicious
Downloads: 373
Stars: 1
Active Installs: 2
Versions: 5
Install in OpenClaw
/install auto-study
Description
Use when performing study tasks on browser-based platforms such as Yuketang, Xuexitong, Zhihuishu, and Pintia, including answering quizzes and page actions.
README (SKILL.md)

auto-study

Core policy

  • Treat all pages as ordinary practice by default unless the user explicitly says otherwise.

  • Apply page actions sequentially, with a short pause of 0.1 seconds between actions.

  • Read the image directly instead of trying to extract text from it.

  • Reuse the same browser profile for the same site when login state matters.

  • Always launch Chrome with the designated persistent profile.

  • Interact with Chrome using CDP.

  • After attaching through CDP, verify the active tab and current URL.

  • Check the CDP port before launching a new browser session. If a session is already available, attach to it directly. Otherwise, start Chrome in headed mode, unless the user asks for something else.

  • When there is a mathematical expression, write it as LaTeX in markdown.

  • Build the markdown record before applying any answers on the page.

  • NOT permitted

    • Do not re-click options that already match the target state.

    • Do not rely on actions that a normal user could not perform. Prefer the normal user flow whenever possible.

    • Do not submit automatically unless the user explicitly asks for it.

    • Do not search the web unless the user explicitly asks for it.

    • Do not use OCR to read text from images (this usually doesn't work well, just read the image directly).

    • Do not assume the page is a formal exam simply because it contains keywords like 考试 (exam), 测验 (quiz), 练习 (practice), or 作业 (homework). Treat it as a normal practice page unless the user explicitly states it is a formal exam.

    • Do not skip any steps described in the references/ files unless explicitly asked to. Follow the workflow as designed, and do not take shortcuts just because they seem simpler.
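The pacing and "do not re-click" rules above can be sketched as a small state diff. This is a minimal illustration, not the skill's actual implementation: `clicks_needed`, `apply_clicks`, and the `click` callback are hypothetical names, and the option states are assumed to have been read from the page already.

```python
import time
from typing import Callable, Dict, Iterable, List, Set

PAUSE_SECONDS = 0.1  # pause between page actions, per the core policy


def clicks_needed(current: Dict[str, bool], target: Set[str]) -> List[str]:
    """Return the option labels whose checked state differs from the target.

    Options that already match the target state are left out, so they are
    never re-clicked.
    """
    return [label for label in sorted(current) if current[label] != (label in target)]


def apply_clicks(labels: Iterable[str], click: Callable[[str], None],
                 pause: float = PAUSE_SECONDS) -> None:
    """Call the (hypothetical) click callback one label at a time, pausing after each."""
    for label in labels:
        click(label)
        time.sleep(pause)
```

For example, with option A already selected and a target of A and C, only C is clicked.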

Workflow

  1. Start Chrome with a CDP port, or attach to an instance that already exposes one.
  2. Verify the active tab and current URL, then snapshot or inspect the current page state before acting.
  3. Interact with the page according to the user's request, such as selecting, filling, or clearing answers, or clicking the submit button.
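Steps 1 and 2 can be approximated with Chrome's DevTools HTTP endpoints. The sketch below is an assumption about how one might probe the port with the Python standard library; the skill itself delegates browser control to agent-browser, and the helper names `cdp_version` and `page_targets` are hypothetical.

```python
import http.client
import json
import urllib.error
import urllib.request
from typing import List, Optional

DEFAULT_CDP_PORT = 9344  # default port named in this SKILL.md


def cdp_version(port: int = DEFAULT_CDP_PORT, host: str = "127.0.0.1",
                timeout: float = 1.0) -> Optional[dict]:
    """Return the /json/version payload if a CDP endpoint answers, else None."""
    url = f"http://{host}:{port}/json/version"
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return json.load(resp)
    except (urllib.error.URLError, http.client.HTTPException, OSError, ValueError):
        return None  # nothing listening, or not a CDP endpoint


def page_targets(targets: List[dict]) -> List[dict]:
    """Keep only ordinary tabs from a /json/list payload (drops workers, iframes, etc.)."""
    return [t for t in targets if t.get("type") == "page"]
```

If `cdp_version()` returns `None`, no session is available on that port and a new Chrome would need to be started. Otherwise the `/json/list` endpoint returns the open targets, and `page_targets` narrows them to tabs whose `url` field can be checked before acting.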

Answer formatting

Single choice

Return only the final option letter.

Multiple select

Use comma-separated letters with no extra commentary.

Fill in the blank

Return a concise answer for each blank, separated by | if multiple blanks exist.

Short answer

Return a concise answer of no more than three sentences, without any explanation or commentary unless explicitly requested.
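The formats for the first three question types can be sketched as one helper. This is only an illustration: `format_answer` and its `kind` values are hypothetical, and details the text leaves open (uppercase letters, sorted order, no spaces around `,` and `|`) are assumptions. Short answers are free text and are not covered.

```python
from typing import List


def format_answer(kind: str, values: List[str]) -> str:
    """Format an answer for the record according to its question type."""
    if kind == "single":
        if len(values) != 1:
            raise ValueError("single choice needs exactly one option letter")
        return values[0].strip().upper()
    if kind == "multiple":
        # Comma-separated letters, no extra commentary.
        return ",".join(sorted(v.strip().upper() for v in values))
    if kind == "blank":
        # One concise answer per blank, separated by |.
        return "|".join(v.strip() for v in values)
    raise ValueError(f"unsupported question type: {kind}")
```

For instance, `format_answer("multiple", ["c", "a"])` gives `A,C`, and `format_answer("blank", ["x + 1", "2"])` gives `x + 1|2`.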

More Guidance

Environment-specific guidance

  • For Windows-native usage, read references/runtime-windows.md.
  • For WSL usage that launches Windows Chrome, read references/runtime-wsl.md.
  • For macOS-native usage, read references/runtime-macos.md.

Platform-specific guidance

  • For Xuexitong specifics, read references/xuexitong.md.
  • For Zhihuishu specifics, read references/zhihuishu.md.
  • For Yuketang specifics, read references/yuketang.md.
  • For Pintia specifics, read references/pintia.md.

Prerequisites

DEFAULT

Practice artifacts storage (markdown or images)

  • <agent-root>/workspace/auto-study/<platform>/<task>/, where <agent-root> is the agent's own home directory: ~/.codex/ for Codex, ~/.hermes/ for Hermes, and the equivalent directory for other agents.
  • <task> is structured like this:
    • record.md
    • images/full.png for a full-page screenshot of the task
    • images/q001.png, images/q002.png for per-question screenshots
    • images/q001-1.png, images/q001-2.png when one question needs multiple images
  • Derive <task> from the chapter or assignment title and normalize path-unsafe characters to -.
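The storage layout above can be sketched with a few path helpers. The set of "path-unsafe" characters is not specified in the text, so the pattern below (Windows-reserved characters plus control characters) is an assumption, as are the helper names and the `untitled` fallback.

```python
import re
from pathlib import PurePosixPath
from typing import Optional

# Assumed definition of "path-unsafe": \ / : * ? " < > | and control characters.
_UNSAFE = re.compile(r'[\\/:*?"<>|\x00-\x1f]')


def normalize_task(title: str) -> str:
    """Derive a task directory name from a chapter or assignment title."""
    cleaned = _UNSAFE.sub("-", title.strip())
    return cleaned or "untitled"  # fallback name is an assumption


def question_image(index: int, part: Optional[int] = None) -> str:
    """Return images/q001.png, or images/q001-1.png when a question needs several images."""
    stem = f"q{index:03d}"
    if part is not None:
        stem += f"-{part}"
    return f"images/{stem}.png"


def task_dir(agent_root: str, platform: str, title: str) -> PurePosixPath:
    """Build <agent-root>/workspace/auto-study/<platform>/<task>."""
    return PurePosixPath(agent_root) / "workspace" / "auto-study" / platform / normalize_task(title)
```

A title such as `Chapter 3: Limits/Continuity` becomes `Chapter 3- Limits-Continuity`, and the record for that task would live at `record.md` inside the resulting directory.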

Chrome profile root

  • Windows: %LOCALAPPDATA%\auto-study\browser
  • macOS: ~/Library/Application Support/AutoStudy/browser
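Choosing between the two profile roots could look like the sketch below. The function name and its parameters are hypothetical, and returning `None` on other platforms is an assumption, since only Windows and macOS roots are listed here (WSL is covered separately in references/runtime-wsl.md).

```python
from typing import Mapping, Optional


def default_profile_root(platform: str, env: Mapping[str, str], home: str) -> Optional[str]:
    """Return the default Chrome profile root for a platform, or None if none is listed.

    `platform` follows sys.platform naming ("win32", "darwin").
    """
    if platform == "win32":
        local = env.get("LOCALAPPDATA")
        return f"{local}\\auto-study\\browser" if local else None
    if platform == "darwin":
        return f"{home}/Library/Application Support/AutoStudy/browser"
    return None  # no default documented for this platform


if __name__ == "__main__":
    import os
    import sys

    print(default_profile_root(sys.platform, os.environ, os.path.expanduser("~")))
```

Passing the platform, environment, and home directory as arguments keeps the function easy to check without touching the real machine state.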

CDP port

  • 9344 (default, can be customized when user asks).
Usage Guidance
This skill automates your local Chrome instance and reuses persistent browser profiles and an agent-root workspace. Before installing or using it:
  • Understand scope: it will attach to a Chrome CDP port (default 9344) and can control tabs and interact with pages, including sites where you're logged in. That is needed for its purpose, but it means the skill can perform any action your browser can.
  • Verify or isolate profiles: do not let it use your main browser profile. Create and point it to a dedicated Chrome profile (or a disposable account) to avoid exposing personal cookies, sessions, or saved credentials.
  • Confirm storage: the skill will create files under an agent workspace (examples shown) and expects profile folders under %LOCALAPPDATA% or ~/Library/Application Support. Decide whether you're comfortable with those files and where they are stored; consider using a sandbox or VM.
  • Check prerequisites yourself: the skill references agent-browser and an Agent Browser Skill and links a GitHub repo. Inspect those projects (or their official sources) before installing any additional tools.
  • Autonomous invocation: if you allow the agent to call skills autonomously, be aware this skill can attach to running browser sessions. Consider disabling autonomous invocation for this skill or restricting when it can run.
  • If you need stronger assurance: run the skill in an isolated environment (VM/container) or with a throwaway account, and review the upstream repository linked in SKILL.md for implementation details.
Because the manifest does not declare the config paths and filesystem access the instructions rely on, treat the skill as plausible for its purpose but exercise caution and isolate its permissions.
Capability Analysis
Type: OpenClaw Skill
Name: auto-study
Version: 1.5.1
The skill is designed to automate quizzes and exams on educational platforms (Xuexitong, Yuketang, etc.) using Chrome DevTools Protocol (CDP) and local workspace storage. While no overt data exfiltration or malware was found, it requires high-privilege browser control and includes instructions in 'references/runtime-wsl.md' to execute PowerShell commands and potentially bypass sandbox limitations for file operations. The combination of broad filesystem access, persistent browser profile usage, and shell execution for academic automation poses a significant security risk if the agent is compromised or the skill is repurposed.
Capability Assessment
Purpose & Capability
The name/description match the instructions: browser automation for Yuketang/Xuexitong/Zhihuishu/Pintia. Required tools (Google Chrome, agent-browser, agent-browser skill) are appropriate. However, the SKILL.md expects reuse of persistent Chrome profiles and specific profile roots (%LOCALAPPDATA% and ~/Library/Application Support/AutoStudy/browser) as well as writing artifacts under an agent-root workspace, yet the skill metadata declares no required config paths — a documentation/manifest mismatch.
Instruction Scope
The runtime instructions direct the agent to attach to an existing Chrome session via CDP (default port 9344), verify the active tab and URL, take full-page and per-question screenshots, create and write markdown and image files under <agent-root>/workspace/auto-study/<platform>/<task>/, and reuse persistent Chrome profiles. Those actions allow the skill to access and modify any content visible in the attached browser (including logged-in sessions) and to write files to user directories. While necessary for the stated functionality, the instructions give broad filesystem and browser-control scope that is not explicitly declared in the skill manifest.
Install Mechanism
There is no install spec and no code files — the skill is instruction-only. This minimizes risk from downloaded code. The skill does rely on external tools (agent-browser and an Agent Browser Skill), which are referenced but not installed by the skill itself.
Credentials
The skill declares no required environment variables or config paths, yet the instructions assume access to specific local profile paths and a workspace under the agent root (examples: %LOCALAPPDATA%\auto-study\browser, ~/Library/Application Support/AutoStudy/browser, and ~/.codex or ~/.hermes workspace). This is a proportionality/documentation mismatch: the skill needs local filesystem access and live browser sessions but doesn't declare or document those requirements in the manifest, making it harder for a user to assess what will be accessed.
Persistence & Privilege
always is false and the skill is user-invocable (normal). The skill can be invoked autonomously by the agent (disable-model-invocation=false) which is platform default. Combined with instructions to attach to an existing CDP port and reuse persistent profiles, autonomous runs could control a user's running browser session and act on behalf of logged-in users. This is not inherently malicious for a browser automation skill, but it increases risk and should be considered when deciding to enable/allow autonomous invocation.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install auto-study
  3. After installation, invoke the skill by name or use /auto-study
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.5.1
  1. Updated skill structure
  2. Added reference for Zhihuishu
  3. Added support for macOS
v1.3.2
Minor docs update
v1.3.1
auto-study 1.3.1
  • Description updated for clarity and conciseness; emphasizes browser automation on specific study platforms.
  • Metadata improved: "category" now includes both "study" and "browser automation" for better classification.
  • Policy on handling image-based questions clarified to explicitly favor direct image reading.
  • Formatting made more consistent and concise throughout documentation.
  • Minor edits for style and readability; no changes to functionality or code.
v1.3.0
auto-study 1.3.0
  • Updated policy for answering: now "carefully analyze answer" instead of simply "infer answer directly".
  • Removed policy for handling obfuscated/encrypted/gibberish DOM-visible text and screenshot fallback.
  • No automatic web search for answers unless explicitly requested by the user.
  • No functional or file changes; documentation/policy wording only.
v1.0.0
  • Initial release of auto-study skill for browser-based study, quiz, and practice platforms.
  • Automates answer extraction, selection, fill-in, clearing, and submission tasks for sites like Yuketang, Xuexitong, and Pintia.
  • Uses persistent Chrome profiles and agent-browser CDP connectivity for stateful interactions.
  • Answers are returned in concise formats; no automatic submission unless requested.
  • Guidance included for platform-specific use, Chrome profile storage, and supported environments.
Metadata
Slug auto-study
Version 1.5.1
License MIT-0
All-time Installs 2
Active Installs 2
Total Versions 5
Frequently Asked Questions

What is Auto Study?

Auto Study is an AI Agent Skill for Claude Code / OpenClaw that performs study tasks on browser-based platforms such as Yuketang, Xuexitong, Zhihuishu, and Pintia, including answering quizzes and page actions. It has 373 downloads so far.

How do I install Auto Study?

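Run "/install auto-study" in the OpenClaw or Claude Code chat to install it in one step. The skill is instruction-only, so the tools it relies on (Google Chrome and agent-browser) are not installed by the skill and must already be available.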

Is Auto Study free?

Yes, Auto Study is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Auto Study support?

Auto Study is cross-platform and runs anywhere OpenClaw / Claude Code is available. Its SKILL.md includes runtime guidance for Windows, WSL, and macOS.

Who created Auto Study?

It is built and maintained by Song Kexin (@amiracleta); the current version is v1.5.1.
