← Back to Skills Marketplace

Percept Listen

Name: Percept Listen
Author: jarvis563

by jarvis563 · GitHub ↗ · v1.0.0

cross-platform ⚠ suspicious

486

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install percept-listen

Description

Captures ambient audio from wearable devices, transcribes locally, and streams searchable, speaker-tagged conversation data to your OpenClaw agent.

Usage Guidance

This skill can collect and store ambient conversation data and asks you to install and run a third-party Python package and a local web server reachable via a public tunnel. Before installing: (1) Verify the 'getpercept' package and the GitHub repo (owner, recent commits, issues) to ensure you trust the code; (2) Confirm where STT actually happens — if the phone/cloud does speech-to-text, sensitive audio may be processed by third parties despite the 'local' claim; (3) Be aware that using ngrok/Cloudflare/Tailscale exposes webhook traffic to those providers — use authenticated tunnels and TLS, and limit access; (4) Review and secure the filesystem paths (percept/data/percept.db, /tmp/percept-live.txt) and set appropriate permissions/rotation; (5) Consider legal and privacy obligations for recording ambient conversations and obtain consent; (6) If you are not comfortable auditing a pip package or exposing transcripts via a tunnel, do not install. Additional helpful info to reduce uncertainty: a link to an official project homepage, signed releases or PyPI/project metadata, and explicit documentation stating whether STT runs on-device or in-cloud.

Capability Analysis

Type: OpenClaw Skill Name: percept-listen Version: 1.0.0 The skill bundle describes a local ambient audio capture and transcription service. It instructs the user to install a Python package (`getpercept`) and run a local server (`uvicorn`) to receive data via a user-configured webhook/tunnel. The documentation explicitly states that "All processing stays local — no audio leaves your machine," indicating no intent for data exfiltration. There are no signs of prompt injection, malicious execution, persistence mechanisms, or obfuscation within the provided `SKILL.md` or `_meta.json` files. While exposing a local server via a tunnel can introduce vulnerabilities if the server itself is insecure, the instructions themselves are for setting up a legitimate service, not for malicious purposes.

Capability Assessment

ℹ Purpose & Capability

Name and description (ambient capture + local transcription) align with the SKILL.md instructions to run a 'percept' receiver and accept webhook transcripts. However, the doc contains a contradictory phrase: the flow says 'Omi pendant captures audio → phone does STT → sends transcript segments via webhook', which implies the phone performs speech-to-text (possibly cloud-based) rather than transcription occurring purely on the user's machine. The skill also instructs installing a PyPI package and running a web server/tunnel although the registry metadata lists no required binaries, env vars, or credentials — a mild mismatch between declared requirements and operational needs.

⚠ Instruction Scope

Runtime instructions tell the operator to pip-install getpercept, run a uvicorn server, and configure a public tunnel (ngrok/Cloudflare/Tailscale) so a mobile app can post transcripts to a local webhook. That design intentionally exposes conversation transcripts to the chosen tunneling provider and to any network path between device and tunnel endpoint. Instructions also prescribe where transcripts and DB files are stored on disk. The doc does not warn that the phone's STT may be cloud-based or that tunnel providers will see webhook traffic — this is material scope creep for a skill claiming 'all processing stays local.'

ℹ Install Mechanism

There is no formal install spec in the registry; the SKILL.md directs users to 'pip install getpercept' and to run 'percept start' or uvicorn. Installing unverified PyPI packages and running a web server on your machine is a standard but non-trivial action that can introduce arbitrary code. The SKILL.md points to a GitHub repo (github.com/GetPercept/percept), which is helpful, but the registry metadata lists the source as unknown and has no homepage — you should verify the package and repo before installing.

ℹ Credentials

The skill declares no required env vars or credentials, which is consistent with the registry metadata. In practice, using tunnels and webhooks will typically require accounts/credentials for ngrok/Tailscale/Cloudflare or some configuration in the Omi/phone app; those are not documented or requested explicitly. Also, the doc's claim that 'all processing stays local' conflicts with the stated phone-side STT step, which may use third-party cloud STT and therefore would involve external credentials and data egress not described here.

✓ Persistence & Privilege

The skill does not request elevated platform privileges or 'always: true'. It is user-invocable and can run services (percept start) on the host, but that is normal for an instruction-only integration. Note: once installed and running, the local server will persist conversation data on disk and the agent (if granted access) could read that data — evaluate that data flow carefully.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install percept-listen
After installation, invoke the skill by name or use /percept-listen
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Initial release of percept-listen: enable ambient audio capture and transcription for OpenClaw agents using wearable devices. - Supports Omi pendant and Apple Watch as audio sources, with local speech-to-text and webhook delivery of transcripts. - All transcripts are structured, searchable, and stored locally; includes local SQLite database and live transcript files. - Setup instructions provided for Percept server, device pairing, and webhook configuration. - Data and configuration remain private; audio processing and storage do not leave the user's machine.

Metadata

Slug percept-listen

Version 1.0.0

License —

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Percept Listen?

Captures ambient audio from wearable devices, transcribes locally, and streams searchable, speaker-tagged conversation data to your OpenClaw agent. It is an AI Agent Skill for Claude Code / OpenClaw, with 486 downloads so far.

How do I install Percept Listen?

Run "/install percept-listen" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Percept Listen free?

Yes, Percept Listen is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Percept Listen support?

Percept Listen is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Percept Listen?

It is built and maintained by jarvis563 (@jarvis563); the current version is v1.0.0.

More Skills