← Back to Skills Marketplace
sxu75374

Video Chat With Me

by sxu75374 · GitHub ↗ · v1.1.0
cross-platform ⚠ suspicious
1191
Downloads
0
Stars
2
Active Installs
5
Versions
Install in OpenClaw
/install videochat-withme
Description
Real-time AI video chat that routes through your OpenClaw agent. Uses Groq Whisper (cloud STT), edge-tts (cloud TTS via Microsoft), and OpenClaw chatCompletions API for conversation. Your agent sees your camera, hears your voice, and responds with its own personality and memory. Requires: GROQ_API_KEY for speech recognition. Reads ~/.openclaw/openclaw.json for gateway port and auth token. Data flows: audio → Groq cloud (STT), TTS text → Microsoft cloud (edge-tts), camera frames (base64) + text → OpenClaw gateway → your configured LLM provider (may be cloud — frames leave the machine if using a cloud LLM). Installs a persistent launchd service (optional). Trigger phrases: "video chat", "voice call", "call me", "视频一下", "语音", "打电话给我", "我要和你视频", "videochat-withme".
Usage Guidance
This skill's functionality matches its description, but it carries elevated privacy and privilege implications. Before installing: 1) Note the source is unknown and there's no homepage—exercise caution. 2) Review server.py and setup.sh yourself (they are included) — verify exactly how the OpenClaw gateway token is read/used. 3) Understand that the skill reads your GROQ key and the OpenClaw gateway auth token (from ~/.openclaw/openclaw.json) — that gateway token grants access to the agent's memory/tools and can route frames to cloud LLMs. 4) If you install, consider running without the LaunchAgent first (use start.sh directly) and test locally, or run in a sandboxed account/container. 5) If you require privacy: configure your OpenClaw gateway to use a local/self‑hosted model (so frames don’t leave your machine), or do not provide the gateway auth in the config. 6) If you’re not comfortable auditing or trusting the author, do not install. Rotating credentials after removal and checking /tmp/videochat-withme.log and the created LaunchAgent plist are good post‑install checks.
Capability Analysis
Type: OpenClaw Skill Name: videochat-withme Version: 1.1.0 The skill is classified as suspicious due to several high-risk capabilities, primarily the installation of a persistent launchd service (`com.openclaw.videochat-withme.plist`) via `scripts/setup.sh` and the use of `subprocess.run` for `ffmpeg` and `edge-tts` in `scripts/server.py`. While these actions are documented and necessary for the skill's stated purpose (real-time video chat), the persistent service grants the skill continuous execution, and the `subprocess.run` calls, especially with LLM-generated text as input to `edge-tts`, present a potential Remote Code Execution (RCE) vulnerability if not robustly sanitized against shell injection. Data exfiltration to Groq (STT) and Microsoft (TTS) cloud services, and potentially to a user-configured cloud LLM, is also present but explicitly documented.
Capability Assessment
Purpose & Capability
Name/description match the actual behavior: python+ffmpeg server that sends audio to Groq Whisper (GROQ_API_KEY) and routes camera frames/text to the OpenClaw gateway for chatCompletions. Declared required binaries and config (gateway.http) align with these needs. Minor concern: source/homepage missing which reduces trust.
Instruction Scope
SKILL.md and scripts instruct the agent to read ~/.openclaw/openclaw.json and ~/.openclaw/secrets/groq_api_key.txt, run setup.sh, install Python deps and mkcert, generate certs, and start a FastAPI server that encodes camera frames as base64 and POSTs them to the gateway. All of this is coherent with the stated purpose but grants the skill broad access to camera frames, microphone audio, and your agent session.
Install Mechanism
No centralized install spec, but included setup.sh will install Python packages via pip, use brew to install ffmpeg/mkcert if available, generate local certs, and write a launchd plist. These are common but modify the system (LaunchAgents, pip installs). There are no suspicious remote download URLs in the provided scripts.
Credentials
GROQ_API_KEY is expected. The skill reads the OpenClaw gateway auth token from ~/.openclaw/openclaw.json (not declared as an env var) and uses it to call /v1/chat/completions as the main agent. That token effectively gives the skill access to the agent's full memory, personality, and tools — a high‑privilege credential that has broad consequences if misused.
Persistence & Privilege
setup.sh optionally installs a LaunchAgent that auto-starts the HTTP server (persistent, starts at login). Persistent service combined with reading the gateway token and sending camera frames to the agent increases risk: the service could be triggered without explicit user action and route sensitive data to a cloud LLM if the gateway forwards externally.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install videochat-withme
  3. After installation, invoke the skill by name or use /videochat-withme
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.1.0
Fix: prevent TTS audio from replaying on every click/keypress
v1.0.3
Declare gateway.http config requirement; fix privacy claims — camera frames may reach cloud LLM via gateway; declare openclaw.json access
v1.0.2
Use standard OpenClaw metadata.openclaw.requires schema to declare GROQ_API_KEY env var and python3/ffmpeg bins; explicit data flow in description
v1.0.1
Fix security scan: declare env vars, config paths, data flows in metadata; correct privacy claims (STT/TTS use cloud APIs)
v1.0.0
Initial release: real-time AI video chat with voice, vision, and full agent personality
Metadata
Slug videochat-withme
Version 1.1.0
License
All-time Installs 2
Active Installs 2
Total Versions 5
Frequently Asked Questions

What is Video Chat With Me?

Real-time AI video chat that routes through your OpenClaw agent. Uses Groq Whisper (cloud STT), edge-tts (cloud TTS via Microsoft), and OpenClaw chatCompletions API for conversation. Your agent sees your camera, hears your voice, and responds with its own personality and memory. Requires: GROQ_API_KEY for speech recognition. Reads ~/.openclaw/openclaw.json for gateway port and auth token. Data flows: audio → Groq cloud (STT), TTS text → Microsoft cloud (edge-tts), camera frames (base64) + text → OpenClaw gateway → your configured LLM provider (may be cloud — frames leave the machine if using a cloud LLM). Installs a persistent launchd service (optional). Trigger phrases: "video chat", "voice call", "call me", "视频一下", "语音", "打电话给我", "我要和你视频", "videochat-withme". It is an AI Agent Skill for Claude Code / OpenClaw, with 1191 downloads so far.

How do I install Video Chat With Me?

Run "/install videochat-withme" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Video Chat With Me free?

Yes, Video Chat With Me is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Video Chat With Me support?

Video Chat With Me is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Video Chat With Me?

It is built and maintained by sxu75374 (@sxu75374); the current version is v1.1.0.

💬 Comments