← Back to Skills Marketplace
alimostafaradwan

Gemini Assistant

by Ali Mostafa Radwan · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
589
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install gemini-assistant
Description
General-purpose AI assistant using Gemini API with voice and text support. Use when you need a smart AI assistant that can answer questions, have conversatio...
README (SKILL.md)

Gemini Assistant

A general-purpose AI assistant powered by Google's Gemini API. Supports both text and voice interactions.

Usage

Text Mode

cd ~/.openclaw/agents/kashif/skills/gemini-assistant && python3 handler.py "Your question or message"

Voice Mode

cd ~/.openclaw/agents/kashif/skills/gemini-assistant && python3 handler.py --audio /path/to/audio.ogg "optional context"

Response Format

The handler returns a JSON response:

{
  "message": "[[audio_as_voice]]\
MEDIA:/tmp/gemini_voice_xxx.ogg",
  "text": "Text response from Gemini"
}

Configuration

Set your Gemini API key:

export GEMINI_API_KEY="your-api-key-here"

Or create a .env file in the skill directory:

GEMINI_API_KEY=your-api-key-here

Model Options

The default model is gemini-2.5-flash-native-audio-preview-12-2025 for audio support.

To use a different model, edit handler.py:

MODEL = "gemini-2.0-flash-exp"  # For text-only

Requirements

  • google-genai>=1.0.0
  • numpy>=1.24.0
  • soundfile>=0.12.0
  • librosa>=0.10.0 (for audio input)
  • FFmpeg (for audio conversion)

Features

  • 🎙️ Voice input/output support
  • 💬 Text conversations
  • 🔧 Configurable system instructions
  • ⚡ Fast responses with Gemini Flash
Usage Guidance
This skill appears to be an ordinary Gemini voice/text assistant, but there are multiple packaging/documentation mismatches you should clear up before use: 1) Confirm that GEMINI_API_KEY is required (skill.json and SKILL.md require it; the registry summary omitted it). 2) Be aware the handler auto-loads a .env file in the skill folder — don't store other secrets there. 3) The documented response format (including a 'text' field) doesn't match handler.py, which only returns audio MEDIA paths; callers expecting text will fail. 4) The skill requires ffmpeg on the host and will write OGG files to /tmp. 5) If you rely on the skill for both audio and text, either test it or modify handler.py to include text outputs (or update model/config to request TEXT modality). If you don't trust the source, inspect or run the code in a sandboxed environment and provide only a dedicated Gemini API key with limited scope rather than reuse broader credentials.
Capability Analysis
Type: OpenClaw Skill Name: gemini-assistant Version: 1.0.0 The skill is classified as suspicious due to the user's ability to provide arbitrary `system_instruction` to the Gemini model via `handler.py`, which allows for prompt injection against the external AI service. While not directly compromising the host system or OpenClaw agent, this capability allows a user to manipulate the AI's behavior beyond its intended persona. Additionally, `handler.py` modifies the `LD_LIBRARY_PATH` environment variable for the `ffmpeg` subprocess, which, while likely a benign workaround, is a risky practice that could be exploited in a compromised environment.
Capability Assessment
Purpose & Capability
The name/description (Gemini-based voice+text assistant) aligns with the code, which calls google.genai and handles audio/text. However metadata inconsistencies exist: the registry summary claims no required env vars while skill.json and SKILL.md require GEMINI_API_KEY. skill.json also advertises both AUDIO and TEXT modalities but the handler's runtime configuration requests only AUDIO. These mismatches reduce confidence in CI/packaging quality.
Instruction Scope
SKILL.md instructs setting GEMINI_API_KEY (and mentions a .env file) and running handler.py; the code does auto-load a .env file in the skill directory. SKILL.md shows an expected JSON response including a 'text' field, but handler.py does not populate a 'text' key — it only produces an audio MEDIA path or an error message. The discrepancy means the documented response format is inaccurate and callers expecting a 'text' field may break.
Install Mechanism
There is no install spec (instruction-only install), and dependencies are standard Python/audio libraries (google-genai, numpy, librosa, soundfile). No remote downloads or obscure installers are used. The skill does call the system ffmpeg binary (/usr/bin/ffmpeg), which must exist on the host.
Credentials
The skill uses a single service credential (GEMINI_API_KEY), which is proportionate for a Gemini client. However the registry metadata asserted 'required env vars: none' while both SKILL.md and skill.json list GEMINI_API_KEY — this inconsistency is suspicious and should be clarified. The handler also auto-loads any .env file found in the skill directory, which may cause it to pick up secrets placed there.
Persistence & Privilege
The skill does not request always:true, does not modify other skills or global configs, and only writes temporary files to /tmp and the skill directory (.env read). It sets an LD_LIBRARY_PATH for ffmpeg subprocess but does not persist system-wide changes.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install gemini-assistant
  3. After installation, invoke the skill by name or use /gemini-assistant
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release - General AI assistant using Gemini API with voice and text support
Metadata
Slug gemini-assistant
Version 1.0.0
License
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Gemini Assistant?

General-purpose AI assistant using Gemini API with voice and text support. Use when you need a smart AI assistant that can answer questions, have conversatio... It is an AI Agent Skill for Claude Code / OpenClaw, with 589 downloads so far.

How do I install Gemini Assistant?

Run "/install gemini-assistant" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Gemini Assistant free?

Yes, Gemini Assistant is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Gemini Assistant support?

Gemini Assistant is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Gemini Assistant?

It is built and maintained by Ali Mostafa Radwan (@alimostafaradwan); the current version is v1.0.0.

💬 Comments