Smart Memory

Name: Smart Memory
Author: bluepointdigital

Description

Persistent local transcript-first memory for OpenClaw via a Node adapter and FastAPI engine.

Usage Guidance

Install only if you want a local service that can retain and later surface conversation history. Review where the SQLite database and hot-memory files are stored, avoid putting secrets in persisted chats, keep the server bound to localhost, and be aware that setup downloads packages and the default embedding model enables trusted remote code execution unless changed.

Capability Analysis

Type: OpenClaw Skill Name: smart-memory Version: 3.1.0 The skill bundle implements a sophisticated local memory architecture (Smart Memory v3.1) designed to provide AI agents with persistent, transcript-backed context. The system uses a FastAPI backend (server.py) and a Node.js adapter (smart-memory/index.js) to manage memory ingestion, retrieval, and background 'cognition' tasks like memory decay and conflict resolution. While the bundle includes scripts that perform system-level operations—such as install.sh (cloning a repository), postinstall.js (setting up a Python virtual environment), and openclaw-prime.sh (spawning a background server)—these actions are transparently documented and necessary for the stated purpose of a local cognitive engine. No evidence of data exfiltration, unauthorized remote control, or malicious prompt injection was found; the instructions provided to the agent are functional guidelines for surfacing relevant memories.

Capability Assessment

⚠ Purpose & Capability

The memory, transcript, retrieval, inspection, and background cognition features fit the stated purpose, but the skill handles durable conversation history and exposes broad local inspection/rebuild APIs that can reveal or rewrite derived memory state.

ℹ Instruction Scope

Runtime instructions mostly match the memory purpose and use localhost APIs, but they encourage routine retrieval and prompt-context injection without strong consent, minimization, or sensitive-data handling guidance.

⚠ Install Mechanism

Installation creates a Python virtualenv and downloads Python packages automatically; install.sh also replaces the target smart-memory directory with rm -rf before reinstalling, which is bounded but destructive.

⚠ Credentials

A local FastAPI server, background tasks, transcript database, retry queue, and HTTP APIs are proportionate to a memory backend, but trust_remote_code=True for the default embedding model materially expands execution risk beyond ordinary local storage.

⚠ Persistence & Privilege

The skill intentionally persists transcripts, derived memories, hot-memory metadata, queued commits, and audit state on disk, and several helpers can auto-start a long-running localhost server; this is disclosed in parts but still high-impact for privacy and lifecycle control.

Version History

v3.1.0

Smart Memory v3.1 introduces transcript-first architecture, revision-aware memory, and expanded inspection. - Replaces legacy vector-memory structure with transcript-first ingest and deterministic rebuild - Adds evidence-backed revision lifecycle, memory supersession, and core/working memory lanes - Expands entity/relationship indexing and inspection endpoints for transcripts, evidence, and evaluation - Updates OpenClaw wrapper to support new backend transparently; API surface remains stable - Removes old `smart-memory-v25` implementation and supporting files

v2.5.0

# Changelog ## [2.5.0] - 2026-03-05 ### Added - New native OpenClaw skill package at `skills/smart-memory-v25/` for the local FastAPI cognitive engine. - Three active memory tools: - `memory_search` - `memory_commit` - `memory_insights` - Tool-level mandatory health gate (`GET /health`) before execution. - Persistent retry queue (`.memory_retry_queue.json`) for failed memory commits when server/embedder is unavailable. - Automatic retry queue flush on healthy tool calls and heartbeat. - Session arc lifecycle capture: - mid-session checkpoint every 20 turns - session-end episodic capture hook - Passive prompt injection middleware for `[ACTIVE CONTEXT]` formatting and pending insight guidance. - OpenClaw hook helper (`openclaw-hooks.js`) for turn and teardown integration. ### Changed - Memory commit flow now serializes commits to protect local CPU embedding throughput under bursty commit calls. - Retrieval wrapper now uses compatibility fallback for `/retrieve` payload filters, then applies type/relevance/limit filtering safely client-side. - Auto-tag fallback is now extensible via rule definitions (`tagging.js`) and includes default `working_question` + `decision` heuristics. - Documentation sweep completed: README now includes `skills/smart-memory-v25` architecture details, and skill docs now enforce CPU-only PyTorch policy with no GPU fallback guidance. - README architecture content was consolidated into a whole-system overview, Mermaid flowcharts were simplified for consistent rendering, and obsolete `ARCHITECTURE.md` was removed. ### Fixed - Commit failure behavior now returns explicit operational state feedback to the agent: - `Memory commit failed - server unreachable. Queued for retry.` --- ## [2.3.0] - 2026-03-05 ### Added (Hot Memory Extension) - **Persistent Working Context**: New optional extension for maintaining active projects, working questions, and top-of-mind items across sessions - `hot_memory_manager.py`: Core persistence with JSON storage - `memory_adapter.py`: API wrapper for seamless `/compose` integration - `smem-hook.sh`: Shell hook for post-conversation updates - **Auto-Detection**: Automatically identifies project mentions and questions from conversation content - Keyword-based project detection (Tappy.Menu, Content Foundry, etc.) - Question extraction (any message containing `?`) - Working context updates on every interaction - **Intelligent Duplicate Prevention**: Prevents duplicate project entries by matching on project keys rather than full descriptions - Extracts keys from "Project Name - Description" format - Normalized comparison prevents false duplicates - **Live Insight Integration**: Fetches pending insights from `/insights/pending` and includes them in composed prompts - **Full Documentation**: `HOT_MEMORY_EXTENSION.md` with usage guide and API reference ### Changed - Extended memory architecture to support session-surviving working context - Hot memory appears in `[WORKING CONTEXT]` section of composed prompts - Token budget allocates ~400 tokens for working memory by default --- ## [2.2.0] - 2026-03-05 ### Added - FastAPI observability endpoints for runtime inspection: - `GET /health` (embedder-loaded status and backend metadata) - `GET /memories` (with optional `?type=` filter) - `GET /memory/{memory_id}` - `GET /insights/pending` - Regression coverage for strict token budgeting and observability behavior. ### Changed - Standardized cognitive runtime installs to CPU-only PyTorch wheels in `postinstall.js`: - `pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu` - Added `einops>=0.8.0` to cognitive requirements for Nomic embedding compatibility. - `PromptComposerRequest.hot_memory` is now optional with a safe default payload. ### Fixed - Enforced strict `max_prompt_tokens` handling in prompt rendering with deterministic eviction order: 1. Oldest conversation history 2. Lower-ranked retrieved memories 3. Insight queue items 4. Working memory 5. Temporal state 6. Agent identity (preserved) - Retrieval access tracking now persists correctly: - increments `access_count` - updates `last_accessed` - Ingestion now performs semantic deduplication before writing new long-term memory: - top-1 similarity check (`> 0.85`) - reinforces existing memory instead of duplicating - increments belief `reinforced_count` where applicable - Belief conflict resolution thresholds were relaxed to detect shared-entity conflicts with opposing stance/sentiment. ## [2.1.2] - 2026-02-06 ### Security - **CRITICAL**: Fixed path traversal vulnerabilities in multiple files: - `memory.js`: `memoryGet()` function - `vector_memory_local.js`: `getFullContent()` function - Added path resolution validation to ensure all file access stays within workspace - Added allowlist check to restrict access to `MEMORY.md`, `memory/*.md`, and `.hot_memory.md` only - Blocks attempts like `../../../etc/passwd` or nested traversal patterns ## [2.1.1] - 2026-02-05 ### Added - AGENTS.md template for memory recall instructions - MEMORY_STRUCTURE.md with directory organization guide - Test script (`--test` command) for verification - Troubleshooting table in README - Better onboarding documentation ## [2.1.0] - 2026-02-04 ### Added - Smart wrapper with automatic fallback (vector -> built-in) - Zero-configuration philosophy - Graceful degradation when vector not ready ## [2.0.0] - 2026-02-04 ### Added - 100% local embeddings using `all-MiniLM-L6-v2` via Transformers.js - No API calls required - Semantic chunking (by headers, not just lines) - Cosine similarity scoring - JSON storage for personal-scale use - OpenClaw skill manifest - Programmatic API wrapper (`memory.js`) ### Changed - Replaced word-frequency embeddings with neural embeddings - Improved retrieval quality significantly - Better chunking strategy (semantic boundaries) ## [1.0.0] - 2026-02-04 ### Added - Initial version with word-frequency embeddings - Simple JSON storage - Basic CLI interface - pgvector support (Docker-based) ### Notes - Word-frequency method works but has limited semantic understanding - Neural embeddings (v2) recommended for production use

v2.2.0

- Major refactor: Migrated codebase from JavaScript to Python, replacing 12 JS files with 61 new Python files. - Introduced new modular architecture with dedicated directories for cognition, embeddings, entities, hot_memory, and ingestion. - Enhanced memory processing by adding agents for associative insight, belief conflict resolution, memory consolidation, heuristic filtering, and more. - Updated and expanded documentation, including new files like ARCHITECTURE.md and improved descriptions in SKILL.md. - Removed all legacy JavaScript logic, replacing it with the new Python-based system structure.

v2.1.2

- Added six new JavaScript module files: chunker.js, embed.js, hot_memory.js, memory_bridge.js, search.js, and session_memory.js. - Enhanced project codebase by separating functionality into dedicated modules for chunking, embedding, searching, and memory management. - No user-facing changes documented; update increases maintainability and modularity.

v1.0.1

- Added curated retrieval via Focus Agent, enabling multi-pass context synthesis for complex queries - Introduced dual search modes: fast (direct) and focus (curated), with live toggling support - Improved hybrid search combining FTS5 keyword (BM25) and semantic vector similarity - Provided CLI and JS API for mode switching and retrieval with detailed examples - Enhanced performance and scalability with optional sqlite-vec native vector search - Expanded documentation with installation, usage, environment variables, and feature comparison

Metadata

Slug smart-memory

Version 3.1.0

License MIT-0

All-time Installs 289

Active Installs 66

Total Versions 5

Frequently Asked Questions

What is Smart Memory?

Persistent local transcript-first memory for OpenClaw via a Node adapter and FastAPI engine. It is an AI Agent Skill for Claude Code / OpenClaw, with 7676 downloads so far.

How do I install Smart Memory?

Run "/install smart-memory" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Smart Memory free?

Yes, Smart Memory is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Smart Memory support?

Smart Memory is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Smart Memory?

It is built and maintained by BluePointDigital (@bluepointdigital); the current version is v3.1.0.

More Skills

What is Smart Memory?

How do I install Smart Memory?

Is Smart Memory free?

Which platforms does Smart Memory support?

Who created Smart Memory?

💬 Comments