Memory-optimization-plan/x-post-research-ksimback-memory-guide.md

# X Post Research: Hermes Memory Systems (Kevin Simback)

**Source:** https://x.com/ksimback/status/2058262328496554021
**Author:** Kevin Simback (@KSimback)
**Date saved:** 2026-05-24
**Context:** Saved from Telegram discussion for future reference alongside the Memory-optimization-plan repo.

---

## Core Thesis

Memory is the most important part of turning a stateless chatbot into a compounding agent. Without it, every session starts from zero. Hermes treats memory as **first-class pluggable infrastructure** with a clean 3-layer architecture.

---

## The 3-Layer Memory Stack

### Layer 1: Native (ships with every install, always on)
- Two small markdown files in `~/.hermes/`:
  - **MEMORY.md** (~2,200 char cap): Project context, technical decisions, key facts.
  - **USER.md** (~1,375 char cap): Your preferences, working style, role.
- Both files are **pasted in full** at the start of every session.
- Managed via the `memory` tool (`add`/`replace`/`remove`).
- **hermes_state.db** (SQLite + FTS5): Full archive of every session, tool call, reasoning step, and cost. Not injected into prompts — only searched when needed.
- Mental model: Markdown files = always-visible sticky notes. Database = deep archive you query.

### Layer 2: Official MemoryProvider plugins (only one active at a time)
Run `hermes memory setup` and pick one. They replace/augment the native layer.

- **Honcho (Plastic Labs)**: Models *how you think* and your reasoning patterns. Builds evolving user profiles. Popular in the Hermes Discord.
- **Mem0**: Easiest setup (API key). Automatic fact extraction + semantic search. Strong ecosystem (powers AWS Strands).
- **Hindsight (Vectorize.io)**: Benchmark leader (91.4+ on LongMemEval). Uses four memory networks (world facts, events, opinions, observations) + multi-strategy retrieval + a "reflect" tool.
- **Holographic**: Fully local, zero dependencies, air-gapped. Uses Holographic Reduced Representations in SQLite. Extremely fast.
- **OpenViking**: Memory as editable files in a directory tree with tiered loading (summary → paragraph → full content). Good for cost-sensitive or human-readable memory.
- **RetainDB**: Cheapest convenient cloud option. "Memory Router" mode for transparent operation.
- **ByteRover**: Memory as a git repo (branch, merge, rollback). Very low token use. Strong at preventing memory loss on restarts.
- **Supermemory**: Built for scale and low latency (sub-300ms at massive volume). Uses custom vector graph engine + "context fencing".

### Layer 3: Community extensions
These are additive or alternative.

- **GBrain (by garrytan)** — Simback's personal choice. Not a traditional MemoryProvider. It's an 8-layer knowledge engine that builds a self-wiring knowledge graph from markdown vaults (Obsidian/Logseq/Notion compatible). Features epistemology tracking, dream cycles (autonomous overnight synthesis), hybrid search, and versioned chunking. Excellent for world facts and "company of agents" setups.
- **Mnemosyne**: Strong fully-local MemoryProvider alternative. Tiered cognitive architecture (working/episodic/scratchpad memory) with time-aware recall.
- Others: Ladybug (importance scoring), yantrikdb (explainable retrieval), hermes-agentmemory (true deletion/privacy), PLUR (cross-agent memory sharing), FlowState-QMD (predictive warming).

---

## How to Choose & Warning Signs

Simback recommends starting simple (often just Layer 1 is enough) and only adding complexity when you have a clear need.

**Watch for overkill:**
- Noticeable latency
- Rising costs without increased usage
- Agent contradicting itself
- Context overflow errors
- No measurable improvement in output quality after weeks

---

## Relevance to This Repo

This X post research directly informs the **Three-Tier Memory System** documented in this repo's `readme.md`:
- Hot Memory ≈ Layer 1 native + per-session injection
- Vault Living Files ≈ Layer 3 vault-style systems (GBrain/Obsidian-style)
- Daily Notes ≈ Searchable timeline + hermes_state.db style archiving

The implementation using SilverBullet + Hermes native memory + cron pipelines is a practical, self-hosted realization of these concepts.

---

*Research captured via x_search tool on 2026-05-24. Saved to Gitea for persistent reference.*