4.2 KiB
X Post Research: Hermes Memory Systems (Kevin Simback)
Source: https://x.com/ksimback/status/2058262328496554021
Author: Kevin Simback (@KSimback)
Date saved: 2026-05-24
Context: Saved from Telegram discussion for future reference alongside the Memory-optimization-plan repo.
Core Thesis
Memory is the most important part of turning a stateless chatbot into a compounding agent. Without it, every session starts from zero. Hermes treats memory as first-class pluggable infrastructure with a clean 3-layer architecture.
The 3-Layer Memory Stack
Layer 1: Native (ships with every install, always on)
- Two small markdown files in
~/.hermes/:- MEMORY.md (~2,200 char cap): Project context, technical decisions, key facts.
- USER.md (~1,375 char cap): Your preferences, working style, role.
- Both files are pasted in full at the start of every session.
- Managed via the
memorytool (add/replace/remove). - hermes_state.db (SQLite + FTS5): Full archive of every session, tool call, reasoning step, and cost. Not injected into prompts — only searched when needed.
- Mental model: Markdown files = always-visible sticky notes. Database = deep archive you query.
Layer 2: Official MemoryProvider plugins (only one active at a time)
Run hermes memory setup and pick one. They replace/augment the native layer.
- Honcho (Plastic Labs): Models how you think and your reasoning patterns. Builds evolving user profiles. Popular in the Hermes Discord.
- Mem0: Easiest setup (API key). Automatic fact extraction + semantic search. Strong ecosystem (powers AWS Strands).
- Hindsight (Vectorize.io): Benchmark leader (91.4+ on LongMemEval). Uses four memory networks (world facts, events, opinions, observations) + multi-strategy retrieval + a "reflect" tool.
- Holographic: Fully local, zero dependencies, air-gapped. Uses Holographic Reduced Representations in SQLite. Extremely fast.
- OpenViking: Memory as editable files in a directory tree with tiered loading (summary → paragraph → full content). Good for cost-sensitive or human-readable memory.
- RetainDB: Cheapest convenient cloud option. "Memory Router" mode for transparent operation.
- ByteRover: Memory as a git repo (branch, merge, rollback). Very low token use. Strong at preventing memory loss on restarts.
- Supermemory: Built for scale and low latency (sub-300ms at massive volume). Uses custom vector graph engine + "context fencing".
Layer 3: Community extensions
These are additive or alternative.
- GBrain (by garrytan) — Simback's personal choice. Not a traditional MemoryProvider. It's an 8-layer knowledge engine that builds a self-wiring knowledge graph from markdown vaults (Obsidian/Logseq/Notion compatible). Features epistemology tracking, dream cycles (autonomous overnight synthesis), hybrid search, and versioned chunking. Excellent for world facts and "company of agents" setups.
- Mnemosyne: Strong fully-local MemoryProvider alternative. Tiered cognitive architecture (working/episodic/scratchpad memory) with time-aware recall.
- Others: Ladybug (importance scoring), yantrikdb (explainable retrieval), hermes-agentmemory (true deletion/privacy), PLUR (cross-agent memory sharing), FlowState-QMD (predictive warming).
How to Choose & Warning Signs
Simback recommends starting simple (often just Layer 1 is enough) and only adding complexity when you have a clear need.
Watch for overkill:
- Noticeable latency
- Rising costs without increased usage
- Agent contradicting itself
- Context overflow errors
- No measurable improvement in output quality after weeks
Relevance to This Repo
This X post research directly informs the Three-Tier Memory System documented in this repo's readme.md:
- Hot Memory ≈ Layer 1 native + per-session injection
- Vault Living Files ≈ Layer 3 vault-style systems (GBrain/Obsidian-style)
- Daily Notes ≈ Searchable timeline + hermes_state.db style archiving
The implementation using SilverBullet + Hermes native memory + cron pipelines is a practical, self-hosted realization of these concepts.
Research captured via x_search tool on 2026-05-24. Saved to Gitea for persistent reference.