What Honcho provides
Honcho acts as a long-term memory and user-model layer alongside Hermes’ built-in memory files (MEMORY.md and USER.md).
It gives Hermes three capabilities:
- Prompt-time context injection — durable context about a user loaded into the prompt before generating a response.
- Cross-session continuity — recall of stable preferences, project history, and working context across conversations.
- Durable writeback — stable facts learned during a conversation stored back for future turns.
Dual-peer architecture
Both the user and the AI agent have peer representations in Honcho:
- User peer: observed from user messages. Learns preferences, goals, and communication style.
- AI peer: observed from assistant messages. Builds the agent’s knowledge representation.
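As a purely conceptual sketch of the split described above, the snippet below routes messages to one of two peers by role. The `Peer` class, the `ROLE_TO_PEER` mapping, and the `route_messages` function are hypothetical names chosen for illustration; they are not part of the Honcho SDK or of Hermes.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Peer:
    """Hypothetical stand-in for a peer representation."""
    name: str
    observations: List[str] = field(default_factory=list)


# Assumption: messages carry a "role" of "user" or "assistant".
ROLE_TO_PEER = {"user": "user", "assistant": "ai"}


def route_messages(messages: List[Dict[str, str]]) -> Dict[str, Peer]:
    """Attach each message to the peer that is observed from it."""
    peers = {"user": Peer("user"), "ai": Peer("ai")}
    for message in messages:
        peer_key = ROLE_TO_PEER.get(message["role"])
        if peer_key is None:
            # Other roles (system, tool, ...) are not covered by this sketch.
            continue
        peers[peer_key].observations.append(message["content"])
    return peers
```

The point of the sketch is only that the two peers are fed from different sides of the same conversation; how Honcho derives preferences or knowledge from those observations is not modeled here.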
Available tools
Hermes exposes four Honcho tools to the agent:
| Tool | What it does |
|---|---|
| honcho_profile | Fast peer card retrieval (no LLM). Returns curated key facts about the user. |
| honcho_search | Semantic search over memory. Returns raw excerpts ranked by relevance. |
| honcho_context | Dialectic Q&A powered by Honcho’s LLM. Synthesizes answers from conversation history. |
| honcho_conclude | Writes durable facts to Honcho when the user states preferences, corrections, or important context. |
Running Honcho locally with Hermes
Follow the Self-Hosting Guide to get Honcho running locally. Once it’s up, point Hermes at your instance through its Honcho config file (checked in the order $HERMES_HOME/honcho.json > ~/.hermes/honcho.json > ~/.honcho/config.json).
For the other configuration options (recallMode, writeFrequency, sessionStrategy, dialecticReasoningLevel, etc.), see the Hermes memory provider docs.
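The config file lookup chain above can be sketched in a few lines of Python. This is a minimal illustration, assuming the `>` chain means "first existing file wins"; the function names `candidate_config_paths` and `resolve_config_path` are hypothetical and do not come from the Hermes source.

```python
import os
from pathlib import Path
from typing import List, Mapping, Optional, Union


def candidate_config_paths(
    env: Optional[Mapping[str, str]] = None,
    home: Optional[Union[str, Path]] = None,
) -> List[Path]:
    """Return the candidate config paths, highest precedence first."""
    env = os.environ if env is None else env
    home_dir = Path.home() if home is None else Path(home)
    paths: List[Path] = []
    hermes_home = env.get("HERMES_HOME")
    if hermes_home:
        # $HERMES_HOME/honcho.json only applies when the variable is set.
        paths.append(Path(hermes_home) / "honcho.json")
    paths.append(home_dir / ".hermes" / "honcho.json")
    paths.append(home_dir / ".honcho" / "config.json")
    return paths


def resolve_config_path(
    env: Optional[Mapping[str, str]] = None,
    home: Optional[Union[str, Path]] = None,
) -> Optional[Path]:
    """Return the first candidate that exists as a file, or None."""
    for path in candidate_config_paths(env, home):
        if path.is_file():
            return path
    return None
```

Calling `resolve_config_path()` with no arguments uses the real environment and home directory, which can help when checking which file a local setup would pick up under this assumed precedence.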
Community quick-start: elkimek/honcho-self-hosted provides a one-command installer with pre-configured model tiers and Hermes Agent integration.
Verifying the integration
1. Check status
2. Store a fact and recall it across sessions
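In one conversation, tell Hermes something specific. Then, in a new session, ask about it and confirm that Hermes recalls the fact.
3. Test tool calling directly
Ask Hermes to use a specific Honcho tool, such as honcho_search.
Configuration options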
| Field | Default | Description |
|---|---|---|
| recallMode | hybrid | hybrid (auto-inject + tools), context (inject only), tools (tools only) |
| writeFrequency | async | async, turn, session, or integer N |
| sessionStrategy | per-directory | per-directory, per-repo, per-session, global |
| dialecticReasoningLevel | low | minimal, low, medium, high, max |
| dialecticDynamic | true | Auto-bump reasoning level by query complexity |
| messageMaxChars | 25000 | Max chars per message (chunked if exceeded) |
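To make the table concrete, here is a small sketch that applies the listed defaults, rejects values outside the listed sets, and splits an over-long message. It is an illustration under stated assumptions, not Hermes' own validation code: `normalize_config` and `chunk_message` are hypothetical helpers, the integer form of writeFrequency is assumed to be positive, and the fixed-width character slicing is only one possible reading of "chunked if exceeded".

```python
from typing import Any, Dict, List

# Defaults and allowed values copied from the table above.
DEFAULTS: Dict[str, Any] = {
    "recallMode": "hybrid",
    "writeFrequency": "async",
    "sessionStrategy": "per-directory",
    "dialecticReasoningLevel": "low",
    "dialecticDynamic": True,
    "messageMaxChars": 25000,
}

ALLOWED = {
    "recallMode": ("hybrid", "context", "tools"),
    "sessionStrategy": ("per-directory", "per-repo", "per-session", "global"),
    "dialecticReasoningLevel": ("minimal", "low", "medium", "high", "max"),
}


def _is_positive_int(value: Any) -> bool:
    # bool is a subclass of int in Python, so exclude it explicitly.
    return isinstance(value, int) and not isinstance(value, bool) and value > 0


def normalize_config(overrides: Dict[str, Any]) -> Dict[str, Any]:
    """Merge overrides onto the defaults and check them against the table."""
    config = {**DEFAULTS, **overrides}
    for name, allowed in ALLOWED.items():
        if config[name] not in allowed:
            raise ValueError(f"{name} must be one of {allowed}, got {config[name]!r}")
    write_frequency = config["writeFrequency"]
    if not _is_positive_int(write_frequency) and write_frequency not in (
        "async", "turn", "session"
    ):
        raise ValueError("writeFrequency must be async, turn, session, or an integer N")
    if not isinstance(config["dialecticDynamic"], bool):
        raise ValueError("dialecticDynamic must be true or false")
    if not _is_positive_int(config["messageMaxChars"]):
        raise ValueError("messageMaxChars must be a positive integer")
    return config


def chunk_message(text: str, max_chars: int = 25000) -> List[str]:
    """Split text into consecutive pieces of at most max_chars characters."""
    if max_chars <= 0:
        raise ValueError("max_chars must be positive")
    return [text[i:i + max_chars] for i in range(0, len(text), max_chars)]
```

For example, `normalize_config({"recallMode": "tools"})` keeps every other field at its default, while `normalize_config({"recallMode": "bogus"})` raises `ValueError`.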
Next steps
- Hermes Agent Honcho Docs: setup, configuration, CLI commands, and all config options.
- Hermes Agent Source: source code, installation, and full documentation.
- Honcho Architecture: peers, sessions, and how reasoning works.
- Self-Hosting Guide: full local environment setup, provider configuration, and troubleshooting.