---
title: "Deploy Hindsight Agent Memory on Docker: Complete Setup Guide"
description: "Step-by-step guide to deploying Hindsight, an open-source agent memory system, using Docker Compose with PostgreSQL and pgvector for production use."
date: 2026-07-03
categories: ["AI"]
tags: ["ai-agents","docker","self-hosted"]
---

import Button from "../../components/widgets/Button.astro";
import YouTubeEmbed from "../../components/widgets/YouTubeEmbed.astro";
import Tabs from "../../components/widgets/Tabs.astro";
import Tab from "../../components/widgets/Tab.astro";
import Notice from "../../components/widgets/Notice.astro";
import Accordion from "../../components/widgets/Accordion.astro";
import ListCheck from "../../components/widgets/ListCheck.astro";
import { Picture } from "astro:assets";
import hindsightUi from "../../assets/images/26/07/hindsight-ui.webp";

Most AI agents forget everything the moment a conversation ends. You tell them your preferences, correct their mistakes, feed them context, and the next session starts from scratch. Hindsight fixes that.

[Hindsight](https://github.com/vectorize-io/hindsight) is an open-source agent memory system built by Vectorize.io. It doesn't just store conversation history like a glorified chat log. Instead, it extracts facts, builds mental models, and learns from interactions over time. On the LongMemEval benchmark (the standard test for agent memory), it outperforms every other solution currently available.

The core idea: agents should get better the more you use them, the same way a human assistant learns your preferences over weeks and months.

This guide walks through deploying Hindsight on Docker with a proper PostgreSQL backend, configuring it for production use, and interacting with it through the API and client libraries.
## What Hindsight actually does

Hindsight organizes memory into three categories:

- **World facts** - Things that are true ("The project uses PostgreSQL 17")
- **Experiences** - Things that happened ("Last deployment broke because of a migration issue")
- **Mental models** - Patterns formed by reflecting on facts and experiences ("This user prefers detailed error messages over brief summaries")

When you add new information through the `retain` operation, Hindsight runs it through an LLM to extract entities, relationships, and temporal data. It stores these as a combination of vector embeddings, keyword indexes, and graph structures.

When you search with `recall`, it runs four retrieval strategies in parallel:

1. Semantic search (vector similarity)
2. Keyword matching (BM25)
3. Graph traversal (entity and relationship links)
4. Temporal filtering (time ranges)

Results get merged with reciprocal rank fusion and reranked for relevance.

The third operation, `reflect`, goes deeper. It pulls together related memories and generates new observations. Think of it as the agent thinking about what it knows, rather than just retrieving it.

## Prerequisites

You'll need:

- A VPS or home server running Linux. I recommend [Hetzner](https://go.bitdoze.com/hetzner) or [Hostinger](https://go.bitdoze.com/hostinger-vps) for VPS hosting
- Docker and Docker Compose installed
- An OpenAI API key (or another supported LLM provider)
- At least 2 GB of RAM available (the slim image uses less, the full image needs more)
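
If you want to verify the list above before going further, a small pre-flight script can help. The following is a minimal sketch, not part of Hindsight: the function names are hypothetical, it assumes a Linux host that exposes `/proc/meminfo`, and it assumes the key lives in an `OPENAI_API_KEY` environment variable, which may not be the variable name Hindsight itself reads.

```python
import shutil
import subprocess

# 2 GB expressed in kB, the minimum named in the prerequisites list
MIN_RAM_KB = 2 * 1024 * 1024

# Assumption: conventional OpenAI variable name, not confirmed for Hindsight
API_KEY_VAR = "OPENAI_API_KEY"


def parse_mem_available_kb(meminfo_text):
    """Return MemAvailable in kB from /proc/meminfo-style text, or None."""
    for line in meminfo_text.splitlines():
        if line.startswith("MemAvailable:"):
            return int(line.split()[1])
    return None


def has_docker_compose():
    """True if the Compose v2 plugin or the standalone binary is present."""
    if shutil.which("docker"):
        result = subprocess.run(
            ["docker", "compose", "version"], capture_output=True
        )
        if result.returncode == 0:
            return True
    return shutil.which("docker-compose") is not None


def preflight(meminfo_text, env):
    """Collect readable problems; an empty list means every check passed."""
    problems = []
    if shutil.which("docker") is None:
        problems.append("docker not found on PATH")
    elif not has_docker_compose():
        problems.append("Docker Compose not available")
    if not env.get(API_KEY_VAR):
        problems.append(f"{API_KEY_VAR} is not set")
    mem_kb = parse_mem_available_kb(meminfo_text)
    if mem_kb is not None and mem_kb < MIN_RAM_KB:
        problems.append(
            f"only {mem_kb // 1024} MB RAM available, 2048 MB recommended"
        )
    return problems
```

On the target server you would call `preflight(open("/proc/meminfo").read(), os.environ)` and print whatever it returns. Passing the meminfo text and environment in as arguments keeps the checks easy to try with sample values. If you use a different LLM provider, change `API_KEY_VAR` or drop that check.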