Configuration Reference

This page provides a complete reference for all environment variables and configuration options used in AutoMem. Configuration controls service connectivity, authentication, embedding generation, background processing, and search behavior.

For a quick-reference table grouped by category, see Environment Variables. For deployment-specific setup, see Docker & Local Dev or Railway Deployment.

Configuration Loading

AutoMem loads environment variables from multiple sources in a specific precedence order, allowing flexible configuration across different deployment scenarios.

Load order (highest to lowest priority):

Process environment — Variables set via export or passed directly to the process
Project .env — File in repository root
User config — ~/.config/automem/.env (user-specific, never committed)

This hierarchy allows local overrides without modifying shared configuration files. For example, a developer can set FALKORDB_HOST=localhost in their user config while the project .env contains shared development defaults.

graph TB
    subgraph "Configuration Hierarchy"
        EnvFile[".env<br/>(Repository Root)"]
        HomeConfig["~/.config/automem/.env<br/>(User-specific)"]
        RailwaySecrets["Railway Secrets<br/>(Production)"]
    end

    AppPy["app.py"]
    ConsPy["consolidation.py"]
    Scripts["scripts/*.py"]

    EnvFile -->|"dotenv.load_dotenv()"| AppPy
    HomeConfig -->|"dotenv.load_dotenv()"| AppPy
    RailwaySecrets -->|"os.getenv()"| AppPy

    AppPy -->|"passes config"| ConsPy
    EnvFile -->|"directly read"| Scripts

Server Configuration

Core Service Connection

Required configuration for connecting to data stores:

Variable	Type	Required	Default	Description
`FALKORDB_HOST`	string	Yes	`localhost`	Graph database hostname
`FALKORDB_PORT`	int	Yes	`6379`	Graph database port
`FALKORDB_PASSWORD`	string	No	unset	Redis/FalkorDB password if auth enabled
`FALKORDB_GRAPH`	string	No	`memories`	Graph database name (Cypher `GRAPH.QUERY` target)
`GRAPH_NAME`	string	No	`memories`	Alias for `FALKORDB_GRAPH`
`PORT`	int	No	`8001`	Flask API server port

graph TB
    subgraph external["External Clients"]
        Client["HTTP Client"]
    end

    subgraph railway["Railway Internal Network (IPv6)"]
        subgraph mcp["mcp-sse-server"]
            MCPServer["Node.js Server<br/>Binds to ::<br/>PORT from env"]
        end

        subgraph api["memory-service"]
            FlaskApp["Flask App<br/>app.py<br/>Binds to ::<br/>PORT from env"]
        end

        subgraph db["falkordb"]
            FalkorDB["FalkorDB<br/>Port 6379<br/>FALKOR_PASSWORD"]
        end
    end

    Client -->|"HTTPS<br/>Public Domain"| MCPServer
    Client -->|"HTTPS<br/>Public Domain"| FlaskApp

    MCPServer -->|"HTTP<br/>memory-service.railway.internal:8001"| FlaskApp
    FlaskApp -->|"Redis Protocol<br/>falkordb.railway.internal:6379"| FalkorDB

Vector Search (Optional)

Qdrant configuration enables semantic search but is not required. AutoMem operates in graph-only mode if these are unset:

Variable	Type	Required	Default	Description
`QDRANT_URL`	string	No	unset	Qdrant API endpoint (HTTP/HTTPS)
`QDRANT_API_KEY`	string	No	unset	Qdrant authentication key (required for cloud)
`QDRANT_COLLECTION`	string	No	`memories`	Collection name for memory vectors
`COLLECTION_NAME`	string	No	`memories`	Alias for `QDRANT_COLLECTION`
`VECTOR_SIZE`	int	No	`3072`	Embedding dimension (768/1024/2048/3072)

Authentication

All endpoints except /health require authentication:

Variable	Type	Required	Default	Description
`AUTOMEM_API_TOKEN`	string	Yes	unset	Authentication token for standard operations
`ADMIN_API_TOKEN`	string	Yes	unset	Token for admin endpoints (`/admin/*`, `/enrichment/reprocess`)
`API_TOKEN`	string	No	unset	Fallback alias for `AUTOMEM_API_TOKEN`
`ADMIN_TOKEN`	string	No	unset	Fallback alias for `ADMIN_API_TOKEN`

Embedding Provider Configuration

Controls embedding generation with automatic provider selection:

Variable	Type	Required	Default	Description
`EMBEDDING_PROVIDER`	string	No	`auto`	Provider selection mode (`auto`/`voyage`/`openai`/`local`/`ollama`/`placeholder`)
`EMBEDDING_MODEL`	string	No	`text-embedding-3-large`	OpenAI model name or identifier
`VOYAGE_API_KEY`	string	No	unset	Voyage AI API key
`VOYAGE_MODEL`	string	No	`voyage-4`	Voyage model (`voyage-4`, `voyage-4-large`)
`OPENAI_API_KEY`	string	No	unset	OpenAI or compatible provider API key
`OPENAI_BASE_URL`	string	No	unset	Custom endpoint for OpenAI-compatible APIs (OpenRouter, LiteLLM, vLLM)
`OLLAMA_BASE_URL`	string	No	`http://localhost:11434`	Ollama server endpoint
`OLLAMA_MODEL`	string	No	`nomic-embed-text`	Ollama embedding model name

Provider characteristics:

Provider	Quality	Cost	Offline	Dimensions	API Key
Voyage	Excellent	$0.00012/1K tokens	No	256/512/1024/2048	Required
OpenAI	Excellent	$0.00002–$0.00013/1K tokens	No	768/3072	Required
Ollama	Good	Free	Yes	Model-dependent	Not required
FastEmbed	Good	Free	Yes (after download)	384/768/1024	Not required
Placeholder	None	Free	Yes	Configurable	Not required

When EMBEDDING_PROVIDER=auto, the provider is selected by checking API key availability in order: Voyage, then OpenAI, then local/Ollama, then placeholder.

Embedding Batching

Controls batch processing to reduce API costs:

Variable	Type	Required	Default	Description
`EMBEDDING_BATCH_SIZE`	int	No	`20`	Items per batch API call
`EMBEDDING_BATCH_TIMEOUT_SECONDS`	float	No	`2.0`	Max wait time before flushing partial batch

Batching reduces API costs by 40–50% by combining multiple embedding requests into single API calls. The timeout ensures reasonable latency even when traffic is low.

Enrichment Pipeline

Controls automatic background enrichment after memory storage:

Variable	Type	Required	Default	Description
`ENRICHMENT_MAX_ATTEMPTS`	int	No	`3`	Retry limit before marking failed
`ENRICHMENT_SIMILARITY_LIMIT`	int	No	`5`	Number of semantic neighbors to link
`ENRICHMENT_SIMILARITY_THRESHOLD`	float	No	`0.8`	Min cosine similarity for `SIMILAR_TO` edge
`ENRICHMENT_IDLE_SLEEP_SECONDS`	int	No	`2`	Worker sleep duration when queue empty
`ENRICHMENT_FAILURE_BACKOFF_SECONDS`	int	No	`5`	Delay between retry attempts
`ENRICHMENT_ENABLE_SUMMARIES`	bool	No	`true`	Auto-generate memory summaries
`ENRICHMENT_SPACY_MODEL`	string	No	`en_core_web_sm`	spaCy model for NER (if installed)

Entity tag generation:

Extracted entities become structured tags with the pattern entity:<type>:<slug>:

Entity Type	Example Input	Generated Tag
Tool	`PostgreSQL`	`entity:tool:postgresql`
Project	`automem`	`entity:project:automem`
Person	`Sarah`	`entity:person:sarah`
Organization	`OpenAI`	`entity:organization:openai`
Concept	`ACID compliance`	`entity:concept:acid-compliance`

Consolidation Engine

Controls background memory maintenance cycles:

Variable	Type	Required	Default	Description
`CONSOLIDATION_TICK_SECONDS`	int	No	`60`	Scheduler check interval
`CONSOLIDATION_DECAY_INTERVAL_SECONDS`	int	No	`86400`	Decay cycle frequency (1 day)
`CONSOLIDATION_DECAY_IMPORTANCE_THRESHOLD`	float	No	`0.3`	Min importance to process in decay
`CONSOLIDATION_CREATIVE_INTERVAL_SECONDS`	int	No	`604800`	Creative cycle frequency (1 week)
`CONSOLIDATION_CLUSTER_INTERVAL_SECONDS`	int	No	`2592000`	Cluster cycle frequency (1 month)
`CONSOLIDATION_FORGET_INTERVAL_SECONDS`	int	No	`0`	Forget cycle frequency (disabled by default)
`CONSOLIDATION_ARCHIVE_THRESHOLD`	float	No	`0.2`	Relevance threshold for archiving
`CONSOLIDATION_DELETE_THRESHOLD`	float	No	`0.05`	Relevance threshold for deletion
`CONSOLIDATION_GRACE_PERIOD_DAYS`	int	No	`30`	Min age before memory can be forgotten
`CONSOLIDATION_IMPORTANCE_PROTECTION_THRESHOLD`	float	No	`0.7`	Memories above this importance are protected
`CONSOLIDATION_PROTECTED_TYPES`	string	No	`Decision,Pattern`	Comma-separated types to never forget

Consolidation task details:

Task	Cypher Operation	Purpose	Protected Conditions
Decay	`SET m.importance = m.importance * decay_factor`	Exponential relevance reduction	`type IN PROTECTED_TYPES` OR `importance > IMPORTANCE_PROTECTION_THRESHOLD`
Creative	`MATCH (m1:Memory)-[*..3]-(m2:Memory)`	Multi-hop association discovery	N/A
Cluster	`MATCH (m:Memory) ... CREATE (p:Pattern)`	Pattern node generation	N/A
Forget	`SET m.archived = true` or `DELETE m`	Archive/remove low-value memories	Age < `GRACE_PERIOD_DAYS` OR protected conditions

Search Scoring Weights

Fine-tune hybrid search ranking. Weights are applied to individual signals and summed to produce a final score:

Variable	Type	Required	Default	Description
`SEARCH_WEIGHT_VECTOR`	float	No	`0.25`	Vector similarity component
`SEARCH_WEIGHT_KEYWORD`	float	No	`0.15`	Keyword/TF-IDF matching
`SEARCH_WEIGHT_TAG`	float	No	`0.10`	Tag overlap score
`SEARCH_WEIGHT_IMPORTANCE`	float	No	`0.05`	User-assigned importance
`SEARCH_WEIGHT_RECENCY`	float	No	`0.10`	Freshness boost
`SEARCH_WEIGHT_CONFIDENCE`	float	No	`0.05`	Memory confidence score
`SEARCH_WEIGHT_EXACT`	float	No	`0.25`	Content token overlap

The default weights balance semantic similarity (vector + keyword + exact = 0.65) with metadata signals (tags + importance + recency + confidence = 0.35). Adjust weights to favor specific signals for your use case.

Recall Behavior

Controls query result expansion and limits:

Variable	Type	Required	Default	Description
`RECALL_MAX_LIMIT`	int	No	`100`	Maximum results returned by `/recall`
`RECALL_RELATION_LIMIT`	int	No	`5`	Max related memories per result
`RECALL_EXPANSION_LIMIT`	int	No	`20`	Max memories added via `expand_relations=true`

Sync Worker (Drift Repair)

Controls automatic drift detection between FalkorDB and Qdrant:

Variable	Type	Required	Default	Description
`SYNC_CHECK_INTERVAL_SECONDS`	int	No	`300`	Frequency of drift checks (5 minutes)
`SYNC_AUTO_REPAIR`	bool	No	`true`	Automatically queue missing embeddings

The sync worker counts memories in FalkorDB vs Qdrant and queues repair operations when drift exceeds 5%.

Memory Type Configuration

Controls classification and relationship validation:

Variable	Type	Required	Default	Description
`MEMORY_TYPES`	string	No	See below	Comma-separated valid memory types
`RELATIONSHIP_TYPES`	string	No	See below	Comma-separated valid relationship types
`ALLOWED_RELATIONS`	string	No	Same as `RELATIONSHIP_TYPES`	Alias for backward compatibility

Default memory types:

Decision, Pattern, Preference, Style, Habit, Insight, Context, Memory

Default relationship types:

RELATES_TO, LEADS_TO, OCCURRED_BEFORE, PREFERS_OVER, EXEMPLIFIES,
CONTRADICTS, REINFORCES, INVALIDATED_BY, EVOLVED_INTO, DERIVED_FROM, PART_OF

Type aliases — The TYPE_ALIASES mapping in automem.config normalizes variations:

Input	Normalized To
`decision`, `decisions`	`Decision`
`pattern`, `patterns`	`Pattern`
`preference`, `preferences`	`Preference`
`fact`, `facts`, `knowledge`	`Context`

Classification Model

Controls LLM-based memory classification fallback:

Variable	Type	Required	Default	Description
`CLASSIFICATION_MODEL`	string	No	`gpt-4o-mini`	OpenAI model for content classification

When an explicit type is not provided in the request, or regex patterns fail to match, AutoMem uses the LLM classification model. The system prompt for classification is defined in MemoryClassifier.SYSTEM_PROMPT in app.py.

MCP Client Configuration

The mcp-automem client uses two primary environment variables to locate and authenticate with the AutoMem backend service. These can be set via .env file, shell environment, or platform-specific MCP configuration files.

Variable	Required	Default	Description
`AUTOMEM_ENDPOINT`	Yes	`http://127.0.0.1:8001`	HTTP URL of the AutoMem service
`AUTOMEM_API_KEY`	No	(none)	API key for authenticated instances (preferred name)
`AUTOMEM_API_TOKEN`	No	(none)	Alternative name for the API key
`AUTOMEM_PROCESS_TAG`	No	(none)	Process title tag for safe cleanup in multi-process environments
`MCP_PROCESS_TAG`	No	(none)	Alternative process tag variable
`AUTOMEM_LOG_LEVEL`	No	(none)	Set to `debug` for verbose logging

Common endpoint values:

Local development: http://127.0.0.1:8001 or http://localhost:8001
Railway deployment: https://your-service.railway.app
Custom deployment: Your service’s public or internal URL

API key resolution order — The readAutoMemApiKeyFromEnv() function checks variables in this priority order:

graph TB
    subgraph Env_Resolution["Environment Variable Resolution"]
        DOTENV["dotenv.config()<br/>.env file loading"]

        ENDPOINT_CHECK{"AUTOMEM_ENDPOINT<br/>exists?"}
        ENDPOINT_DEFAULT["Default:<br/>http://127.0.0.1:8001"]
        ENDPOINT_VALUE["Use env value"]

        API_KEY_FUNC["readAutoMemApiKeyFromEnv()"]
        KEY_PRIORITY["Priority:<br/>1. AUTOMEM_API_KEY<br/>2. AUTOMEM_API_TOKEN<br/>3. AUTOMEM_TOKEN<br/>4. API_KEY"]
    end

    subgraph Client_Config["AutoMemClient Config"]
        CONFIG_OBJ["clientConfig: AutoMemConfig<br/>{ endpoint, apiKey }"]
        CLIENT_INSTANCE["new AutoMemClient(config)"]
    end

    DOTENV --> ENDPOINT_CHECK
    ENDPOINT_CHECK -->|"No"| ENDPOINT_DEFAULT
    ENDPOINT_CHECK -->|"Yes"| ENDPOINT_VALUE
    ENDPOINT_DEFAULT --> CONFIG_OBJ
    ENDPOINT_VALUE --> CONFIG_OBJ

    DOTENV --> API_KEY_FUNC
    API_KEY_FUNC --> KEY_PRIORITY
    KEY_PRIORITY --> CONFIG_OBJ

    CONFIG_OBJ --> CLIENT_INSTANCE

Configuration Resolution Priority

The client resolves configuration from multiple sources with a defined priority order:

graph TB
    subgraph "Configuration Resolution Priority"
        direction TB
        CMD["1. Command-line args<br/>--endpoint, --api-key"]
        ENV_FILE["2. .env file<br/>current directory"]
        PROCESS_ENV["3. Process environment<br/>shell exports"]
        DEFAULT["4. Default<br/>http://localhost:8001"]
    end

    AUTO_CLIENT["AutoMemClient"] --> CMD
    CMD -->|Not found| ENV_FILE
    ENV_FILE -->|Not found| PROCESS_ENV
    PROCESS_ENV -->|Not found| DEFAULT

    ENV_FILE --> DOT_ENV["dotenv.config()<br/>loads AUTOMEM_ENDPOINT<br/>loads AUTOMEM_API_KEY"]

Environment variables (highest priority) — Direct shell environment or .env file or platform-specific MCP server env blocks
~/.claude.json configuration — Used by CLI commands when environment is not set; scans all mcpServers entries for AutoMem config
Default values (lowest priority) — endpoint: http://127.0.0.1:8001, apiKey: undefined

Platform-Specific Configuration Files

Each AI platform stores MCP server configuration differently:

Platform	Configuration File	Format
Claude Desktop	`~/Library/Application Support/Claude/claude_desktop_config.json` (macOS)	JSON
Claude Desktop	`%APPDATA%\Claude\claude_desktop_config.json` (Windows)	JSON
Claude Desktop	`~/.config/Claude/claude_desktop_config.json` (Linux)	JSON
Cursor IDE	`~/.cursor/mcp.json`	JSON
Claude Code	`~/.claude.json`	JSON
Codex	`~/.codex/config.toml`	TOML
OpenClaw	`~/.openclaw/openclaw.json`	JSON

JSON configuration example (Claude Desktop, Cursor, Claude Code):

{
  "mcpServers": {
    "automem": {
      "command": "npx",
      "args": ["-y", "@verygoodplugins/mcp-automem"],
      "env": {
        "AUTOMEM_ENDPOINT": "https://your-service.railway.app",
        "AUTOMEM_API_KEY": "your-api-token"
      }
    }
  }
}

The command and args launch the MCP server in stdio mode. The env block passes configuration to the server process.

TOML configuration example (Codex):

[mcp.servers.automem]
command = "npx"
args = ["-y", "@verygoodplugins/mcp-automem"]

[mcp.servers.automem.env]
AUTOMEM_ENDPOINT = "https://your-service.railway.app"
AUTOMEM_API_KEY = "your-api-token"

Content Size Governance

The store_memory tool enforces content size limits to maintain embedding quality:

Limit Type	Threshold	Behavior
Soft limit	500 characters	Warning; backend may auto-summarize
Hard limit	2000 characters	Rejected immediately with error

MCP Client Validation

The setup wizard validates the endpoint before saving configuration:

URL format check — Ensures AUTOMEM_ENDPOINT is a valid HTTP/HTTPS URL
Health endpoint probe — Sends GET /health request with 2-second timeout
Database status check — Verifies FalkorDB and Qdrant connectivity
Configuration write — Saves validated config to .env

At runtime, if the endpoint is unreachable, queue operations are skipped rather than blocking. This prevents queue operations from stalling when the service is temporarily down.

Advanced Configuration

Logging and Debug

Variable	Type	Required	Default	Description
`LOG_LEVEL`	string	No	`INFO`	Python logging level (`DEBUG`, `INFO`, `WARNING`, `ERROR`)
`FLASK_ENV`	string	No	`production`	Flask environment mode

Testing Configuration

Variable	Type	Required	Default	Description
`AUTOMEM_RUN_INTEGRATION_TESTS`	bool	No	`0`	Enable integration test suite
`AUTOMEM_START_DOCKER`	bool	No	`0`	Auto-start Docker Compose for tests
`AUTOMEM_STOP_DOCKER`	bool	No	`0`	Auto-stop Docker after tests
`AUTOMEM_TEST_BASE_URL`	string	No	`http://localhost:8001`	Test target URL
`AUTOMEM_ALLOW_LIVE`	bool	No	`0`	Allow tests against non-localhost
`AUTOMEM_TEST_API_TOKEN`	string	No	unset	Token for integration tests
`AUTOMEM_TEST_ADMIN_TOKEN`	string	No	unset	Admin token for integration tests

Configuration Validation

AutoMem validates critical configuration at startup with fail-fast behavior for critical misconfigurations and graceful degradation for optional features:

Variable	Validation	Failure Behavior
`FALKORDB_HOST`, `FALKORDB_PORT`	Connection test on startup	503 Service Unavailable
`AUTOMEM_API_TOKEN`	Must be non-empty string	500 Internal Server Error
`ADMIN_API_TOKEN`	Must be non-empty string	500 Internal Server Error
`PORT`	Must be valid port number	Defaults to 8001
`QDRANT_URL`	Connection test if provided	Log warning, continue
`VECTOR_SIZE`	Must match Qdrant collection	Fail fast on mismatch
`EMBEDDING_PROVIDER`	Must be valid option	Defaults to `auto`

The health endpoint at GET /health reflects connection status and can be used to verify configuration:

{
  "status": "healthy",
  "falkordb": "connected",
  "qdrant": "connected",
  "memory_count": 142,
  "enrichment": {
    "status": "running",
    "queue_depth": 0
  },
  "graph": "memories"
}

When Qdrant is unavailable (expected in graph-only mode):

{
  "status": "healthy",
  "falkordb": "connected",
  "qdrant": "unavailable",
  "memory_count": 142
}

Configuration Examples

Minimal local setup (graph-only, no vector search)

AUTOMEM_API_TOKEN=your-token-here
ADMIN_API_TOKEN=your-admin-token-here
FALKORDB_HOST=localhost
FALKORDB_PORT=6379
PORT=8001

Railway production deployment

PORT=8001
FALKORDB_HOST=falkordb.railway.internal
FALKORDB_PORT=6379
FALKORDB_PASSWORD=<generated-by-template>
AUTOMEM_API_TOKEN=<generated-by-template>
ADMIN_API_TOKEN=<generated-by-template>
OPENAI_API_KEY=sk-...
QDRANT_URL=https://your-cluster.cloud.qdrant.io
QDRANT_API_KEY=your-qdrant-key
VECTOR_SIZE=3072

OpenAI-compatible provider (OpenRouter)

EMBEDDING_PROVIDER=openai
OPENAI_API_KEY=sk-or-...
OPENAI_BASE_URL=https://openrouter.ai/api/v1
EMBEDDING_MODEL=text-embedding-3-large
VECTOR_SIZE=3072

Variable resolution troubleshooting (Railway)

flowchart TD
    Start["Variable not resolving?"]

    Start --> Check1{"Using Railway<br/>template deploy?"}
    Check1 -->|Yes| Check2{"Variable shows<br/>literal $\{\{...\}\}?"}
    Check1 -->|No| UseHardcoded["Use hardcoded values<br/>NOT template syntax"]

    Check2 -->|Yes| Bug["Template bug<br/>Report to Railway"]
    Check2 -->|No| Working["Working correctly"]

    UseHardcoded --> SetVars["Set variables:<br/>FALKORDB_HOST=falkordb.railway.internal<br/>FALKORDB_PASSWORD=(copy from falkordb)"]

    SetVars --> Verify["Test connection:<br/>railway run redis-cli ping"]

    Bug --> Workaround["Workaround:<br/>Use hardcoded values"]
    Workaround --> SetVars

Security Checklist

When deploying AutoMem to production, verify these security practices:

Never commit .env files — add to .gitignore
Use strong tokens — minimum 32 bytes of entropy for AUTOMEM_API_TOKEN and ADMIN_API_TOKEN
Rotate secrets periodically — update tokens every 90 days
Restrict ADMIN_API_TOKEN — use a separate, more restricted token for admin operations
Enable FalkorDB authentication — always set FALKORDB_PASSWORD in production
Use HTTPS for external services — Qdrant Cloud, OpenAI, Voyage endpoints must use TLS
Validate environment on startup — review logs for configuration warnings
Use Railway private networking — never expose FalkorDB publicly
Never use query parameter auth (?api_key=) in production — tokens appear in server logs