5.6 KiB
CLAUDE.md
This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
See the root CLAUDE.md for overall architecture, service roles, and the dual-store memory model.
Running This Service
npm run memory # From repo root (node src/index.js)
npm -w packages/memory-service run dev # With --watch
Default port: 3002. Requires Qdrant and the embedding-service to be reachable on startup.
SQLite Schema
src/db/schema.js is the source of truth for the data model. Key schema facts:
sessionsandepisodesare linked by FK with cascade delete — deleting a session removes all its episodes automatically.episodes_ftsis an FTS5 virtual table that mirrorsuser_messageandai_response. It is kept in sync via SQL triggers on INSERT/UPDATE/DELETE. On service startup, the FTS index is fully rebuilt from live episode data.- Several columns (
sessions.name,sessions.project_id,projects.isolated, etc.) were added as migrations usingALTER TABLEwrapped in individual try-catch blocks. Failures are silently swallowed — if a column already exists, the alter fails and the service continues. Theidx_summaries_projectindex is defined twice (benign duplicate). summariesrows withsession_id IS NULLand aproject_idrepresent project-level overviews, not session summaries. This distinction is howGET /projects/:id/overviewworks.
Async Pipeline: Episode Creation
POST /episodes returns a 201 as soon as the SQLite insert succeeds. Two background tasks run after without blocking the response:
- Embedding — Fetches a vector from embedding-service, stores to Qdrant with
{sessionId, createdAt}as payload metadata. - Entity extraction — Sends the episode text to Ollama (
qwen2.5:3b, temp 0.1, 200 tokens) and upserts any recognized entities to both SQLite and Qdrant.
Both tasks catch and log errors silently. An episode can exist in SQLite with no corresponding Qdrant point if either step fails.
Entity Extraction Details
src/entities/extraction.js:
- Fetches the last 20 known entities from SQLite before prompting the model, so the prompt can ask for name/type consistency with existing entries.
- Recognized types:
person,place,project,technology,concept,organization— anything else is discarded. - Ignores a hardcoded list of low-value names (
hello,thanks,good morning, etc.). - Extracts JSON using a regex (
{...}) applied to raw model output, so surrounding prose doesn't break parsing. - Entity upsert uses
ON CONFLICT(name, type) DO UPDATE— preserves existingnotesif the new extraction returns null (COALESCE(entities.notes, excluded.notes)). - After upsert, embeds each entity as
"${name} (${type}): ${notes}"and stores to Qdrant withprojectIdin the payload for project-scoped filtering.
Summarization Strategy
src/summarization/project.js:
- Preferred path: generate a project overview from existing session-level summaries (higher-level abstraction, shorter input).
- Fallback path: if no session summaries exist, summarize raw episodes directly (up to
SUMMARIES.MAX_PROJECT_EPISODE_LIMIT). - Both paths truncate input at
SUMMARIES.MAX_SUMMARY_CHARS(30,000 chars) by slicing from the end (most recent content wins). - Strips ChatML tokens from the Ollama response (
<|im_start|>,<|im_end|>). - Uses temp 0.2 and
num_predict 1200.
Known Quirk: getRecentEpisodes
src/episodic/index.js getRecentEpisodes(sessionId, limit) has a parameter mismatch — the SQLite query binds only limit, not sessionId, so it returns recent episodes across all sessions. Orchestration-service uses getEpisodesBySession() (the paginated route) instead, so this bug is not visible in normal operation. Don't rely on getRecentEpisodes when you need session-scoped results.
Qdrant Client
src/semantic/index.js creates the Qdrant client lazily on first use and reuses it. All three collections (episodes, entities, summaries) are created at startup if missing. There is no connection health check — if Qdrant is unreachable, semantic operations throw at call time.
API Endpoints Quick Reference
| Method | Path | Notes |
|---|---|---|
| GET | /health |
Static response, no dependency checks |
| GET/POST | /sessions |
POST requires externalId; duplicate → 409 |
| GET/PATCH | /sessions/by-external/:externalId |
PATCH accepts name, projectId |
| DELETE | /sessions/by-external/:externalId |
Cascades to episodes, summaries, relationships |
| GET/POST | /episodes |
POST triggers async embedding + entity extraction |
| GET | /episodes/search |
FTS5 search; route must precede /:id |
| GET | /sessions/:id/episodes |
Paginated, ordered created_at DESC |
| DELETE | /episodes/:id |
Removes from SQLite + async Qdrant delete |
| POST | /entities |
Upsert by (name, type) |
| GET | /entities/by-type/:type |
All entities of given type |
| GET/DELETE | /entities/:id |
|
| POST | /relationships |
Upsert by (fromId, toId, label); conflict = no-op |
| GET | /entities/:id/relationships |
Outbound only |
| DELETE | /relationships |
Body: fromId, toId, label |
| GET/POST | /projects |
POST requires non-empty name |
| GET/PATCH/DELETE | /projects/:id |
|
| POST | /projects/:id/summarize |
On-demand overview generation; 422 if no data |
| GET | /projects/:id/overview |
Returns null (not 404) if no overview exists |
| GET | /projects/:id/summaries |
All summaries for project |
| POST | /summaries |
Requires content + at least one of sessionId/projectId |
| GET | /sessions/:id/summaries |
|
| PATCH/DELETE | /summaries/:id |