mirror of
https://github.com/thedotmack/claude-mem
synced 2026-04-25 17:15:04 +02:00
* Initial plan * fix: break infinite summary-retry loop (#1633) Three-part fix: 1. Parser coercion: When LLM returns <observation> tags instead of <summary>, coerce observation content into summary fields (root cause fix) 2. Stronger summary prompt: Add clearer tag requirements with warnings 3. Circuit breaker: Track consecutive summary failures per session, skip further attempts after 3 failures to prevent unbounded prompt growth Agent-Logs-Url: https://github.com/thedotmack/claude-mem/sessions/e345e8ec-bc97-4eaa-94bd-6e951fda8f77 Co-authored-by: thedotmack <683968+thedotmack@users.noreply.github.com> * refactor: extract shared constants for summary mode marker and failure threshold Addresses code review feedback: SUMMARY_MODE_MARKER and MAX_CONSECUTIVE_SUMMARY_FAILURES are now defined once in sdk/prompts.ts and imported by ResponseProcessor and SessionManager. Agent-Logs-Url: https://github.com/thedotmack/claude-mem/sessions/e345e8ec-bc97-4eaa-94bd-6e951fda8f77 Co-authored-by: thedotmack <683968+thedotmack@users.noreply.github.com> * fix: guard summary failure counter on summaryExpected (Greptile P1) The circuit breaker counter previously incremented on any response containing <observation> or <summary> tags — which matches virtually every normal observation response. After 3 observations the breaker would open and permanently block summarization, reproducing the data-loss scenario #1633 was meant to prevent. Gate the increment block on summaryExpected (already computed for parseSummary coercion) so the counter only tracks actual summary attempts. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * test: cover circuit-breaker + apply review polish - Use findLast / at(-1) for last-user-message lookup instead of filter + index (O(1) common case). - Drop redundant `|| 0` fallback — field is required and initialized. - Add comment noting counter is ephemeral by design. - Add ResponseProcessor tests covering: * counter NOT incrementing on normal observation responses (regression guard for the Greptile P1) * counter incrementing when a summary was expected but missing * counter resetting to 0 on successful summary storage Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix: iterate all observation blocks; don't count skip_summary as failure Addresses CodeRabbit review on #2072: - coerceObservationToSummary now iterates all <observation> blocks with a global regex and returns the first block that has title, narrative, or facts. Previously, an empty leading observation would short-circuit and discard populated follow-ups. - Circuit-breaker counter now treats explicit <skip_summary/> as neutral — neither a failure nor a success — so a run that happens to end on a skip doesn't punish the session or mask a prior bad streak. Real failures (no summary, no skip) still increment. - Tests added for both cases. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * test: reference SUMMARY_MODE_MARKER constant instead of hardcoded string Addresses CodeRabbit nitpick: tests should pull the marker from the canonical source so they don't silently drift when the constant is renamed or edited. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix: also coerce observations when <summary> has empty sub-tags When the LLM wraps an empty <summary></summary> around real observation content, the #1360 empty-subtag guard rejects the summary and returns null — which would lose the observation content and resurrect the #1633 retry loop. Fall back to coerceObservationToSummary in that branch too, mirroring the unmatched-<summary> path. Adds a test covering the empty-summary-wraps-observation case and a guard test for empty summary with no observation content. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: thedotmack <683968+thedotmack@users.noreply.github.com> Co-authored-by: Alex Newman <thedotmack@gmail.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
389 lines
15 KiB
TypeScript
389 lines
15 KiB
TypeScript
/**
|
|
* ResponseProcessor: Shared response processing for all agent implementations
|
|
*
|
|
* Responsibility:
|
|
* - Parse observations and summaries from agent responses
|
|
* - Execute atomic database transactions
|
|
* - Orchestrate Chroma sync (fire-and-forget)
|
|
* - Broadcast to SSE clients
|
|
* - Clean up processed messages
|
|
*
|
|
* This module extracts 150+ lines of duplicate code from SDKAgent, GeminiAgent, and OpenRouterAgent.
|
|
*/
|
|
|
|
import { logger } from '../../../utils/logger.js';
|
|
import { parseObservations, parseSummary, type ParsedObservation, type ParsedSummary } from '../../../sdk/parser.js';
|
|
import { SUMMARY_MODE_MARKER, MAX_CONSECUTIVE_SUMMARY_FAILURES } from '../../../sdk/prompts.js';
|
|
import { updateCursorContextForProject } from '../../integrations/CursorHooksInstaller.js';
|
|
import { updateFolderClaudeMdFiles } from '../../../utils/claude-md-utils.js';
|
|
import { getWorkerPort } from '../../../shared/worker-utils.js';
|
|
import { SettingsDefaultsManager } from '../../../shared/SettingsDefaultsManager.js';
|
|
import { USER_SETTINGS_PATH } from '../../../shared/paths.js';
|
|
import type { ActiveSession } from '../../worker-types.js';
|
|
import type { DatabaseManager } from '../DatabaseManager.js';
|
|
import type { SessionManager } from '../SessionManager.js';
|
|
import type { WorkerRef, StorageResult } from './types.js';
|
|
import { broadcastObservation, broadcastSummary } from './ObservationBroadcaster.js';
|
|
import { cleanupProcessedMessages } from './SessionCleanupHelper.js';
|
|
|
|
/**
|
|
* Process agent response text (parse XML, save to database, sync to Chroma, broadcast SSE)
|
|
*
|
|
* This is the unified response processor that handles:
|
|
* 1. Adding response to conversation history (for provider interop)
|
|
* 2. Parsing observations and summaries from XML
|
|
* 3. Atomic database transaction to store observations + summary
|
|
* 4. Async Chroma sync (fire-and-forget, failures are non-critical)
|
|
* 5. SSE broadcast to web UI clients
|
|
* 6. Session cleanup
|
|
*
|
|
* @param text - Response text from the agent
|
|
* @param session - Active session being processed
|
|
* @param dbManager - Database manager for storage operations
|
|
* @param sessionManager - Session manager for message tracking
|
|
* @param worker - Worker reference for SSE broadcasting (optional)
|
|
* @param discoveryTokens - Token cost delta for this response
|
|
* @param originalTimestamp - Original epoch when message was queued (for accurate timestamps)
|
|
* @param agentName - Name of the agent for logging (e.g., 'SDK', 'Gemini', 'OpenRouter')
|
|
*/
|
|
export async function processAgentResponse(
|
|
text: string,
|
|
session: ActiveSession,
|
|
dbManager: DatabaseManager,
|
|
sessionManager: SessionManager,
|
|
worker: WorkerRef | undefined,
|
|
discoveryTokens: number,
|
|
originalTimestamp: number | null,
|
|
agentName: string,
|
|
projectRoot?: string,
|
|
modelId?: string
|
|
): Promise<void> {
|
|
// Track generator activity for stale detection (Issue #1099)
|
|
session.lastGeneratorActivity = Date.now();
|
|
|
|
// Add assistant response to shared conversation history for provider interop
|
|
if (text) {
|
|
session.conversationHistory.push({ role: 'assistant', content: text });
|
|
}
|
|
|
|
// Parse observations and summary
|
|
const observations = parseObservations(text, session.contentSessionId);
|
|
|
|
// Detect whether the most recent prompt was a summary request.
|
|
// If so, enable observation-to-summary coercion to prevent the infinite
|
|
// retry loop described in #1633.
|
|
const lastMessage = session.conversationHistory.at(-1);
|
|
const lastUserMessage = lastMessage?.role === 'user'
|
|
? lastMessage
|
|
: session.conversationHistory.findLast(m => m.role === 'user') ?? null;
|
|
const summaryExpected = lastUserMessage?.content?.includes(SUMMARY_MODE_MARKER) ?? false;
|
|
|
|
const summary = parseSummary(text, session.sessionDbId, summaryExpected);
|
|
|
|
if (
|
|
text.trim() &&
|
|
observations.length === 0 &&
|
|
!summary &&
|
|
!/<observation>|<summary>|<skip_summary\b/.test(text)
|
|
) {
|
|
const preview = text.length > 200 ? `${text.slice(0, 200)}...` : text;
|
|
logger.warn('PARSER', `${agentName} returned non-XML response; observation content was discarded`, {
|
|
sessionId: session.sessionDbId,
|
|
preview
|
|
});
|
|
}
|
|
|
|
// Convert nullable fields to empty strings for storeSummary (if summary exists)
|
|
const summaryForStore = normalizeSummaryForStorage(summary);
|
|
|
|
// Get session store for atomic transaction
|
|
const sessionStore = dbManager.getSessionStore();
|
|
|
|
// CRITICAL: Must use memorySessionId (not contentSessionId) for FK constraint
|
|
if (!session.memorySessionId) {
|
|
throw new Error('Cannot store observations: memorySessionId not yet captured');
|
|
}
|
|
|
|
// SAFETY NET (Issue #846 / Multi-terminal FK fix):
|
|
// The PRIMARY fix is in SDKAgent.ts where ensureMemorySessionIdRegistered() is called
|
|
// immediately when the SDK returns a memory_session_id. This call is a defensive safety net
|
|
// in case the DB was somehow not updated (race condition, crash, etc.).
|
|
// In multi-terminal scenarios, createSDKSession() now resets memory_session_id to NULL
|
|
// for each new generator, ensuring clean isolation.
|
|
sessionStore.ensureMemorySessionIdRegistered(session.sessionDbId, session.memorySessionId);
|
|
|
|
// Log pre-storage with session ID chain for verification
|
|
logger.info('DB', `STORING | sessionDbId=${session.sessionDbId} | memorySessionId=${session.memorySessionId} | obsCount=${observations.length} | hasSummary=${!!summaryForStore}`, {
|
|
sessionId: session.sessionDbId,
|
|
memorySessionId: session.memorySessionId
|
|
});
|
|
|
|
// ATOMIC TRANSACTION: Store observations + summary ONCE
|
|
// Messages are already deleted from queue on claim, so no completion tracking needed
|
|
const result = sessionStore.storeObservations(
|
|
session.memorySessionId,
|
|
session.project,
|
|
observations,
|
|
summaryForStore,
|
|
session.lastPromptNumber,
|
|
discoveryTokens,
|
|
originalTimestamp ?? undefined,
|
|
modelId
|
|
);
|
|
|
|
// Log storage result with IDs for end-to-end traceability
|
|
logger.info('DB', `STORED | sessionDbId=${session.sessionDbId} | memorySessionId=${session.memorySessionId} | obsCount=${result.observationIds.length} | obsIds=[${result.observationIds.join(',')}] | summaryId=${result.summaryId || 'none'}`, {
|
|
sessionId: session.sessionDbId,
|
|
memorySessionId: session.memorySessionId
|
|
});
|
|
|
|
// Track whether a summary record was stored so the status endpoint can expose this
|
|
// to the Stop hook for silent-summary-loss detection (#1633)
|
|
session.lastSummaryStored = result.summaryId !== null;
|
|
|
|
// Circuit breaker: track consecutive summary failures (#1633).
|
|
// Only evaluate when a summary was actually expected (summarize message was sent).
|
|
// Without this guard, the counter would increment on every normal observation
|
|
// response, tripping the breaker after 3 observations and permanently blocking
|
|
// summarization — reproducing the data-loss scenario this fix is meant to prevent.
|
|
if (summaryExpected) {
|
|
const skippedIntentionally = /<skip_summary\b/.test(text);
|
|
if (summaryForStore !== null) {
|
|
// Summary was present in the response — reset the failure counter
|
|
session.consecutiveSummaryFailures = 0;
|
|
} else if (skippedIntentionally) {
|
|
// Explicit <skip_summary/> is a valid protocol response — neither success
|
|
// nor failure. Leave the counter unchanged so we don't mask a bad run that
|
|
// happens to end on a skip, but also don't punish intentional skips.
|
|
} else {
|
|
// Summary was expected but none was stored — count as failure
|
|
session.consecutiveSummaryFailures += 1;
|
|
if (session.consecutiveSummaryFailures >= MAX_CONSECUTIVE_SUMMARY_FAILURES) {
|
|
logger.error('SESSION', `Circuit breaker: ${session.consecutiveSummaryFailures} consecutive summary failures — further summarize requests will be skipped (#1633)`, {
|
|
sessionId: session.sessionDbId,
|
|
contentSessionId: session.contentSessionId
|
|
});
|
|
}
|
|
}
|
|
}
|
|
|
|
// CLAIM-CONFIRM: Now that storage succeeded, confirm all processing messages (delete from queue)
|
|
// This is the critical step that prevents message loss on generator crash
|
|
const pendingStore = sessionManager.getPendingMessageStore();
|
|
for (const messageId of session.processingMessageIds) {
|
|
pendingStore.confirmProcessed(messageId);
|
|
}
|
|
if (session.processingMessageIds.length > 0) {
|
|
logger.debug('QUEUE', `CONFIRMED_BATCH | sessionDbId=${session.sessionDbId} | count=${session.processingMessageIds.length} | ids=[${session.processingMessageIds.join(',')}]`);
|
|
}
|
|
// Clear the tracking array after confirmation
|
|
session.processingMessageIds = [];
|
|
|
|
// AFTER transaction commits - async operations (can fail safely without data loss)
|
|
await syncAndBroadcastObservations(
|
|
observations,
|
|
result,
|
|
session,
|
|
dbManager,
|
|
worker,
|
|
discoveryTokens,
|
|
agentName,
|
|
projectRoot
|
|
);
|
|
|
|
// Sync and broadcast summary if present
|
|
await syncAndBroadcastSummary(
|
|
summary,
|
|
summaryForStore,
|
|
result,
|
|
session,
|
|
dbManager,
|
|
worker,
|
|
discoveryTokens,
|
|
agentName
|
|
);
|
|
|
|
// Clean up session state
|
|
cleanupProcessedMessages(session, worker);
|
|
}
|
|
|
|
/**
|
|
* Normalize summary for storage (convert null fields to empty strings)
|
|
*/
|
|
function normalizeSummaryForStorage(summary: ParsedSummary | null): {
|
|
request: string;
|
|
investigated: string;
|
|
learned: string;
|
|
completed: string;
|
|
next_steps: string;
|
|
notes: string | null;
|
|
} | null {
|
|
if (!summary) return null;
|
|
|
|
return {
|
|
request: summary.request || '',
|
|
investigated: summary.investigated || '',
|
|
learned: summary.learned || '',
|
|
completed: summary.completed || '',
|
|
next_steps: summary.next_steps || '',
|
|
notes: summary.notes
|
|
};
|
|
}
|
|
|
|
/**
|
|
* Sync observations to Chroma and broadcast to SSE clients
|
|
*/
|
|
async function syncAndBroadcastObservations(
|
|
observations: ParsedObservation[],
|
|
result: StorageResult,
|
|
session: ActiveSession,
|
|
dbManager: DatabaseManager,
|
|
worker: WorkerRef | undefined,
|
|
discoveryTokens: number,
|
|
agentName: string,
|
|
projectRoot?: string
|
|
): Promise<void> {
|
|
for (let i = 0; i < observations.length; i++) {
|
|
const obsId = result.observationIds[i];
|
|
const obs = observations[i];
|
|
const chromaStart = Date.now();
|
|
|
|
// Sync to Chroma (fire-and-forget, skipped if Chroma is disabled)
|
|
dbManager.getChromaSync()?.syncObservation(
|
|
obsId,
|
|
session.contentSessionId,
|
|
session.project,
|
|
obs,
|
|
session.lastPromptNumber,
|
|
result.createdAtEpoch,
|
|
discoveryTokens
|
|
).then(() => {
|
|
const chromaDuration = Date.now() - chromaStart;
|
|
logger.debug('CHROMA', 'Observation synced', {
|
|
obsId,
|
|
duration: `${chromaDuration}ms`,
|
|
type: obs.type,
|
|
title: obs.title || '(untitled)'
|
|
});
|
|
}).catch((error) => {
|
|
logger.error('CHROMA', `${agentName} chroma sync failed, continuing without vector search`, {
|
|
obsId,
|
|
type: obs.type,
|
|
title: obs.title || '(untitled)'
|
|
}, error);
|
|
});
|
|
|
|
// Broadcast to SSE clients (for web UI)
|
|
// BUGFIX: Use obs.files_read and obs.files_modified (not obs.files)
|
|
broadcastObservation(worker, {
|
|
id: obsId,
|
|
memory_session_id: session.memorySessionId,
|
|
session_id: session.contentSessionId,
|
|
platform_source: session.platformSource,
|
|
type: obs.type,
|
|
title: obs.title,
|
|
subtitle: obs.subtitle,
|
|
text: null, // text field is not in ParsedObservation
|
|
narrative: obs.narrative || null,
|
|
facts: JSON.stringify(obs.facts || []),
|
|
concepts: JSON.stringify(obs.concepts || []),
|
|
files_read: JSON.stringify(obs.files_read || []),
|
|
files_modified: JSON.stringify(obs.files_modified || []),
|
|
project: session.project,
|
|
prompt_number: session.lastPromptNumber,
|
|
created_at_epoch: result.createdAtEpoch
|
|
});
|
|
}
|
|
|
|
// Update folder CLAUDE.md files for touched folders (fire-and-forget)
|
|
// This runs per-observation batch to ensure folders are updated as work happens
|
|
// Only runs if CLAUDE_MEM_FOLDER_CLAUDEMD_ENABLED is true (default: false)
|
|
const settings = SettingsDefaultsManager.loadFromFile(USER_SETTINGS_PATH);
|
|
// Handle both string 'true' and boolean true from JSON settings
|
|
const settingValue = settings.CLAUDE_MEM_FOLDER_CLAUDEMD_ENABLED;
|
|
const folderClaudeMdEnabled = settingValue === 'true' || settingValue === true;
|
|
|
|
if (folderClaudeMdEnabled) {
|
|
const allFilePaths: string[] = [];
|
|
for (const obs of observations) {
|
|
allFilePaths.push(...(obs.files_modified || []));
|
|
allFilePaths.push(...(obs.files_read || []));
|
|
}
|
|
|
|
if (allFilePaths.length > 0) {
|
|
updateFolderClaudeMdFiles(
|
|
allFilePaths,
|
|
session.project,
|
|
getWorkerPort(),
|
|
projectRoot
|
|
).catch(error => {
|
|
logger.warn('FOLDER_INDEX', 'CLAUDE.md update failed (non-critical)', { project: session.project }, error as Error);
|
|
});
|
|
}
|
|
}
|
|
}
|
|
|
|
/**
|
|
* Sync summary to Chroma and broadcast to SSE clients
|
|
*/
|
|
async function syncAndBroadcastSummary(
|
|
summary: ParsedSummary | null,
|
|
summaryForStore: { request: string; investigated: string; learned: string; completed: string; next_steps: string; notes: string | null } | null,
|
|
result: StorageResult,
|
|
session: ActiveSession,
|
|
dbManager: DatabaseManager,
|
|
worker: WorkerRef | undefined,
|
|
discoveryTokens: number,
|
|
agentName: string
|
|
): Promise<void> {
|
|
if (!summaryForStore || !result.summaryId) {
|
|
return;
|
|
}
|
|
|
|
const chromaStart = Date.now();
|
|
|
|
// Sync to Chroma (fire-and-forget, skipped if Chroma is disabled)
|
|
dbManager.getChromaSync()?.syncSummary(
|
|
result.summaryId,
|
|
session.contentSessionId,
|
|
session.project,
|
|
summaryForStore,
|
|
session.lastPromptNumber,
|
|
result.createdAtEpoch,
|
|
discoveryTokens
|
|
).then(() => {
|
|
const chromaDuration = Date.now() - chromaStart;
|
|
logger.debug('CHROMA', 'Summary synced', {
|
|
summaryId: result.summaryId,
|
|
duration: `${chromaDuration}ms`,
|
|
request: summaryForStore.request || '(no request)'
|
|
});
|
|
}).catch((error) => {
|
|
logger.error('CHROMA', `${agentName} chroma sync failed, continuing without vector search`, {
|
|
summaryId: result.summaryId,
|
|
request: summaryForStore.request || '(no request)'
|
|
}, error);
|
|
});
|
|
|
|
// Broadcast to SSE clients (for web UI)
|
|
broadcastSummary(worker, {
|
|
id: result.summaryId,
|
|
session_id: session.contentSessionId,
|
|
platform_source: session.platformSource,
|
|
request: summaryForStore!.request,
|
|
investigated: summaryForStore!.investigated,
|
|
learned: summaryForStore!.learned,
|
|
completed: summaryForStore!.completed,
|
|
next_steps: summaryForStore!.next_steps,
|
|
notes: summaryForStore!.notes,
|
|
project: session.project,
|
|
prompt_number: session.lastPromptNumber,
|
|
created_at_epoch: result.createdAtEpoch
|
|
});
|
|
|
|
// Update Cursor context file for registered projects (fire-and-forget)
|
|
updateCursorContextForProject(session.project, getWorkerPort()).catch(error => {
|
|
logger.warn('CURSOR', 'Context update failed (non-critical)', { project: session.project }, error as Error);
|
|
});
|
|
}
|