worldmonitor

mirror of https://github.com/koala73/worldmonitor.git synced 2026-04-25 17:14:57 +02:00

Author	SHA1	Message	Date
Sebastien Melki	1db05e6caa	feat(usage): per-request Axiom telemetry pipeline (gateway + upstream attribution) (#3403 ) * feat(gateway): thread Vercel Edge ctx through createDomainGateway (#3381) PR-0 of the Axiom usage-telemetry stack. Pure infra change: no telemetry emission yet, only the signature plumbing required for ctx.waitUntil to exist on the hot path. - createDomainGateway returns (req, ctx) instead of (req) - rewriteToSebuf propagates ctx to its target gateway - 5 alias callsites updated to pass ctx through - ~30 [rpc].ts callsites unchanged (export default createDomainGateway(...)) Pattern reference: api/notification-channels.ts:166. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat(usage): pure UsageIdentity resolver + Axiom emit primitives (#3381) server/_shared/usage-identity.ts - buildUsageIdentity: pure function, consumes already-resolved gateway state. - Static ENTERPRISE_KEY_TO_CUSTOMER map (explicit, reviewable in code). - Does not re-verify JWTs or re-validate API keys. server/_shared/usage.ts - buildRequestEvent / buildUpstreamEvent: allowlisted-primitive builders only. Never accept Request/Response — additive field leaks become structurally impossible. - emitUsageEvents → ctx.waitUntil(sendToAxiom). Direct fetch, 1.5s timeout, no retry, gated by USAGE_TELEMETRY=1 and AXIOM_API_TOKEN. - Sliding-window circuit breaker (5% over 5min, min 20 samples). Trips with one structured console.error; subsequent drops are 1%-sampled console.warn. - Header derivers reuse Vercel/CF headers for request_id, region, country, reqBytes; ua_hash null unless USAGE_UA_PEPPER is set (no stable fingerprinting). - Dev-only x-usage-telemetry response header for 2-second debugging. server/_shared/auth-session.ts - New resolveClerkSession returning { userId, orgId } in one JWT verify so customer_id can be Clerk org id without a second pass. resolveSessionUserId kept as back-compat wrapper. No emission wiring yet — that lands in the next commit (gateway request event + 403 + 429). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat(gateway): emit Axiom request events on every return path (#3381) Wires the request-event side of the Axiom usage-telemetry stack. Behind USAGE_TELEMETRY=1 — no-op when the env var is unset. Emit points (each builds identity from accumulated gateway state): - origin_403 disallowed origin → reason=origin_403 - API access subscription required (403) - legacy bearer 401 / 403 / 401-without-bearer - entitlement check fail-through - endpoint rate-limit 429 → reason=rate_limit_429 - global rate-limit 429 → reason=rate_limit_429 - 405 method not allowed - 404 not found - 304 etag match (resolved cache tier) - 200 GET with body (resolved cache tier, real res_bytes) - streaming / non-GET-200 final return (res_bytes best-effort) Identity inputs (UsageIdentityInput): - sessionUserId / clerkOrgId from new resolveClerkSession (one JWT verify) - isUserApiKey + userApiKeyCustomerRef from validateUserApiKey result - enterpriseApiKey when keyCheck.valid + non-wm_ wmKey present - widgetKey from x-widget-key header (best-effort) - tier captured opportunistically from existing getEntitlements calls Header derivers reuse Vercel/CF metadata (x-vercel-id, x-vercel-ip-country, cf-ipcountry, content-length, sentry-trace) — no new geo lookup, no new crypto on the hot path. ua_hash null unless USAGE_UA_PEPPER is set. Dev-only x-usage-telemetry response header (ok \| degraded \| off) attached on the response paths for 2-second debugging in non-production. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat(usage): upstream events via implicit request scope (#3381) Closes the upstream-attribution side of the Axiom usage-telemetry stack without requiring leaf-handler changes (per koala's review). server/_shared/usage.ts - AsyncLocalStorage-backed UsageScope: gateway sets it once per request, fetch helpers read from it lazily. Defensive import — if the runtime rejects node:async_hooks, scope helpers degrade to no-ops and the request event is unaffected. - runWithUsageScope(scope, fn) / getUsageScope() exports. server/gateway.ts - Wraps matchedHandler in runWithUsageScope({ ctx, requestId, customerId, route, tier }) so deep fetchers can attribute upstream calls without threading state through every handler signature. server/_shared/redis.ts - cachedFetchJsonWithMeta accepts opts.usage = { provider, operation? }. Only the provider label is required to opt in — request_id / customer_id / route / tier flow implicitly from UsageScope. - Emits on the fresh path only (cache hits don't emit; the inbound request event already records cache_status). - cache_status correctly distinguishes 'miss' vs 'neg-sentinel' by construction, matching NEG_SENTINEL handling. - Telemetry never throws — failures are swallowed in the lazy-import catch, sink itself short-circuits on USAGE_TELEMETRY=0. server/_shared/fetch-json.ts - New optional { provider, operation } in FetchJsonOptions. Same opt-in-by-provider model as cachedFetchJsonWithMeta. Auto-derives host from URL. Reads body via .text() so response_bytes is recorded (best-effort; chunked responses still report 0). Net result: any handler that uses fetchJson or cachedFetchJsonWithMeta gets full per-customer upstream attribution by adding two fields to the options bag. No signature changes anywhere else. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(gateway): address round-1 codex feedback on usage telemetry - ctx is now optional on the createDomainGateway handler signature so direct callers (tests, non-Vercel paths) no longer crash on emit - legacy premium bearer-token routes (resilience, shipping-v2) propagate session.userId into the usage accumulator so successful requests are attributed instead of emitting as anon - after checkEntitlement allows a tier-gated route, re-read entitlements (Redis-cached + in-flight coalesced) to populate usage.tier so analyze-stock & co. emit the correct tier rather than 0 - domain extraction now skips a leading vN segment, so /api/v2/shipping/* records domain="shipping" instead of "v2" Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * test(usage): assert telemetry payload + identity resolver + operator guide - tests/usage-telemetry-emission.test.mts stubs globalThis.fetch to capture the Axiom ingest POST body and asserts the four review-flagged fields end-to-end through the gateway: domain on /api/v2/<svc>/* (was "v2"), customer_id on legacy premium bearer success (was null/anon), tier on entitlement-gated success via the Convex fallback path (was 0), plus a ctx-optional regression guard - server/__tests__/usage-identity.test.ts unit-tests the pure buildUsageIdentity() resolver across every auth_kind branch, tier coercion, and the secret-handling invariant (raw enterprise key never lands in any output field) - docs/architecture/usage-telemetry.md is the operator + dev guide: field reference, architecture, configuration, failure modes, local workflow, eight Axiom APL recipes, and runbooks for adding fields / new gateway return paths Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * test(usage): make recorder.settled robust to nested waitUntil Promise.all(pending) snapshotted the array at call time, missing the inner ctx.waitUntil(sendToAxiom(...)) that emitUsageEvents pushes after the outer drain begins. Tests passed only because the fetch spy resolved in an earlier microtask tick. Replace with a quiescence loop so the helper survives any future async in the emit path. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * chore: trigger preview * fix(usage): address koala #3403 review — collapse nested waitUntil, widget-key validation, neg-sentinel status, auth_* reasons P1 - Collapse nested ctx.waitUntil at all 3 emit sites (gateway.ts emitRequest, fetch-json.ts, redis.ts emitUpstreamFromHook). Export sendToAxiom and call it directly inside the outer waitUntil so Edge runtimes don't drop the delivery promise after the response phase. - Validate X-Widget-Key against WIDGET_AGENT_KEY before populating usage.widgetKey so unauthenticated callers can't spoof per-customer attribution. P2 - Emit on OPTIONS preflight (new 'preflight' RequestReason). - Gate cachedFetchJsonWithMeta upstreamStatus=200 on result != null so the neg-sentinel branch no longer reports as a successful upstream call. - Extend RequestReason with auth_401/auth_403/tier_403 and replace reason:'ok' on every auth/tier-rejection emit path. - Replace 32-bit FNV-1a with a two-round XOR-folded 64-bit variant in hashKeySync (collision space matters once widget-key adoption grows). Verification - tests/usage-telemetry-emission.test.mts — 6/6 - tests/premium-stock-gateway.test.mts + tests/gateway-cdn-origin-policy.test.mts — 15/15 - npx vitest run server/__tests__/usage-identity.test.ts — 13/13 - npx tsc --noEmit clean Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * chore: trigger preview rebuild for AXIOM_API_TOKEN * chore(usage): note Axiom region in ingest URL comment Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * debug(usage): unconditional logs in sendToAxiom for preview troubleshooting Temporary — to be reverted once Axiom delivery is confirmed working in preview. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(usage): add 'live' cache tier + revert preview debug logs - Sync UsageCacheTier with the local CacheTier in gateway.ts (main added 'live' in PR #3402 — synthetic merge with main was failing typecheck:api). - Revert temporary unconditional debug logs in sendToAxiom now that Axiom delivery is verified end-to-end on preview (event landed with all fields populated, including the new auth_401 reason from the koala #3403 fix). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-25 18:10:51 +03:00
Elie Habib	5c955691a9	feat(energy-atlas): live tanker map layer + contract (parity PR 3, plan U7-U8) (#3402 ) * feat(energy-atlas): live tanker map layer + contract (PR 3, plan U7-U8) Lands the third and final parity-push surface — per-vessel tanker positions inside chokepoint bounding boxes, refreshed every 60s. Closes the visual gap with peer reference energy-intel sites for the live AIS tanker view. Per docs/plans/2026-04-25-003-feat-energy-parity-pushup-plan.md PR 3. Codex-approved through 8 review rounds against origin/main @ `050073354`. U7 — Contract changes (relay + handler + proto + gateway + rate-limit + test): - scripts/ais-relay.cjs: parallel `tankerReports` Map populated for AIS ship type 80-89 (tanker class) per ITU-R M.1371. SEPARATE from the existing `candidateReports` Map (military-only) so the existing military-detection consumer's contract stays unchanged. Snapshot endpoint extended to accept `bbox=swLat,swLon,neLat,neLon` + `tankers=true` query params, with bbox-filtering applied server-side. Tanker reports cleaned up on the same retention window as candidate reports; capped at 200 per response (10× headroom for global storage). - proto/worldmonitor/maritime/v1/{get_,}vessel_snapshot.proto: - new `bool include_tankers = 6` request field - new `repeated SnapshotCandidateReport tanker_reports = 7` response field (reuses existing message shape; parallel to candidate_reports) - server/worldmonitor/maritime/v1/get-vessel-snapshot.ts: REPLACES the prior 5-minute `with\|without` cache with a request-keyed cache — (includeCandidates, includeTankers, quantizedBbox) — at 60s TTL for the live-tanker path and 5min TTL for the existing density/disruption consumers. Also adds 1° bbox quantization for cache-key reuse and a 10° max-bbox guard (BboxTooLargeError) to prevent malicious clients from pulling all tankers through one query. - server/gateway.ts: NEW `'live'` cache tier. CacheTier union extended; TIER_HEADERS + TIER_CDN_CACHE both gain entries with `s-maxage=60, stale-while-revalidate=60`. RPC_CACHE_TIER maps the maritime endpoint from `'no-store'` to `'live'` so the CDN absorbs concurrent identical requests across all viewers (without this, N viewers × 6 chokepoints hit AISStream upstream linearly). - server/_shared/rate-limit.ts: ENDPOINT_RATE_POLICIES entry for the maritime endpoint at 60 req/min/IP — enough headroom for one user's 6-chokepoint tab plus refreshes; flags only true scrape-class traffic. - tests/route-cache-tier.test.mjs: regex extended to include `live` so the every-route-has-an-explicit-tier check still recognises the new mapping. Without this, the new tier would silently drop the maritime route from the validator's route map. U8 — LiveTankersLayer consumer: - src/services/live-tankers.ts: per-chokepoint fetcher with 60s in-memory cache. Promise.allSettled — never .all — so one chokepoint failing doesn't blank the whole layer (failed zones serve last-known data). Sources bbox centroids from src/config/chokepoint-registry.ts (CORRECT location — server/.../_chokepoint-ids.ts strips lat/lon). Default chokepoint set: hormuz_strait, suez, bab_el_mandeb, malacca_strait, panama, bosphorus. - src/components/DeckGLMap.ts: new `createLiveTankersLayer()` ScatterplotLayer styled by speed (anchored amber when speed < 0.5 kn, underway cyan, unknown gray); new `loadLiveTankers()` async loader with abort-controller cancellation. Layer instantiated when `mapLayers.liveTankers && this.liveTankers.length > 0`. - src/config/map-layer-definitions.ts: `LayerDefinition` for `liveTankers` with `renderers: ['flat'], deckGLOnly: true` (matches existing storageFacilities/fuelShortages pattern). Added to `VARIANT_LAYER_ORDER.energy` near `ais` so getLayersForVariant() and sanitizeLayersForVariant() include it on the energy variant — without this addition the layer would be silently stripped even when toggled on. - src/types/index.ts: `liveTankers?: boolean` on the MapLayers union. - src/config/panels.ts: ENERGY_MAP_LAYERS + ENERGY_MOBILE_MAP_LAYERS both gain `liveTankers: true`. Default `false` everywhere else. - src/services/maritime/index.ts: existing snapshot consumer pinned to `includeTankers: false` to satisfy the proto's new required field; preserves identical behavior for the AIS-density / military-detection surfaces. Tests: - npm run typecheck clean. - 5 unit tests in tests/live-tankers-service.test.mjs cover the default chokepoint set (rejects ids that aren't in CHOKEPOINT_REGISTRY), the 60s cache TTL pin (must match gateway 'live' tier s-maxage), and bbox derivation (±2° padding, total span under the 10° handler guard). - tests/route-cache-tier.test.mjs continues to pass after the regex extension; the new maritime tier is correctly extracted. Defense in depth: - THREE-layer cache (CDN 'live' tier → handler bbox-keyed 60s → service in-memory 60s) means concurrent users hit the relay sub-linearly. - Server-side 200-vessel cap on tanker_reports + client-side cap; protects layer render perf even on a runaway relay payload. - Bbox-size guard (10° max) prevents a single global-bbox query from exfiltrating every tanker. - Per-IP rate limit at 60/min covers normal use; flags scrape-class only. - Existing military-detection contract preserved: `candidate_reports` field semantics unchanged; consumers self-select via include_tankers vs include_candidates rather than the response field changing meaning. * fix(energy-atlas): wire LiveTankers loop + 400 bbox-range guard (PR3 review) Three findings from review of #3402: P1 — loadLiveTankers() was never called (DeckGLMap.ts:2999): - Add ensureLiveTankersLoop() / stopLiveTankersLoop() helpers paired with the layer-enabled / layer-disabled branches in updateLayers(). The ensure helper kicks an immediate load + a 60s setInterval; idempotent so calling it on every layers update is safe. - Wire stopLiveTankersLoop() into destroy() and into the layer-disabled branch so we don't hammer the relay when the layer is off. - Layer factory now runs only when liveTankers.length > 0; ensureLoop fires on every observed-enabled tick so first-paint kicks the load even before the first tanker arrives. P1 — bbox lat/lon range guard (get-vessel-snapshot.ts:253): - Out-of-range bboxes (e.g. ne_lat=200) previously passed the size guard (200-195=5° < 10°) but failed at the relay, which silently drops the bbox param and returns a global capped subset — making the layer appear to "work" with stale phantom data. - Add isValidLatLon() check inside extractAndValidateBbox(): every corner must satisfy [-90, 90] / [-180, 180] before the size guard runs. Failure throws BboxValidationError. P2 — BboxTooLargeError surfaced as 500 instead of 400: - server/error-mapper.ts maps errors to HTTP status by checking `'statusCode' in error`. The previous BboxTooLargeError extended Error without that property, so the mapper fell through to "unhandled error" → 500. - Rename to BboxValidationError, add `readonly statusCode = 400`. Mapper now surfaces it as HTTP 400 with a descriptive reason. - Keep BboxTooLargeError as a backwards-compat alias so existing imports / tests don't break. Tests: - Updated tests/server-handlers.test.mjs structural test to pin the new class name + statusCode + lat/lon range checks. 24 tests pass. - typecheck (src + api) clean. * fix(energy-atlas): thread AbortSignal through fetchLiveTankers (PR3 review #2) P2 — AbortController was created + aborted but signal was never passed into the actual fetch path (DeckGLMap.ts:3048 / live-tankers.ts:100): - Toggling the layer off, destroying the map, or starting a new refresh did not actually cancel in-flight network work. A slow older refresh could complete after a newer one and overwrite this.liveTankers with stale data. Threading: - fetchLiveTankers() now accepts `options.signal: AbortSignal`. Signal is passed through to client.getVesselSnapshot() per chokepoint via the Connect-RPC client's standard `{ signal }` option. - Per-zone abort handling: bail early if signal is already aborted before the fetch starts (saves a wasted RPC + cache write); re-check after the fetch resolves so a slow resolver can't clobber cache after the caller cancelled. Stale-result race guard in DeckGLMap.loadLiveTankers: - Capture controller in a local before storing on this.liveTankersAbort. - After fetchLiveTankers resolves, drop the result if EITHER: - controller.signal is now aborted (newer load cancelled this one) - this.liveTankersAbort points to a different controller (a newer load already started + replaced us in the field) - Without these guards, an older fetch that completed despite signal.aborted could still write to this.liveTankers and call updateLayers, racing with the newer load. Tests: 1 new signature-pin test in tests/live-tankers-service.test.mts verifies fetchLiveTankers accepts options.signal — guards against future edits silently dropping the parameter and re-introducing the race. 6 tests pass. typecheck clean. * fix(energy-atlas): bound vessel-snapshot cache via LRU eviction (PR3 review) Greptile P2 finding: the in-process cache Map grows unbounded across the serverless instance lifetime. Each distinct (includeCandidates, includeTankers, quantizedBbox) triple creates a slot that's never evicted. With 1° quantization and a misbehaving client the keyspace is ~64,000 entries — realistic load is ~12, so a 128-slot cap leaves 10x headroom while making OOM impossible. Implementation: - SNAPSHOT_CACHE_MAX_SLOTS = 128. - evictIfNeeded() walks insertion order and evicts the first slot whose inFlight is null. Slots with active fetches are skipped to avoid orphaning awaiting callers; we accept brief over-cap growth until in-flight settles. - touchSlot() re-inserts a slot at the end of Map insertion order on hit / in-flight join / fresh write so it counts as most-recently-used.	2026-04-25 17:56:23 +04:00
Elie Habib	2f5445284b	fix(brief): single canonical synthesis brain — eliminate email/brief lead divergence (#3396 ) * feat(brief-llm): canonical synthesis prompt + v3 cache key Extends generateDigestProse to be the single source of truth for brief executive-summary synthesis (canonicalises what was previously split between brief-llm's generateDigestProse and seed-digest- notifications.mjs's generateAISummary). Ports Brain B's prompt features into buildDigestPrompt: - ctx={profile, greeting, isPublic} parameter (back-compat: 4-arg callers behave like today) - per-story severity uppercased + short-hash prefix [h:XXXX] so the model can emit rankedStoryHashes for stable re-ranking - profile lines + greeting opener appear only when ctx.isPublic !== true validateDigestProseShape gains optional rankedStoryHashes (≥4-char strings, capped to MAX_STORIES_PER_USER × 2). v2-shaped rows still pass — field defaults to []. hashDigestInput v3: - material includes profile-SHA, greeting bucket, isPublic flag, per-story hash - isPublic=true substitutes literal 'public' for userId in the cache key so all share-URL readers of the same (date, sensitivity, pool) hit ONE cache row (no PII in public cache key) Adds generateDigestProsePublic(stories, sensitivity, deps) wrapper — no userId param by design — for the share-URL surface. Cache prefix bumped brief:llm:digest:v2 → v3. v2 rows expire on TTL. Per the v1→v2 precedent (see hashDigestInput comment), one-tick cost on rollout is acceptable for cache-key correctness. Tests: 72/72 passing in tests/brief-llm.test.mjs (8 new for the v3 behaviors), full data suite 6952/6952. Plan: docs/plans/2026-04-25-002-fix-brief-email-two-brain-divergence-plan.md Step 1, Codex-approved (5 rounds). * feat(brief): envelope v3 — adds digest.publicLead for share-URL surface Bumps BRIEF_ENVELOPE_VERSION 2 → 3. Adds optional BriefDigest.publicLead — non-personalised executive lead generated by generateDigestProsePublic (already in this branch from the previous commit) for the public share-URL surface. Personalised `lead` is the canonical synthesis for authenticated channels; publicLead is its profile-stripped sibling so api/brief/public/* never serves user-specific content (watched assets/regions). SUPPORTED_ENVELOPE_VERSIONS = [1, 2, 3] keeps v1 + v2 envelopes in the 7-day TTL window readable through the rollout — the composer only ever writes the current version, but readers must tolerate older shapes that haven't expired yet. Same rollout pattern used at the v1 → v2 bump. Renderer changes (server/_shared/brief-render.js): - ALLOWED_DIGEST_KEYS gains 'publicLead' (closed-key-set still enforced; v2 envelopes pass because publicLead === undefined is the v2 shape). - assertBriefEnvelope: new isNonEmptyString check on publicLead when present. Type contract enforced; absence is OK. Tests (tests/brief-magazine-render.test.mjs): - New describe block "v3 publicLead field": v3 envelope renders; malformed publicLead rejected; v2 envelope still passes; ad-hoc digest keys (e.g. synthesisLevel) still rejected — confirming the closed-key-set defense holds for the cron-local-only fields the orchestrator must NOT persist. - BRIEF_ENVELOPE_VERSION pin updated 2 → 3 with rollout-rationale comment. Test results: 182 brief-related tests pass; full data suite 6956/6956. Plan: docs/plans/2026-04-25-002-fix-brief-email-two-brain-divergence-plan.md Step 2, Codex Round-3 Medium #2. * feat(brief): synthesis splice + rankedStoryHashes pre-cap re-order Plumbs the canonical synthesis output (lead, threads, signals, publicLead, rankedStoryHashes from generateDigestProse) through the pure composer so the orchestration layer can hand pre-resolved data into envelope.digest. Composer stays sync / no I/O — Codex Round-2 High #2 honored. Changes: scripts/lib/brief-compose.mjs: - digestStoryToUpstreamTopStory now emits `hash` (the digest story's stable identifier, falls back to titleHash when absent). Without this, rankedStoryHashes from the LLM has nothing to match against. - composeBriefFromDigestStories accepts opts.synthesis = {lead, threads, signals, rankedStoryHashes?, publicLead?}. When passed, splices into envelope.digest after the stub is built. Partial synthesis (e.g. only `lead` populated) keeps stub defaults for the other fields — graceful degradation when L2 fallback fires. shared/brief-filter.js: - filterTopStories accepts optional rankedStoryHashes. New helper applyRankedOrder re-orders stories by short-hash prefix match BEFORE the cap is applied, so the model's editorial judgment of importance survives MAX_STORIES_PER_USER. Stable for ties; stories not in the ranking come after in original order. Empty/missing ranking is a no-op (legacy callers unchanged). shared/brief-filter.d.ts: - filterTopStories signature gains rankedStoryHashes?: string[]. - UpstreamTopStory gains hash?: unknown (carried through from digestStoryToUpstreamTopStory). Tests added (tests/brief-from-digest-stories.test.mjs): - synthesis substitutes lead/threads/signals/publicLead. - legacy 4-arg callers (no synthesis) keep stub lead. - partial synthesis (only lead) keeps stub threads/signals. - rankedStoryHashes re-orders pool before cap. - short-hash prefix match (model emits 8 chars; story carries full). - unranked stories go after in original order. Test results: 33/33 in brief-from-digest-stories; 182/182 across all brief tests; full data suite 6956/6956. Plan: docs/plans/2026-04-25-002-fix-brief-email-two-brain-divergence-plan.md Step 3, Codex Round-2 Low + Round-2 High #2. * feat(brief): single canonical synthesis per user; rewire all channels Restructures the digest cron's per-user compose + send loops to produce ONE canonical synthesis per user per issueSlot — the lead text every channel (email HTML, plain-text, Telegram, Slack, Discord, webhook) and the magazine show is byte-identical. This eliminates the "two-brain" divergence that was producing different exec summaries on different surfaces (observed 2026-04-25 0802). Architecture: composeBriefsForRun (orchestration): - Pre-annotates every eligible rule with lastSentAt + isDue once, before the per-user pass. Same getLastSentAt helper the send loop uses so compose + send agree on lastSentAt for every rule. composeAndStoreBriefForUser (per-user): - Two-pass winner walk: try DUE rules first (sortedDue), fall back to ALL eligible rules (sortedAll) for compose-only ticks. Preserves today's dashboard refresh contract for weekly / twice_daily users on non-due ticks (Codex Round-4 High #1). - Within each pass, walk by compareRules priority and pick the FIRST candidate with a non-empty pool — mirrors today's behavior at scripts/seed-digest-notifications.mjs:1044 and prevents the "highest-priority but empty pool" edge case (Codex Round-4 Medium #2). - Three-level synthesis fallback chain: L1: generateDigestProse(fullPool, ctx={profile,greeting,!public}) L2: generateDigestProse(envelope-sized slice, ctx={}) L3: stub from assembleStubbedBriefEnvelope Distinct log lines per fallback level so ops can quantify failure-mode distribution. - Generates publicLead in parallel via generateDigestProsePublic (no userId param; cache-shared across all share-URL readers). - Splices synthesis into envelope via composer's optional `synthesis` arg (Step 3); rankedStoryHashes re-orders the pool BEFORE the cap so editorial importance survives MAX_STORIES. - synthesisLevel stored in the cron-local briefByUser entry — NOT persisted in the envelope (renderer's assertNoExtraKeys would reject; Codex Round-2 Medium #5). Send loop: - Reads lastSentAt via shared getLastSentAt helper (single source of truth with compose flow). - briefLead = brief?.envelope?.data?.digest?.lead — the canonical lead. Passed to buildChannelBodies (text/Telegram/Slack/Discord), injectEmailSummary (HTML email), and sendWebhook (webhook payload's `summary` field). All-channel parity (Codex Round-1 Medium #6). - Subject ternary reads cron-local synthesisLevel: 1 or 2 → "Intelligence Brief", 3 → "Digest" (preserves today's UX for fallback paths; Codex Round-1 Missing #5). Removed: - generateAISummary() — the second LLM call that produced the divergent email lead. ~85 lines. - AI_SUMMARY_CACHE_TTL constant — no longer referenced. The digest:ai-summary:v1:* cache rows expire on their existing 1h TTL (no cleanup pass). Helpers added: - getLastSentAt(rule) — extracted Upstash GET for digest:last-sent so compose + send both call one source of truth. - buildSynthesisCtx(rule, nowMs) — formats profile + greeting for the canonical synthesis call. Preserves all today's prefs-fetch failure-mode behavior. Composer: - compareRules now exported from scripts/lib/brief-compose.mjs so the cron can sort each pass identically to groupEligibleRulesByUser. Test results: full data suite 6962/6962 (was 6956 pre-Step 4; +6 new compose-synthesis tests from Step 3). Plan: docs/plans/2026-04-25-002-fix-brief-email-two-brain-divergence-plan.md Steps 4 + 4b. Codex-approved (5 rounds). * fix(brief-render): public-share lead fail-safe — never leak personalised lead Public-share render path (api/brief/public/[hash].ts → renderer publicMode=true) MUST NEVER serve the personalised digest.lead because that string can carry profile context — watched assets, saved-region names, etc. — written by generateDigestProse with ctx.profile populated. Previously: redactForPublic redacted user.name and stories.whyMatters but passed digest.lead through unchanged. Codex Round-2 High (security finding). Now (v3 envelope contract): - redactForPublic substitutes digest.lead = digest.publicLead when the v3 envelope carries one (generated by generateDigestProsePublic with profile=null, cache-shared across all public readers). - When publicLead is absent (v2 envelope still in TTL window OR v3 envelope where publicLead generation failed), redactForPublic sets digest.lead to empty string. - renderDigestGreeting: when lead is empty, OMIT the <blockquote> pull-quote entirely. Page still renders complete (greeting + horizontal rule), just without the italic lead block. - NEVER falls back to the original personalised lead. assertBriefEnvelope still validates publicLead's contract (when present, must be a non-empty string) BEFORE redactForPublic runs, so a malformed publicLead throws before any leak risk. Tests added (tests/brief-magazine-render.test.mjs): - v3 envelope renders publicLead in pull-quote, personalised lead text never appears. - v2 envelope (no publicLead) omits pull-quote; rest of page intact. - empty-string publicLead rejected by validator (defensive). - private render still uses personalised lead. Test results: 68 brief-magazine-render tests pass; full data suite remains green from prior commit. Plan: docs/plans/2026-04-25-002-fix-brief-email-two-brain-divergence-plan.md Step 5, Codex Round-2 High (security). * feat(digest): brief lead parity log + extra acceptance tests Adds the parity-contract observability line and supplementary acceptance tests for the canonical synthesis path. Parity log (per send, after successful delivery): [digest] brief lead parity user=<id> rule=<v>:<s>:<lang> synthesis_level=<1\|2\|3> exec_len=<n> brief_lead_len=<n> channels_equal=<bool> public_lead_len=<n> When channels_equal=false an extra WARN line fires — "PARITY REGRESSION user=… — email lead != envelope lead." Sentry's existing console-breadcrumb hook lifts this without an explicit captureMessage call. Plan acceptance criterion A5. Tests added (tests/brief-llm.test.mjs, +9): - generateDigestProsePublic: two distinct callers with identical (sensitivity, story-pool) hit the SAME cache row (per Codex Round-2 Medium #4 — "no PII in public cache key"). - public + private writes never collide on cache key (defensive). - greeting bucket change re-keys the personalised cache (Brain B parity). - profile change re-keys the personalised cache. - v3 cache prefix used (no v2 writes). Test results: 77/77 in brief-llm; full data suite 6971/6971 (was 6962 pre-Step-7; +9 new public-cache tests). Plan: docs/plans/2026-04-25-002-fix-brief-email-two-brain-divergence-plan.md Steps 6 (partial) + 7. Acceptance A5, A6.g, A6.f. * test(digest): backfill A6.h/i/l/m acceptance tests via helper extraction * fix(brief): close two correctness regressions on multi-rule + public surface Two findings from human review of the canonical-synthesis PR: 1. Public-share redaction leaked personalised signals + threads. The new prompt explicitly personalises both `lead` and `signals` ("personalise lead and signals"), but redactForPublic only substituted `lead` — leaving `signals` and `threads` intact. Public renderer's hasSignals gate would emit the signals page whenever `digest.signals.length > 0`, exposing watched-asset / region phrasing to anonymous readers. Same privacy bug class the original PR was meant to close, just on different fields. 2. Multi-rule users got cross-pool lead/storyList mismatch. composeAndStoreBriefForUser picks ONE winning rule for the canonical envelope. The send loop then injected that ONE `briefLead` into every due rule's channel body — even though each rule's storyList came from its own (per-rule) digest pool. Multi-rule users (e.g. `full` + `finance`) ended up with email bodies leading on geopolitics while listing finance stories. Cross-rule editorial mismatch reintroduced after the cross- surface fix. Fix 1 — public signals + threads: - Envelope shape: BriefDigest gains `publicSignals?: string[]` + `publicThreads?: BriefThread[]` (sibling fields to publicLead). Renderer's ALLOWED_DIGEST_KEYS extended; assertBriefEnvelope validates them when present. - generateDigestProsePublic already returned a full prose object (lead + signals + threads) — orchestration now captures all three instead of just `.lead`. Composer splices each into its envelope slot. - redactForPublic substitutes: digest.lead ← publicLead (or empty → omits pull-quote) digest.signals ← publicSignals (or empty → omits signals page) digest.threads ← publicThreads (or category-derived stub via new derivePublicThreadsStub helper — never falls back to the personalised threads) - New tests cover all three substitutions + their fail-safes. Fix 2 — per-rule synthesis in send loop: - Each due rule independently calls runSynthesisWithFallback over ITS OWN pool + ctx. Channel body lead is internally consistent with the storyList (both from the same pool). - Cache absorbs the cost: when this is the winner rule, the synthesis hits the cache row written during the compose pass (same userId/sensitivity/pool/ctx) — no extra LLM call. Only multi-rule users with non-overlapping pools incur additional LLM calls. - magazineUrl still points at the winner's envelope (single brief per user per slot — `(userId, issueSlot)` URL contract). Channel lead vs magazine lead may differ for non-winner rule sends; documented as acceptable trade-off (URL/key shape change to support per-rule magazines is out of scope for this PR). - Parity log refined: adds `winner_match=<bool>` field. The PARITY REGRESSION warning now fires only when winner_match=true AND the channel lead differs from the envelope lead (the actual contract regression). Non-winner sends with legitimately different leads no longer spam the alert. Test results: - tests/brief-magazine-render.test.mjs: 75/75 (+7 new for public signals/threads + validator + private-mode-ignores-public-fields) - Full data suite: 6995/6995 (was 6988; +7 net) - typecheck + typecheck:api: clean Plan: docs/plans/2026-04-25-002-fix-brief-email-two-brain-divergence-plan.md Addresses 2 review findings on PR #3396 not anticipated in the 5-round Codex review. * fix(brief): unify compose+send window, fall through filter-rejection Address two residual risks in PR #3396 (single-canonical-brain refactor): Risk 1 — canonical lead synthesized from a fixed 24h pool while the send loop ships stories from `lastSentAt ?? 24h`. For weekly users that meant a 24h-pool lead bolted onto a 7d email body — the same cross-surface divergence the refactor was meant to eliminate, just in a different shape. Twice-daily users hit a 12h-vs-24h variant. Fix: extract the window formula to `digestWindowStartMs(lastSentAt, nowMs, defaultLookbackMs)` in digest-orchestration-helpers.mjs and call it from BOTH the compose path's digestFor closure AND the send loop. The compose path now derives windowStart per-candidate from `cand.lastSentAt`, identical to what the send loop will use for that rule. Removed the now-unused BRIEF_STORY_WINDOW_MS constant. Side-effect: digestFor now receives the full annotated candidate (`cand`) instead of just the rule, so it can reach `cand.lastSentAt`. Backwards-compatible at the helper level — pickWinningCandidateWithPool forwards `cand` instead of `cand.rule`. Cache memo hit rate drops since lastSentAt varies per-rule, but correctness > a few extra Upstash GETs. Risk 2 — pickWinningCandidateWithPool returned the first candidate with a non-empty raw pool as winner. If composeBriefFromDigestStories then dropped every story (URL/headline/shape filters), the caller bailed without trying lower-priority candidates. Pre-PR behaviour was to keep walking. This regressed multi-rule users whose top-priority rule's pool happens to be entirely filter-rejected. Fix: optional `tryCompose(cand, stories)` callback on pickWinningCandidateWithPool. When provided, the helper calls it after the non-empty pool check; falsy return → log filter-rejected and walk to the next candidate; truthy → returns `{winner, stories, composeResult}` so the caller can reuse the result. Without the callback, legacy semantics preserved (existing tests + callers unaffected). Caller composeAndStoreBriefForUser passes a no-synthesis compose call as tryCompose — cheap pure-JS, no I/O. Synthesis only runs once after the winner is locked in, so the perf cost is one extra compose per filter-rejected candidate, no extra LLM round-trips. Tests: - 10 new cases in tests/digest-orchestration-helpers.test.mjs covering: digestFor receiving full candidate; tryCompose fall-through to lower-priority; all-rejected returns null; composeResult forwarded; legacy semantics without tryCompose; digestWindowStartMs lastSentAt-vs-default branches; weekly + twice-daily window parity assertions; epoch-zero ?? guard. - Updated tests/digest-cache-key-sensitivity.test.mjs static-shape regex to match the new `cand.rule.sensitivity` cache-key shape (intent unchanged: cache key MUST include sensitivity). Stacked on PR #3396 — targets feat/brief-two-brain-divergence.	2026-04-25 16:22:31 +04:00
Sebastien Melki	e68a7147dd	chore(api): sebuf migration follow-ups (post-#3242) (#3287 ) * chore(api-manifest): rewrite brief-why-matters reason as proper internal-helper justification Carried in from #3248 merge as a band-aid (called out in #3242 review followup checklist item 7). The endpoint genuinely belongs in internal-helper — RELAY_SHARED_SECRET-bearer auth, cron-only caller, never reached by dashboards or partners. Same shape constraint as api/notify.ts. Replaces the apologetic "filed here to keep the lint green" framing with a proper structural justification: modeling it as a generated service would publish internal cron plumbing as user-facing API surface. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat(lint): premium-fetch parity check for ServiceClients (closes #3279) Adds scripts/enforce-premium-fetch.mjs — AST-walks src/, finds every `new <ServiceClient>(...)` (variable decl OR `this.foo =` assignment), tracks which methods each instance actually calls, and fails if any called method targets a path in src/shared/premium-paths.ts PREMIUM_RPC_PATHS without `{ fetch: premiumFetch }` on the constructor. Per-call-site analysis (not class-level) keeps the trade/index.ts pattern clean — publicClient with globalThis.fetch + premiumClient with premiumFetch on the same TradeServiceClient class — since publicClient never calls a premium method. Wired into: - npm run lint:premium-fetch - .husky/pre-push (right after lint:rate-limit-policies) - .github/workflows/lint-code.yml (right after lint:api-contract) Found and fixed three latent instances of the HIGH(new) #1 class from #3242 review (silent 401 → empty fallback for signed-in browser pros): - src/services/correlation-engine/engine.ts — IntelligenceServiceClient built with no fetch option called deductSituation. LLM-assessment overlay on convergence cards never landed for browser pros without a WM key. - src/services/economic/index.ts — EconomicServiceClient with globalThis.fetch called getNationalDebt. National-debt panel rendered empty for browser pros. - src/services/sanctions-pressure.ts — SanctionsServiceClient with globalThis.fetch called listSanctionsPressure. Sanctions-pressure panel rendered empty for browser pros. All three swap to premiumFetch (single shared client, mirrors the supply-chain/index.ts justification — premiumFetch no-ops safely on public methods, so the public methods on those clients keep working). Verification: - lint:premium-fetch clean (34 ServiceClient classes, 28 premium paths, 466 src/ files analyzed) - Negative test: revert any of the three to globalThis.fetch → exit 1 with file:line and called-premium-method names - typecheck + typecheck:api clean - lint:api-contract / lint:rate-limit-policies / lint:boundaries clean - tests/sanctions-pressure.test.mjs + premium-fetch.test.mts: 16/16 pass Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(military): fetchStaleFallback NEG_TTL=30s parity (closes #3277) The legacy /api/military-flights handler had NEG_TTL = 30_000ms — a short suppression window after a failed live + stale read so we don't Redis-hammer the stale key during sustained relay+seed outages. Carried into the sebuf list-military-flights handler: - Module-scoped `staleNegUntil` timestamp (per-isolate on Vercel Edge, which is fine — each warm isolate gets its own 30s suppression window). - Set whenever fetchStaleFallback returns null (key missing, parse fail, empty array after staleToProto filter, or thrown error). - Checked at the entry of fetchStaleFallback before doing the Redis read. - Test seam `_resetStaleNegativeCacheForTests()` exposed for unit tests. Test pinned in tests/redis-caching.test.mjs: drives a stale-empty cycle three times — first read hits Redis, second within window doesn't, after test-only reset it does again. Verified: 18/18 redis-caching tests pass, typecheck:api clean, lint:premium-fetch clean. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * refactor(lint): rate-limit-policies regex → import() (closes #3278) The previous lint regex-parsed ENDPOINT_RATE_POLICIES from the source file. That worked because the literal happens to fit a single line per key today, but a future reformat (multi-line key wrap, formatter swap, etc.) would silently break the lint without breaking the build — exactly the failure mode that's worse than no lint at all. Fix: - Export ENDPOINT_RATE_POLICIES from server/_shared/rate-limit.ts. - Convert scripts/enforce-rate-limit-policies.mjs to async + dynamic import() of the policy object directly. Same TS module that the gateway uses at runtime → no source-of-truth drift possible. - Run via tsx (already a dev dep, used by test:data) so the .mjs shebang can resolve a .ts import. - npm script swapped to `tsx scripts/...`. .husky/pre-push uses `npm run lint:rate-limit-policies` so no hook change needed. Verified: - Clean: 6 policies / 182 gateway routes. - Negative test (rename a key to the original sanctions typo /api/sanctions/v1/lookup-entity): exit 1 with the same incident- attributed remedy message as before. - Reformat test (split a single-line entry across multiple lines): still passes — the property is what's read, not the source layout. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(shipping/v2): alertThreshold: 0 preserved; drop dead validation branch (#3242 followup) Before: alert_threshold was a plain int32. proto3 scalar default is 0, so the handler couldn't distinguish "partner explicitly sent 0 (deliver every disruption)" from "partner omitted the field (apply legacy default 50)" — both arrived as 0 and got coerced to 50 by `> 0 ? : 50`. Silent intent-drop for any partner who wanted every alert. The subsequent `alertThreshold < 0` branch was also unreachable after that coercion. After: - Proto field is `optional int32 alert_threshold` — TS type becomes `alertThreshold?: number`, so omitted = undefined and explicit 0 stays 0. - Handler uses `req.alertThreshold ?? 50` — undefined → 50, any number passes through unchanged. - Dead `< 0 \|\| > 100` runtime check removed; buf.validate `int32.gte = 0, int32.lte = 100` already enforces the range at the wire layer. Partner wire contract: identical for the omit-field and 1..100 cases. Only behavioural change is explicit 0 — previously impossible to request, now honored per proto3 optional semantics. Scoped `buf generate --path worldmonitor/shipping/v2` to avoid the full- regen `@ts-nocheck` drift Seb documented in the #3242 PR comments. Re-applied `@ts-nocheck` on the two regenerated files manually. Tests: - `alertThreshold 0 coerces to 50` flipped to `alertThreshold 0 preserved`. - New test: `alertThreshold omitted (undefined) applies legacy default 50`. - `rejects > 100` test removed — proto/wire validation handles it; direct handler calls intentionally bypass wire and the handler no longer carries a redundant runtime range check. Verified: 18/18 shipping-v2-handler tests pass, typecheck + typecheck:api clean, all 4 custom lints clean. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * docs(shipping/v2): document missing webhook delivery worker + DNS-rebinding contract (#3242 followup) #3242 followup checklist item 6 from @koala73 — sanity-check that the delivery worker honors the re-resolve-and-re-check contract that isBlockedCallbackUrl explicitly delegates to it. Audit finding: no delivery worker for shipping/v2 webhooks exists in this repo. Grep across the entire tree (excluding generated/dist) shows the only readers of webhook:sub:* records are the registration / inspection / rotate-secret handlers themselves. No code reads them and POSTs to the stored callbackUrl. The delivery worker is presumed to live in Railway (separate repo) or hasn't been built yet — neither is auditable from this repo. Refreshes the comment block at the top of webhook-shared.ts to: - explicitly state DNS rebinding is NOT mitigated at registration - spell out the four-step contract the delivery worker MUST follow (re-validate URL, dns.lookup, re-check resolved IP against patterns, fetch with resolved IP + Host header preserved) - flag the in-repo gap so anyone landing delivery code can't miss it Tracking the gap as #3288 — acceptance there is "delivery worker imports the patterns + helpers from webhook-shared.ts and applies the four steps before each send." Action moves to wherever the delivery worker actually lives (Railway likely). No code change. Tests + lints unchanged. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * ci(lint): add rate-limit-policies step (greptile P1 #3287) Pre-push hook ran lint:rate-limit-policies but the CI workflow did not, so fork PRs and --no-verify pushes bypassed the exact drift check the lint was added to enforce (closes #3278). Adding it right after lint:api-contract so it runs in the same context the lint was designed for. * refactor(lint): premium-fetch regex → import() + loop classRe (greptile P2 #3287) Two fragilities greptile flagged on enforce-premium-fetch.mjs: 1. loadPremiumPaths regex-parsed src/shared/premium-paths.ts with /'(\/api\/[^']+)'/g — same class of silent drift we just removed from enforce-rate-limit-policies in #3278. Reformatting the source Set (double quotes, spread, helper-computed entries) would drop paths from the lint while leaving the runtime untouched. Fix: flip the shebang to `#!/usr/bin/env -S npx tsx` and dynamic-import PREMIUM_RPC_PATHS directly, mirroring the rate-limit pattern. package.json lint:premium-fetch now invokes via tsx too so the npm-script path matches direct execution. 2. loadClientClassMap ran classRe.exec once, silently dropping every ServiceClient after the first if a file ever contained more than one. Current codegen emits one class per file so this was latent, but a template change would ship un-linted classes. Fix: collect every class-open match with matchAll, slice each class body with the next class's start as the boundary, and scan methods per-body so method-to-class binding stays correct even with multiple classes per file. Verification: - lint:premium-fetch clean (34 classes / 28 premium paths / 466 files — identical counts to pre-refactor, so no coverage regression). - Negative test: revert src/services/economic/index.ts to globalThis.fetch → exit 1 with file:line, bound var name, and premium method list (getNationalDebt). Restore → clean. - lint:rate-limit-policies still clean. * fix(shipping/v2): re-add alertThreshold handler range guard (greptile nit 1 #3287) Wire-layer buf.validate enforces 0..100, but direct handler invocation (internal jobs, test harnesses, future transports) bypasses it. Cheap invariant-at-the-boundary — rejects < 0 or > 100 with ValidationError before the record is stored. Tests: restored the rejects-out-of-range cases that were dropped when the branch was (correctly) deleted as dead code on the previous commit. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * refactor(lint): premium-fetch method-regex → TS AST (greptile nits 2+5 #3287) loadClientClassMap: The method regex `async (\w+)\s$[^)]$\s:\sPromise<[^>]+>\s\{\slet path = "..."` assumed (a) no nested `)` in arg types, (b) no nested `>` in the return type, (c) `let path = "..."` as the literal first statement. Any codegen template shift would silently drop methods with the lint still passing clean — the same silent-drift class #3287 just closed on the premium-paths side. Now walks the service_client.ts AST, matches `export class ServiceClient`, iterates `MethodDeclaration` members, and reads the first `let path: string = '...'` variable statement as a StringLiteral. Tolerant to any reformatting of arg/return types or method shape. findCalls scope-blindness: Added limitation comment — the walker matches `<varName>.<method>()` anywhere in the file without respecting scope. Two constructions in different function scopes sharing a var name merge their called-method sets. No current src/ file hits this; the lint errs cautiously (flags both instances). Keeping the walker simple until scope-aware binding is needed. webhook-shared.ts: Inlined issue reference (#3288) so the breadcrumb resolves without bouncing through an MDX that isn't in the diff. Verification: - lint:premium-fetch clean — 34 classes / 28 premium paths / 489 files. Pre-refactor: 34 / 28 / 466. Class + path counts identical; file bump is from the main-branch rebase, not the refactor. - Negative test: revert src/services/economic/index.ts premiumFetch → globalThis.fetch. Lint exits 1 at `src/services/economic/index.ts:64:7` with `premium method(s) called: getNationalDebt`. Restore → clean. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> refactor(lint): rate-limit OpenAPI regex → yaml parser (greptile nit 3 #3287) Input side (ENDPOINT_RATE_POLICIES) was flipped to live `import()` in `4e79d029`. Output side (OpenAPI routes) still regex-scraped top-level `paths:` keys with `/^\s{4}(\/api\/[^\s:]+):/gm` — hard-coded 4-space indent. Any YAML formatter change (2-space indent, flow style, line folding) would silently drop routes and let policy-drift slip through — same silent-drift class the input-side fix closed. Now uses the `yaml` package (already a dep) to parse each .openapi.yaml and reads `doc.paths` directly. Verification: - Clean: 6 policies / 189 routes (was 182 — yaml parser picks up a handful the regex missed, closing a silent coverage gap). - Negative test: rename policy key back to /api/sanctions/v1/lookup-entity → exits 1 with the same incident-attributed remedy. Restore → clean. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * chore(codegen): regenerate unified OpenAPI bundle for alert_threshold proto change The shipping/v2 webhook alert_threshold field was flipped from `int32` to `optional int32` with an expanded doc comment in `f3339464`. That comment now surfaces in the unified docs/api/worldmonitor.openapi.yaml bundle (introduced by #3341). Regenerated with sebuf v0.11.1 to pick it up. No behaviour change — bundle-only documentation drift. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 18:00:41 +03:00
Elie Habib	34dfc9a451	fix(news): ground LLM surfaces on real RSS description end-to-end (#3370 ) * feat(news/parser): extract RSS/Atom description for LLM grounding (U1) Add description field to ParsedItem, extract from the first non-empty of description/content:encoded (RSS) or summary/content (Atom), picking the longest after HTML-strip + entity-decode + whitespace-normalize. Clip to 400 chars. Reject empty, <40 chars after strip, or normalize-equal to the headline — downstream consumers fall back to the cleaned headline on '', preserving current behavior for feeds without a description. CDATA end is anchored to the closing tag so internal ]]> sequences do not truncate the match. Preserves cached rss:feed:v1 row compatibility during the 1h TTL bleed since the field is additive. Part of fix: pipe RSS description end-to-end so LLM surfaces stop hallucinating named actors (docs/plans/2026-04-24-001-...). Covers R1, R7. * feat(news/story-track): persist description on story:track:v1 HSET (U2) Append description to the story:track:v1 HSET only when non-empty. Additive — no key version bump. Old rows and rows from feeds without a description return undefined on HGETALL, letting downstream readers fall back to the cleaned headline (R6). Extract buildStoryTrackHsetFields as a pure helper so the inclusion gate is unit-testable without Redis. Update the contract comment in cache-keys.ts so the next reader of the schema sees description as an optional field. Covers R2, R6. * feat(proto): NewsItem.snippet + SummarizeArticleRequest.bodies (U3) Add two additive proto fields so the article description can ride to every LLM-adjacent consumer without a breaking change: - NewsItem.snippet (field 12): RSS/Atom description, HTML-stripped, ≤400 chars, empty when unavailable. Wired on toProtoItem. - SummarizeArticleRequest.bodies (field 8): optional article bodies paired 1:1 with headlines for prompt grounding. Empty array is today's headline-only behavior. Regenerated TS client/server stubs and OpenAPI YAML/JSON via sebuf v0.11.1 (PATH=~/go/bin required — Homebrew's protoc-gen-openapiv3 is an older pre-bundle-mode build that collides on duplicate emission). Pre-emptive bodies:[] placeholders at the two existing SummarizeArticle call sites in src/services/summarization.ts; U6 replaces them with real article bodies once SummarizeArticle handler reads the field. Covers R3, R5. * feat(brief/digest): forward RSS description end-to-end through brief envelope (U4) Digest accumulator reader (seed-digest-notifications.mjs::buildDigest) now plumbs the optional `description` field off each story:track:v1 HGETALL into the digest story object. The brief adapter (brief-compose.mjs:: digestStoryToUpstreamTopStory) prefers the real RSS description over the cleaned headline; when the upstream row has no description (old rows in the 48h bleed, feeds that don't carry one), we fall back to the cleaned headline so today behavior is preserved (R6). This is the upstream half of the description cache path. U5 lands the LLM- side grounding + cache-prefix bump so Gemini actually sees the article body instead of hallucinating a named actor from the headline. Covers R4 (upstream half), R6. * feat(brief/llm): RSS grounding + sanitisation + 4 cache prefix bumps (U5) The actual fix for the headline-only named-actor hallucination class: Gemini 2.5 Flash now receives the real article body as grounding context, so it paraphrases what the article says instead of filling role-label headlines from parametric priors ("Iran's new supreme leader" → "Ali Khamenei" was the 2026-04-24 reproduction; with grounding, it becomes the actual article-named actor). Changes: - buildStoryDescriptionPrompt interpolates a `Context: <body>` line between the metadata block and the "One editorial sentence" instruction when description is non-empty AND not normalise-equal to the headline. Clips to 400 chars as a second belt-and-braces after the U1 parser cap. No Context line → identical prompt to pre-fix (R6 preserved). - sanitizeStoryForPrompt extended to cover `description`. Closes the asymmetry where whyMatters was sanitised and description wasn't — untrusted RSS bodies now flow through the same injection-marker neutraliser before prompt interpolation. generateStoryDescription wraps the story in sanitizeStoryForPrompt before calling the builder, matching generateWhyMatters. - Four cache prefixes bumped atomically to evict pre-grounding rows: scripts/lib/brief-llm.mjs: brief:llm:description:v1 → v2 (Railway, description path) brief:llm:whymatters:v2 → v3 (Railway, whyMatters fallback) api/internal/brief-why-matters.ts: brief:llm:whymatters:v6 → v7 (edge, primary) brief:llm:whymatters:shadow:v4 → shadow:v5 (edge, shadow) hashBriefStory already includes description in the 6-field material (v5 contract) so identity naturally drifts; the prefix bump is the belt-and-braces that guarantees a clean cold-start on first tick. - Tests: 8 new + 2 prefix-match updates on tests/brief-llm.test.mjs. Covers Context-line injection, empty/dup-of-headline rejection, 400-char clip, sanitisation of adversarial descriptions, v2 write, and legacy-v1 row dark (forced cold-start). Covers R4 + new sanitisation requirement. * feat(news/summarize): accept bodies + bump summary cache v5→v6 (U6) SummarizeArticle now grounds on per-headline article bodies when callers supply them, so the dashboard "News summary" path stops hallucinating across unrelated headlines when the upstream RSS carried context. Three coordinated changes: 1. SummarizeArticleRequest handler reads req.bodies, sanitises each entry through sanitizeForPrompt (same trust treatment as geoContext — bodies are untrusted RSS text), clips to 400 chars, and pads to the headlines length so pair-wise identity is stable. 2. buildArticlePrompts accepts optional bodies and interleaves a ` Context: <body>` line under each numbered headline that has a non-empty body. Skipped in translate mode (headline[0]-only) and when all bodies are empty — yielding a byte-identical prompt to pre-U6 for every current caller (R6 preserved). 3. summary-cache-key bumps CACHE_VERSION v5→v6 so the pre-grounding rows (produced from headline-only prompts) cold-start cleanly. Extends canonicalizeSummaryInputs + buildSummaryCacheKey with a pair-wise bodies segment `:bd<hash>`; the prefix is `:bd` rather than `:b` to avoid colliding with `:brief:` when pattern-matching keys. Translate mode is headline[0]-only and intentionally does not shift on bodies. Dedup reorder preserved: the handler re-pairs bodies to the deduplicated top-5 via findIndex, so layout matches without breaking cache identity. New tests: 7 on buildArticlePrompts (bodies interleave, partial fill, translate-mode skip, clip, short-array tolerance), 8 on buildSummaryCacheKey (pair-wise sort, cache-bust on body drift, translate skip). Existing summary-cache-key assertions updated v5→v6. Covers R3, R4. * feat(consumers): surface RSS snippet across dashboard, email, relay, MCP + audit (U7) Thread the RSS description from the ingestion path (U1-U5) into every user-facing LLM-adjacent surface. Audit the notification producers so RSS-origin and domain-origin events stay on distinct contracts. Dashboard (proto snippet → client → panel): - src/types/index.ts NewsItem.snippet?:string (client-side field). - src/app/data-loader.ts proto→client mapper propagates p.snippet. - src/components/NewsPanel.ts renders snippet as a truncated (~200 chars, word-boundary ellipsis) `.item-snippet` line under each headline. - NewsPanel.currentBodies tracks per-headline bodies paired 1:1 with currentHeadlines; passed as options.bodies to generateSummary so the server-side SummarizeArticle LLM grounds on the article body. Summary plumbing: - src/services/summarization.ts threads bodies through SummarizeOptions → generateSummary → runApiChain → tryApiProvider; cache key now includes bodies (via U6's buildSummaryCacheKey signature). MCP world-brief: - api/mcp.ts pairs headlines with their RSS snippets and POSTs `bodies` to /api/news/v1/summarize-article so the MCP tool surface is no longer starved. Email digest: - scripts/seed-digest-notifications.mjs plain-text formatDigest appends a ~200-char truncated snippet line under each story; HTML formatDigestHtml renders a dim-grey description div between title and meta. Both gated on non-empty description (R6 — empty → today's behavior). Real-time alerts: - src/services/breaking-news-alerts.ts BreakingAlert gains optional description; checkBatchForBreakingAlerts reads item.snippet; dispatchAlert includes `description` in the /api/notify payload when present. Notification relay: - scripts/notification-relay.cjs formatMessage gated on NOTIFY_RELAY_INCLUDE_SNIPPET=1 (default off). When on, RSS-origin payloads render a `> <snippet>` context line under the title. When off or payload.description absent, output is byte-identical to pre-U7. Audit (RSS vs domain): - tests/notification-relay-payload-audit.test.mjs enforces file-level @notification-source tags on every producer, rejects `description:` in domain-origin payload blocks, and verifies the relay codepath gates snippet rendering under the flag. - Tag added to ais-relay.cjs (domain), seed-aviation.mjs (domain), alert-emitter.mjs (domain), breaking-news-alerts.ts (rss). Deferred (plan explicitly flags): InsightsPanel + cluster-producer plumbing (bodies default to [] — will unlock gradually once news:insights:v1 producer also carries primarySnippet). Covers R5, R6. * docs+test: grounding-path note + bump pinned CACHE_VERSION v5→v6 (U8) Final verification for the RSS-description-end-to-end fix: - docs/architecture.mdx — one-paragraph "News Grounding Pipeline" subsection tracing parser → story:track:v1.description → NewsItem.snippet → brief / SummarizeArticle / dashboard / email / relay / MCP, with the empty-description R6 fallback rule called out explicitly. - tests/summarize-reasoning.test.mjs — Fix-4 static-analysis pin updated to match the v6 bump from U6. Without this the summary cache bump silently regressed CI's pinned-version assertion. Final sweep (2026-04-24): - grep -rn 'brief:llm:description:v1' → only in the U5 legacy-row test simulation (by design: proves the v2 bump forces cold-start). - grep -rn 'brief:llm:whymatters:v2/v6/shadow:v4' → no live references. - grep -rn 'summary:v5' → no references. - CACHE_VERSION = 'v6' in src/utils/summary-cache-key.ts. - Full tsx --test sweep across all tests/.test.{mjs,mts}: 6747/6747 pass. - npm run typecheck + typecheck:api: both clean. Covers R4, R6, R7. fix(rss-description): address /ce:review findings before merge 14 fixes from structured code review across 13 reviewer personas. Correctness-critical (P1 — fixes that prevent R6/U7 contract violations): - NewsPanel signature covers currentBodies so view-mode toggles that leave headlines identical but bodies different now invalidate in-flight summaries. Without this, switching renderItems → renderClusters mid-summary let a grounded response arrive under a stale (now-orphaned) cache key. - summarize-article.ts re-pairs bodies with headlines BEFORE dedup via a single zip-sanitize-filter-dedup pass. Previously bodies[] was indexed by position in light-sanitized headlines while findIndex looked up the full-sanitized array — any headline that sanitizeHeadlines emptied mispaired every subsequent body, grounding the LLM on the wrong story. - Client skips the pre-chain cache lookup when bodies are present, since client builds keys from RAW bodies while server sanitizes first. The keys diverge on injection content, which would silently miss the server's authoritative cache every call. Test + audit hardening: - Legacy v1 eviction test now uses the real hashBriefStory(story()) suffix instead of a literal "somehash", so a bug where the reader still queried the v1 prefix at the real key would actually be caught. - tests/summary-cache-key.test.mts adds 400-char clip identity coverage so the canonicalizer's clip and any downstream clip can't silently drift. - tests/news-rss-description-extract.test.mts renames the well-formed CDATA test and adds a new test documenting the malformed-]]> fallback behavior (plain regex captures, article content survives). Safe_auto cleanups: - Deleted dead SNIPPET_PUSH_MAX constant in notification-relay.cjs. - BETA-mode groq warm call now passes bodies, warming the right cache slot. - seed-digest shares a local normalize-equality helper for description != headline comparison, matching the parser's contract. - Pair-wise sort in summary-cache-key tie-breaks on body so duplicate headlines produce stable order across runs. - buildSummaryCacheKey gained JSDoc documenting the client/server contract and the bodies parameter semantics. - MCP get_world_brief tool description now mentions RSS article-body grounding so calling agents see the current contract. - _shared.ts `opts.bodies![i]!` double-bang replaced with `?? ''`. - extractRawTagBody regexes cached in module-level Map, mirroring the existing TAG_REGEX_CACHE pattern. Deferred to follow-up (tracked for PR description / separate issue): - Promote shared MAX_BODY constant across the 5 clip sites - Promote shared truncateForDisplay helper across 4 render sites - Collapse NewsPanel.{currentHeadlines, currentBodies} → Array<{title, snippet}> - Promote sanitizeStoryForPrompt to shared/brief-llm-core.js - Split list-feed-digest.ts parser helpers into sibling -utils.ts - Strengthen audit test: forward-sweep + behavioral gate test Tests: 6749/6749 pass. Typecheck clean on both configs. * fix(summarization): thread bodies through browser T5 path (Codex #2) Addresses the second of two Codex-raised findings on PR #3370: The PR threaded bodies through the server-side API provider chain (Ollama → Groq → OpenRouter → /api/news/v1/summarize-article) but the local browser T5 path at tryBrowserT5 was still summarising from headlines alone. In BETA_MODE that ungrounded path runs BEFORE the grounded server providers; in normal mode it remains the last fallback. Whenever T5-small won, the dashboard summary surface regressed to the headline-only path — the exact hallucination class this PR exists to eliminate. Fix: tryBrowserT5 accepts an optional `bodies` parameter and interleaves each body with its paired headline via a `headline — body` separator in the combined text (clipped to 200 chars per body to stay within T5-small's ~512-token context window). All three call sites (BETA warm, BETA cold, normal-mode fallback) now pass the bodies threaded down from generateSummary options.bodies. When bodies is empty/omitted, the combined text is byte-identical to pre-fix (R6 preserved). On Codex finding #1 (story:track:v1 additive-only HSET keeps a body from an earlier mention of the same normalized title), declining to change. The current rule — "if this mention has a body, overwrite; otherwise leave the prior body alone" — is defensible: a body from mention A is not falsified by mention B being body-less (a wire reprint doesn't invalidate the original source's body). A feed that publishes a corrected headline creates a new normalized-title hash, so no stale body carries forward. The failure window is narrow (live story evolving while keeping the same title through hours of body-less wire reprints) and the 7-day STORY_TTL is the backstop. Opening a follow-up issue to revisit semantics if real-world evidence surfaces a stale-grounding case. * fix(story-track): description always-written to overwrite stale bodies (Codex #1) Revisiting Codex finding #1 on PR #3370 after re-review. The previous response declined the fix with reasoning; on reflection the argument was over-defending the current behavior. Problem: buildStoryTrackHsetFields previously wrote `description` only when non-empty. Because story:track:v1 rows are collapsed by normalized-title hash, an earlier mention's body would persist for up to STORY_TTL (7 days) on subsequent body-less mentions of the same story. Consumers reading `track.description` via HGETALL could not distinguish "this mention's body" from "some mention's body from the last week," silently grounding brief / whyMatters / SummarizeArticle LLMs on text the current mention never supplied. That violates the grounding contract advertised to every downstream surface in this PR. Fix: HSET `description` unconditionally on every mention — empty string when the current item has no body, real body when it does. An empty value overwrites any prior mention's body so the row is always authoritative for the current cycle. Consumers continue to treat empty description as "fall back to cleaned headline" (R6 preserved). The 7-day STORY_TTL and normalized-title hash semantics are unchanged. Trade-off accepted: a valid body from Feed A (NYT) is wiped when Feed B (AP body-less wire reprint) arrives for the same normalized title, even though Feed A's body is factually correct. Rationale: the alternative — keeping Feed A's body indefinitely — means the user sees Feed A's body attributed (by proximity) to an AP mention at a later timestamp, which is at minimum misleading and at worst carries retracted/corrected details. Honest absence beats unlabeled presence. Tests: new stale-body overwrite sequence test (T0 body → T1 empty → T2 new body), existing "writes description when non-empty" preserved, existing "omits when empty" inverted to "writes empty, overwriting." cache-keys.ts contract comment updated to mark description as always-written rather than optional.	2026-04-24 16:25:14 +04:00
Elie Habib	84ee2beb3e	feat(energy): Energy Atlas end-to-end — pipelines + storage + shortages + disruptions + country drill-down (#3294 ) * feat(energy): pipeline registries (gas + oil) — evidence-based schema Day 6 of the Energy Atlas Release 1 plan (Week 2). First curated asset registry for the atlas — the real gap vs GEF. ## Curated data (critical assets only, not global completeness) scripts/data/pipelines-gas.json — 12 critical gas lines: Nord Stream 1/2 (offline; Swedish EEZ sabotage 2022; EU sanctions refs), TurkStream, Yamal–Europe (offline; Polish counter-sanctions), Brotherhood/Soyuz (offline; Ukraine transit expired 2024-12-31), Power of Siberia, Dolphin, Medgaz, TAP, TANAP, Central Asia–China, Langeled. scripts/data/pipelines-oil.json — 12 critical oil lines: Druzhba North/South (N offline per EU 2022/879; S under landlocked derogation), CPC, ESPO (+ price-cap sanction ref), BTC, TAPS, Habshan–Fujairah (Hormuz bypass), Keystone, Kirkuk–Ceyhan (offline since 2023 ICC ruling), Baku–Supsa, Trans-Mountain (TMX expansion May 2024), ESPO spur to Daqing. Scope note: 75+ each is Week 2b work via GEM bulk import. Today's cut is curated from first-hand operator disclosures + regulator filings so I can stand behind every evidence field. ## Evidence-based schema (not conclusion labels) Per docs/methodology/pipelines.mdx: no bare `sanctions_blocked` field. Every pipeline carries an evidence bundle with `physicalState`, `physicalStateSource`, `operatorStatement`, `commercialState`, `sanctionRefs[]`, `lastEvidenceUpdate`, `classifierVersion`, `classifierConfidence`. The public badge (`flowing\|reduced\|offline\| disputed`) is derived server-side from this bundle at read time. ## Seeder scripts/seed-pipelines.mjs — single process publishes BOTH keys (energy:pipelines:{gas,oil}:v1) via two runSeed() calls. Tiny datasets (<20KB each) so co-location is cheap and guarantees classifierVersion consistency. Conventions followed (worldmonitor-bootstrap-registration skill): - TTL 21d = 3× weekly cadence (gold-standard per feedback_seeder_gold_standard.md) - maxStaleMin 20_160 = 2× cadence (health-maxstalemin-write-cadence skill) - sourceVersion + schemaVersion + recordCount + declareRecords wired (seed-contract-foundation) - Zero-case explicitly NOT allowed — MIN_PIPELINES_PER_REGISTRY=8 floor ## Health registration (dual, per feedback_two_health_endpoints_must_match) - api/health.js: BOOTSTRAP_KEYS adds pipelinesGas + pipelinesOil; SEED_META adds both with maxStaleMin=20_160. - api/seed-health.js: mirror entries with intervalMin=10_080 (maxStaleMin/2). ## Bundle registration scripts/seed-bundle-energy-sources.mjs adds a single Pipelines entry (not two) because seed-pipelines.mjs publishes both keys in one run — listing oil separately would double-execute. Monitoring of the oil key staleness happens in api/health.js instead. ## Tests (tests/pipelines-registry.test.mts) 17 passing node:test assertions covering: - Schema validation (both registries pass validateRegistry) - Identity resolution (no id collisions, id matches object key) - Country ISO2 normalization (from/to/transit all match /^[A-Z]{2}$/) - Endpoint geometry within Earth bounds - Evidence rigor: non-flowing badges require at least one supporting evidence source (operator statement / sanctionRefs / ais-relay / satellite / press) - ClassifierConfidence in 0..1 - Commodity/capacity pairing (gas uses capacityBcmYr, oil uses capacityMbd — mixing = test fail) - validateRegistry rejects: empty object, null, no-evidence fixtures, below-floor counts Typecheck clean (both tsconfig.json and tsconfig.api.json). Next: Day 7 will add list-pipelines / get-pipeline-detail RPCs in supply-chain/v1. Day 8 ships PipelineStatusPanel with DeckGL PathLayer consuming the registry. * fix(energy): split seed-pipelines.mjs into two entry points — runSeed hard-exits High finding from PR review. scripts/seed-pipelines.mjs called runSeed() twice in one process and awaited Promise.all. But runSeed() in scripts/_seed-utils.mjs hard-exits via process.exit on ~9 terminal paths (lines 816, 820, 839, 888, 917, 989, plus fetch-retry 946, fatal 859, skipped-lock 81). The first runSeed to reach any terminal path exits the entire node process, so the second runSeed's resolve never fires — only one of energy:pipelines:{gas,oil}:v1 would ever be written. Since the bundle scheduled seed-pipelines.mjs exactly once, and both api/health.js and api/seed-health.js expect both keys populated, the other registry would stay permanently EMPTY/STALE after deploy. Fix: split into two entry-point scripts around a shared utility. - scripts/_pipeline-registry.mjs (NEW, was seed-pipelines.mjs) — shared helpers ONLY. Exports GAS_CANONICAL_KEY, OIL_CANONICAL_KEY, PIPELINES_TTL_SECONDS, MAX_STALE_MIN, buildGasPayload, buildOilPayload, validateRegistry, recordCount, declareRecords. Underscore prefix marks it as non-entry-point (matches _seed-utils.mjs / _seed-envelope-source.mjs convention). - scripts/seed-pipelines-gas.mjs (NEW) — imports from the shared module, single runSeed('energy','pipelines-gas',…) call. - scripts/seed-pipelines-oil.mjs (NEW) — same shape, oil. - scripts/seed-bundle-energy-sources.mjs — register BOTH seeders (not one). - scripts/seed-pipelines.mjs — deleted. - tests/pipelines-registry.test.mts — update import path to the shared module. All 17 tests still pass. Typecheck clean (both configs). Tests pass. No other consumers import from the deleted script. * fix(energy): complete pipeline bootstrap registration per 4-file checklist High finding from PR review. My earlier PR description claimed worldmonitor-bootstrap-registration was complete, but I only touched two of the four registries (api/health.js + api/seed-health.js). The bootstrap hydration payload itself (api/bootstrap.js) and the shared cache-keys registry (server/_shared/cache-keys.ts) still had no entry for either pipeline key, so any consumer that reads bootstrap data would see pipelinesGas/pipelinesOil as missing on first load. Files updated this commit: - api/bootstrap.js — KEYS map + SLOW_KEYS set both gain pipelinesGas + pipelinesOil. Placed next to sprPolicies (same curated-registry cadence and tier). Slow tier is correct: weekly cron, not needed on first paint. - server/_shared/cache-keys.ts — PIPELINES_GAS_KEY + PIPELINES_OIL_KEY exported constants (matches SPR_POLICIES_KEY pattern), BOOTSTRAP_KEYS map entries, and BOOTSTRAP_TIERS entries (both 'slow'). Not touched (intentional): - server/gateway.ts — pipeline data is free-tier per the Energy Atlas plan; no PREMIUM_RPC_PATHS entry required. Energy Atlas monetization hooks (scenario runner, MCP tools, subscriptions) are Release 2. Full 4-file checklist now complete: ✅ server/_shared/cache-keys.ts (this commit) ✅ api/bootstrap.js (this commit) ✅ api/health.js (earlier in PR) ✅ api/seed-health.js (earlier in PR — dual-registry rule) Typecheck clean (both configs). * feat(energy): ListPipelines + GetPipelineDetail RPCs with evidence-derived badges Day 7 of the Energy Atlas Release 1 plan (Week 2). Exposes the pipeline registries (shipped in Day 6) via two supply-chain RPCs and ships the evidence-to-badge derivation server-side. ## Proto proto/worldmonitor/supply_chain/v1/list_pipelines.proto — new: - ListPipelinesRequest { commodity_type?: 'gas' \| 'oil' } - ListPipelinesResponse { pipelines[], fetched_at, classifier_version, upstream_unavailable } - GetPipelineDetailRequest { pipeline_id (required, query-param) } - GetPipelineDetailResponse { pipeline?, revisions[], fetched_at, unavailable } - PipelineEntry — wire shape mirroring scripts/data/pipelines-{gas,oil}.json + a server-derived public_badge field - PipelineEvidence, OperatorStatement, SanctionRef, LatLon, PipelineRevisionEntry service.proto adds both rpc methods with HTTP_METHOD_GET + path bindings: /api/supply-chain/v1/list-pipelines /api/supply-chain/v1/get-pipeline-detail `make generate` regenerated src/generated/{client,server}/… + docs/api/ OpenAPI json/yaml. ## Evidence-derivation server/worldmonitor/supply-chain/v1/_pipeline-evidence.ts — new. derivePublicBadge(evidence) → 'flowing' \| 'reduced' \| 'offline' \| 'disputed' is deterministic + versioned (DERIVER_VERSION='badge-deriver-v1'). Rules (first match wins): 1. offline + sanctionRef OR expired/suspended commercial → offline 2. offline + operator statement → offline 3. offline + only press/ais/satellite → disputed (single-source negative claim) 4. reduced → reduced 5. flowing → flowing 6. unknown / malformed → disputed Staleness guard: non-flowing badges on >14d-old evidence demote to disputed. Flowing is the optimistic default — stale "still flowing" is safer than stale "offline". Matches seed-pipelines-{gas,oil}.mjs maxStaleMin. Tests (tests/pipeline-evidence-derivation.test.mts) — 15 passing cases covering happy paths, disputed fallbacks, staleness guard, versioning. ## Handlers server/worldmonitor/supply-chain/v1/list-pipelines.ts - Reads energy:pipelines:{gas,oil}:v1 via getCachedJson. - projectPipeline() narrows the Upstash `unknown` into PipelineEntry shape + calls derivePublicBadge. - Honors commodity_type filter (skip the opposite registry's Redis read when the client pre-filters). - Returns upstream_unavailable=true when BOTH registries miss. server/worldmonitor/supply-chain/v1/get-pipeline-detail.ts - Scans both registries by id (ids are globally unique per tests/pipelines-registry.test.mts). - Empty revisions[] for now; auto-revision log wires up in Week 3. handler.ts registers both into supplyChainHandler. ## Gateway server/gateway.ts adds 'static' cache-tier for both new RPC paths (registry is slow-moving; 'static' matches the other read-mostly supply-chain endpoints). ## Consumer wiring Not in this commit — PipelineStatusPanel (Day 8) is what will call listPipelines/getPipelineDetail via the generated client. pipelinesGas + pipelinesOil stay in PENDING_CONSUMERS until Day 8. Typecheck clean (both configs). 15 new tests + 17 registry tests all pass. * feat(energy): PipelineStatusPanel — evidence-backed status table + drawer Day 8 of the Energy Atlas Release 1 plan. First consumer of the Day 6–7 registries + RPCs. ## What this PR adds - src/components/PipelineStatusPanel.ts — new panel (id=pipeline-status). * Bootstrap-hydrates from pipelinesGas + pipelinesOil for instant first paint; falls through to listPipelines() RPC if bootstrap misses. Background re-fetch runs on every render so a classifier-version bump between bootstrap stamp and first view produces a visible update. * Table rows sorted non-flowing-first (offline / reduced / disputed before flowing) — what an atlas reader cares about. * Click-to-expand drawer calls getPipelineDetail() lazily — operator statements, sanction refs (with clickable source URLs), commercial state, classifier version + confidence %, capacity + route metadata. * publicBadge color-chip palette matches the methodology doc. * Attribution footer with GEM (CC-BY 4.0) credit + classifier version. - src/components/index.ts — barrel export. - src/app/panel-layout.ts — import + createPanel('pipeline-status', …). - src/config/panels.ts — ENERGY_PANELS adds 'pipeline-status' at priority 1. ## PENDING_CONSUMERS cleanup tests/bootstrap.test.mjs — removes 'pipelinesGas' + 'pipelinesOil' from the allowlist. The invariant "every bootstrap key has a getHydratedData consumer" now enforces real wiring for these keys: the panel literally calls getHydratedData('pipelinesGas') and getHydratedData('pipelinesOil'). Future regressions that remove the consumer will fail pre-push. ## Consumer contract verified - 67 tests pass including bootstrap.test.mjs consumer coverage check. - Typecheck clean. - No DeckGL PathLayer in this commit — existing 'pipelines-layer' has a separate data source, so modifying DeckGLMap.ts to overlay evidence- derived badges on the map is a follow-up commit to avoid clobbering. ## Out of scope for Day 8 (next steps on same PR) - DeckGL PathLayer integration (color pipelines on the main map by publicBadge, click-to-open this drawer) — Day 8b commit. - Storage facility registry + StorageFacilityMapPanel — Days 9-10. * fix(energy): PipelineStatusPanel bootstrap path — client-side badge derivation High finding from PR review. The Day-8 panel crashed on first paint whenever bootstrap hydration succeeded, because: - Bootstrap hydrates raw scripts/data/pipelines-{gas,oil}.json verbatim. - That JSON does NOT include publicBadge — that field is only added by the server handler's projectPipeline() in list-pipelines.ts. - PipelineStatusPanel passed raw entries into badgeChip(), which called badgeLabel(undefined).charAt(0) → TypeError. The background RPC refresh that would have repaired the data never ran because the panel threw before reaching it. So the exact bootstrap path newly wired in commit `6b01fa537` was broken for the new panel. Fix: move the evidence→badge deriver to src/shared/pipeline-evidence.ts so the client panel and the server handler run the identical function on identical inputs. Panel projects raw bootstrap JSON through the shared deriver client-side, producing the same publicBadge the RPC would have returned. No UI flicker on hydration because pre- and post-RPC badges match exactly (same function, same input). ## Changes - src/shared/pipeline-evidence.ts (NEW) — pure deriver with duck-typed PipelineEvidenceInput (no generated-type dependency, so both client and server assign their proto-typed evidence bundles by structural subtyping). Exports derivePipelinePublicBadge + version + type. - server/worldmonitor/supply-chain/v1/_pipeline-evidence.ts — now a thin re-export of the shared module under its older name so in-handler imports keep working without a sweep. - src/components/PipelineStatusPanel.ts: * Imports derivePipelinePublicBadge from @/shared/pipeline-evidence. * NEW projectRawPipeline() defensively coerces every field from unknown → PipelineEntry shape, mirroring the server projection. * buildBootstrapResponse now routes every raw entry through the projection before returning, so the wire-format PipelineEntry[] the renderer receives always has publicBadge populated. * badgeChip() gained a null-guard fallback to 'disputed' — belt + braces so even if a future caller passes an undefined, the UI renders safely instead of throwing. * BootstrapRegistry renamed RawBootstrapRegistry with a comment explaining why the seeder ships raw JSON (not wire format). ## Regression tests tests/pipeline-panel-bootstrap.test.mts (NEW) — 6 tests that exercise the bootstrap-first-paint path end-to-end: - Every gas + oil curated entry produces a valid badge. - Raw entries never ship with pre-computed publicBadge (contract guard on the seed data format). - Deriver never throws on undefined/null/{} evidence (was the crash). - Nord Stream 1 regression check (offline + paperwork → offline). - Druzhba-South staleness behavior (reduced when fresh, disputed after 60 days without update). 38/38 tests now pass (17 registry + 15 deriver + 6 new bootstrap-path). Typecheck clean on both configs. ## Invariant preserved The server handler and the panel render identical badges because: 1. Same pure function (imported from the same module). 2. Same deterministic rules, same staleness window. 3. Same bootstrap data read by both paths (Redis → either bootstrap payload or RPC response). No UI flicker on hydration. * fix(energy): three PR-review P2s on PipelineStatusPanel + aggregators ## P2-1 — sanitizeUrl on external evidence links (XSS hardening) Sanction-ref URLs and operator-statement URLs were interpolated with escapeHtml only. HTML-escaping blocks tag injection but NOT javascript: or data: URL schemes, so a bad URL in the seeded registry would execute in-app when a reader clicked the evidence link. Every other panel in the codebase (NewsPanel, GdeltIntelPanel, GeoHubsPanel, AirlineIntelPanel, MonitorPanel) uses sanitizeUrl for this exact reason. Fix: import sanitizeUrl from @/utils/sanitize and route both hrefs through it. sanitizeUrl() drops non-http(s) schemes + returns '' on invalid URLs. The renderer now suppresses the <a> entirely when sanitize rejects — the date label still renders as plain text instead of becoming an executable link. ## P2-2 — loadDetail catch path missing stale-response guard The success path at loadDetail() checked `this.selectedId !== pipelineId` to suppress stale responses when the user has clicked another pipeline mid-flight. The catch path at line 219 had no such guard: if the user clicked A, then B, and A's request failed before B resolved, A's error handler cleared detailLoading and detail, showing "Pipeline detail unavailable" for B's drawer even though B was still loading. Fix: mirror the same `if (this.selectedId !== pipelineId) return` guard in the catch path. The newer request now owns the drawer state regardless of which path (success OR failure) the older one took. ## P2-3 — always-gas-preference aggregator for classifierVersion + fetchedAt Three call sites (list-pipelines.ts handler, get-pipeline-detail.ts handler, PipelineStatusPanel bootstrap projection) computed aggregate classifier version and fetchedAt by `gas?.x \|\| oil?.x \|\| fallback`. That was defensible when a single seed-pipelines.mjs wrote both keys atomically (fix commit 29b4ac78f split this into two separate Railway cron entry points). Now gas + oil cron independently, so mixed-version (gas=v1, oil=v2 during classifier rollout) and mixed-timestamp (oil refreshed 6h after gas) windows are the EXPECTED state, not the exceptional one. The comment in list-pipelines.ts even said "pick the newest classifier version" but the code didn't actually compare. Fix: add two shared helpers in src/shared/pipeline-evidence.ts — - pickNewerClassifierVersion(a,b) — parses /^v(\\d+)$/ and returns the higher-numbered version; falls back to lexicographic for non-v- prefixed values; handles single-missing inputs. - pickNewerIsoTimestamp(a,b) — Date.parse()-compares and returns the later ISO; handles missing / malformed inputs gracefully. Both server RPCs and the panel bootstrap projection now call these helpers identically, so clients are told the truth about version + freshness during partial rollouts. ## Tests Extended tests/pipeline-evidence-derivation.test.mts with 8 new assertions covering both pickers: - Higher v-number wins regardless of order (v1 vs v2 → v2 both ways) - Single-missing falls back to the one present - Missing + missing → default 'v1' for version / '' for ts - Non-v-numbered values fall back to lexicographic - Explicit regression: "gas=v1 + oil=v2 during rollout" returns v2 - Explicit regression: "oil fresher than gas" returns the oil timestamp 38 → 46 tests. All pass. Typecheck clean on both configs. * feat(energy): DeckGL PathLayer colored by evidence-derived badge + map↔panel link Day 8b of the Energy Atlas plan. Pipelines now render on the main DeckGL map of the energy variant colored by their derived publicBadge, and clicking a pipeline on the map opens the same evidence drawer the panel row-click opens. ## Why this commit Day 8 shipped the PipelineStatusPanel as a table + drawer view. Reviewer flag notwithstanding (fixed in `149d33ec3` + `db52965cd`), a table-only pipeline view is a weak product compared to the map-centric atlas it's meant to rival. The map-layer differentiation is the whole point of the feature. ## What this adds src/components/DeckGLMap.ts: - New createEnergyPipelinesLayer() — reads hydrated pipeline registries via getHydratedData, projects raw JSON through the shared deriver (src/shared/pipeline-evidence.ts), renders a DeckGL PathLayer colored by publicBadge: flowing → green (46,204,113) reduced → amber (243,156,18) offline → red (231,76,60) disputed → purple (155,89,182) Offline + disputed get thicker strokes (3px vs 2px) for at-a-glance surfacing of disrupted assets. Geometry comes from raw startPoint + waypoints[] + endPoint per asset (straight line when no waypoints). - Branching at line ~1498: SITE_VARIANT === 'energy' routes to the new method; other variants keep the static PIPELINES config (colored by oil/gas type). Existing commodity/finance/full map layers are untouched — no cross-variant leakage. - onClick handler emits `energy:open-pipeline-detail` as a window CustomEvent with { pipelineId }. Loose coupling: the map doesn't import the panel, the panel doesn't import the map. - Fallback: if bootstrap hasn't hydrated yet, createEnergyPipelinesLayer falls back to the static createPipelinesLayer() so the pipelines toggle always shows something. src/components/PipelineStatusPanel.ts: - Constructor registers a window event listener for 'energy:open-pipeline-detail' → calls this.loadDetail(pipelineId) → drawer opens on the clicked asset. Map click and row click converge on the same drawer, same evidence view. - destroy() removes the listener to prevent ghost handlers after panel unmount. ## Guarantees - Bootstrap parity: the DeckGL layer calls the SAME derivePipelinePublicBadge as the panel and the server handler, so the map color, the table row chip, and the RPC response all agree on the badge. No flicker, no drift, no confused user. - Variant isolation: only SITE_VARIANT === 'energy' triggers the new path. Commodity / finance / full map layers untouched. - No cross-component import: the panel doesn't reference the map class and vice versa. The event contract is the only coupling — testable, swappable, tauri-safe (guarded with `typeof window !== 'undefined'`). Typecheck clean. PR #3294 now has 8 commits. Follow-up backlog: - Add waypoints[] to the curated pipelines-{gas,oil}.json so the map draws real routes instead of straight lines (cosmetic; does not affect correctness). - Tooltip case in the picking tooltip registry (line ~3748) so hover shows "Nord Stream 1 · OFFLINE" before click. * fix(energy): three PR-review findings on Day 8b DeckGL integration ## P1 — getHydratedData single-use race between map + panel src/services/bootstrap.ts:34 — `if (val !== undefined) hydrationCache.delete(key);` The helper drains its slot on first read. Day 8 (PipelineStatusPanel) and Day 8b (createEnergyPipelinesLayer) BOTH call getHydratedData('pipelinesGas') and getHydratedData('pipelinesOil') — whoever renders first drains the cache and forces the loser onto its fallback path (panel → RPC, map → static PIPELINES layer). The commit's "shared bootstrap-backed data" guarantee did not actually hold. Fix: new src/shared/pipeline-registry-store.ts that reads once and memoizes. Both consumers read through getCachedPipelineRegistries() — same data, same reference, unlimited re-reads. When the panel's background RPC fetch lands, it calls setCachedPipelineRegistries() to back-propagate fresh data into the store so the map's next re-render sees the newer classifierVersion + fetchedAt too (no map/panel drift during classifier rollouts). Test-only injection hook (__setBootstrapReaderForTests) makes the drain-once semantics observable without a real bootstrap payload. ## P2 — pipelines-layer tooltip regresses to blank label on energy variant src/components/DeckGLMap.ts:3748 (pipelines-layer tooltip case) still assumed the static-config shape (obj.type). The new energy layer emits objects with commodityType + badge fields, so the tooltip's type-ternary fell through to the generic fallback — hover rendered " pipeline" (empty leading commodity) instead of "Nord Stream 1 · OFFLINE". Fix: differentiate by presence of obj.badge (only the energy layer sets it). On the energy variant, tooltip now reads name + commodity + badge. Static- config variants (commodity / finance / full) keep their existing format unchanged. ## P2 — createEnergyPipelinesLayer dropped highlightedAssets behavior The static createPipelinesLayer() reads this.highlightedAssets.pipeline and threads it into getColor / getWidth with an updateTrigger on the signature. Any caller using flashAssets('pipeline', [...]) or highlightAssets([...]) gets a visible red-outline flash on the matching paths. My Day 8b energy layer ignored the set entirely — those APIs silently no-op'd on the energy variant. Fix: createEnergyPipelinesLayer() now reads the same highlight set, applies HIGHLIGHT_COLOR + wider stroke to matching IDs, and wires updateTriggers: { getColor: sig, getWidth: sig } so DeckGL actually recomputes when the set changes. Also removed the unnecessary layerCache.set() in the energy path: the store can update via RPC back-propagation, and a cache keyed only on highlight-signature would serve stale data. With ~25 critical-asset pipelines, rebuild per render is trivial. ## Tests tests/pipeline-registry-store.test.mts (NEW) — 5 tests covering the drain-once read-many invariant: multiple consumers get cached data without re-draining, RPC back-propagation updates the source, partial updates preserve the other commodity, and pure RPC-first (no bootstrap) works without invoking the reader. All 51 PR tests pass. Typecheck clean on both configs. * feat(energy): Day 9 — storage facility registry (UGS + SPR + LNG + crude hubs) Ships 21 critical strategic storage facilities as a curated registry, same evidence-bundle pattern as the pipeline registries in Day 7/8: - scripts/data/storage-facilities.json — 4 UGS + 4 SPR + 6 LNG export + 3 LNG import + 4 crude tank farms. Each carries physicalState + sanctionRefs + classifierVersion/Confidence + fillDisclosed/fillSource. - scripts/_storage-facility-registry.mjs — shared helpers (validator, builder, canonical key, MAX_STALE_MIN). Validator enforces facility-type × capacity-unit pairing (ugs→TWh, spr/tank-farm→Mb, LNG→Mtpa) and the non-operational badge ⇒ evidence invariant. - scripts/seed-storage-facilities.mjs — single runSeed entry (only one key, so no split-seeder dance needed). - Registered in the 4-file bootstrap checklist: cache-keys.ts (STORAGE_FACILITIES_KEY + BOOTSTRAP_CACHE_KEYS + BOOTSTRAP_TIERS), api/bootstrap.js (KEYS + SLOW_KEYS), api/health.js (BOOTSTRAP_KEYS + SEED_META, 14d threshold = 2× weekly cron), api/seed-health.js (mirror). - tests/bootstrap.test.mjs PENDING_CONSUMERS adds storageFacilities — Day 10 StorageFacilityMapPanel will remove it. - tests/storage-facilities-registry.test.mts — 20 tests covering schema, identity, geometry, type×capacity pairing, evidence contract, and negative-input validator rejection. Registry fields are slow-moving; badge derivation happens at read-time server-side once the RPC handler lands in Day 10 (panel + deckGL ScatterplotLayer). Seeded data is live in Redis from this commit so the Day 10 PR only adds display surfaces. Tests: 56 pass (36 prior + 20 new). Typecheck + typecheck:api clean. * feat(energy): Day 10 — storage atlas (ListStorageFacilities RPC + DeckGL ScatterplotLayer + panel) End-to-end wiring for the strategic storage registry seeded in Day 9. Same pattern as the pipeline shipping path (Days 7+8+8b): proto → handler → shared evidence deriver → panel → DeckGL map layer, with a shared read-once store keeping map + panel aligned. Proto + generated code: - list_storage_facilities.proto: ListStorageFacilities + GetStorageFacilityDetail messages with StorageFacilityEntry, StorageEvidence, StorageSanctionRef, StorageOperatorStatement, StorageLatLon, StorageFacilityRevisionEntry. - service.proto wires both RPCs under /api/supply-chain/v1. - make generate → regenerated client + server stubs + OpenAPI. Server handlers: - src/shared/storage-evidence.ts: shared pure deriver. Duck-typed input interface avoids generated-type deps; identical rules to the pipeline deriver (sanction/commercial paperwork vs external-signal-only offline, 14d staleness window, version pin). - _storage-evidence.ts: thin re-export for server handler import ergonomics. - list-storage-facilities.ts: reads STORAGE_FACILITIES_KEY from Upstash, projects raw → wire format, attaches derived publicBadge, filters by optional facilityType query arg. - get-storage-facility-detail.ts: single-asset lookup for drawer. - handler.ts registers both new methods. - gateway.ts: both routes → 'static' cache tier (registry is near-static). Panel + map: - src/shared/storage-facility-registry-store.ts: drain-once memo mirroring pipeline-registry-store. Both panel and DeckGL layer read through this so the single-use getHydratedData drain doesn't race between consumers. RPC back-propagation via setCachedStorageFacilityRegistry() keeps map ↔ panel on the same classifierVersion during rollouts. - StorageFacilityMapPanel.ts: table + evidence drawer. Bootstrap hot path projects raw registry through same deriver as server so first-paint badge matches post-RPC badge (no flicker). sanitizeUrl + stale-response guards (success + catch paths) carried over from PipelineStatusPanel. - DeckGLMap.ts createEnergyStorageLayer(): ScatterplotLayer keyed on badge color; log-scale radius (6km–26km) keeps Rehden visible next to Ras Laffan. Click dispatches 'energy:open-storage-facility-detail' — panel listens and opens its drawer (loose coupling, no direct refs). - Tooltip branch on storage-facilities-layer shows facility type, country, capacity unit, and badge. - Added 'storageFacilities' optional field to MapLayers type (optional so existing variant literals across commodity/finance/tech/happy/full/etc. don't need touching). Wired into LAYER_REGISTRY + VARIANT_LAYER_ORDER.energy + ENERGY_MAP_LAYERS + ENERGY_MOBILE_MAP_LAYERS. Panel entry added to ENERGY_PANELS + panel-layout createPanel. PENDING_CONSUMERS entry from Day 9 removed — panel + map layer are now real consumers. Tests: - storage-evidence-derivation.test.mts (17 tests): covers every curated facility yields a valid badge, null/malformed input never throws, offline sanction/commercial/operator rules, external-signal-only offline → disputed, staleness demotion. - storage-facility-registry-store.test.mts (4 tests): drain-once, no-data drain, RPC update, pure-RPC-first path. All 6,426 unit tests pass. Typecheck + typecheck:api clean. Pre-existing src-tauri/sidecar/ test failure is unrelated (no diff touches src-tauri/). * feat(energy): Day 11 — fuel-shortage registry schema + seed + RPC (classifier post-launch) Ships v1 of the global fuel-shortage alert registry. Severity is the CLASSIFIER OUTPUT (confirmed/watch), not a client derivation — we ship the evidence alongside so readers can audit the grounds. v1 is seeded from curated JSON; post-launch the proactive-intelligence classifier (Day 12 work) extends the same key directly. Data: - scripts/data/fuel-shortages.json — 15 known active shortages (PK, LK, NG×2, CU, VE, LB, ZW, AR, IR, BO, KE, PA, EG, BY) spanning petrol/diesel/jet across confirmed + watch tiers. Each entry carries evidenceSources[] (regulator/operator/press), firstSeen, lastConfirmed, resolvedAt, impactTypes[], causeChain[], classifier version + confidence. Confirmed severity enforces authoritative evidence at schema level. Seeder: - scripts/_fuel-shortage-registry.mjs — shared validator (enforces iso2 country, enum products/severities/impacts/causes, authoritative evidence for confirmed). MIN_SHORTAGES=10. - scripts/seed-fuel-shortages.mjs — single runSeed entry. - Registered in seed-bundle-energy-sources.mjs at DAY cadence (shortages move faster than registry assets). Bootstrap 4-file registration: - cache-keys.ts: FUEL_SHORTAGES_KEY + BOOTSTRAP_CACHE_KEYS + BOOTSTRAP_TIERS. - api/bootstrap.js: KEYS + SLOW_KEYS. - api/health.js: BOOTSTRAP_KEYS + SEED_META (2880min = 2× daily cron). - api/seed-health.js: mirrors intervalMin=1440. Proto + RPC: - list_fuel_shortages.proto: ListFuelShortages (country/product/severity query facets) + GetFuelShortageDetail messages with FuelShortageEntry, FuelShortageEvidence, FuelShortageEvidenceSource. - service.proto wires both new RPCs under /api/supply-chain/v1. - list-fuel-shortages.ts handler projects raw → wire format, supports server-side country/product/severity filtering. - get-fuel-shortage-detail.ts single-shortage lookup. - handler.ts registers both. gateway.ts: 'medium' cache-tier (daily classifier updates warrant moderate freshness). Shared evidence helper: - src/shared/shortage-evidence.ts: deriveShortageEvidenceQuality maps (confidence + authoritative-source count + freshness) → 'strong' \| 'moderate' \| 'thin' for client-side sort/trust indicators. Does NOT change severity — classifier owns that decision. - countEvidenceSources buckets sources for the drawer's "n regulator / m press" line. Tests: - tests/fuel-shortages-registry.test.mts (19 tests): schema, identity, enum coverage, evidence contract (confirmed → authoritative source), validateRegistry negative cases. - tests/shortage-evidence.test.mts (10 tests): quality deriver edge cases, source bucketing. - tests/bootstrap.test.mjs PENDING_CONSUMERS adds fuelShortages — FuelShortagePanel arrives Day 12 which will remove the entry. Typecheck + typecheck:api clean. 64 tests pass. * feat(energy): Day 12 — FuelShortagePanel + DeckGL shortage pins End-to-end wiring of the fuel-shortage registry shipped in Day 11: panel on the Energy variant page, ScatterplotLayer pins on the DeckGL map, both reading through a shared single-drain store so they don't race on the bootstrap cache. Panel: - src/components/FuelShortagePanel.ts — table sorted by severity (confirmed first) then evidence quality (strong → thin) then most-recent lastConfirmed. Drawer shows short description, first-seen / last-confirmed / resolved, impact types, cause chain, classifier version/confidence, and a typed evidence-source list with regulator/operator/press chips. sanitizeUrl on every href so classifier-ingested URLs can't render as javascript:. Same stale-response guards on success + catch paths as the other detail drawers. - Consumes deriveShortageEvidenceQuality for client-side trust indicator (three-dot ●●● / ●●○ / ●○○), NOT for severity — severity is classifier output. - Registered in ENERGY_PANELS + panel-layout.ts + components barrel. Shared store: - src/shared/fuel-shortage-registry-store.ts — same drain-once memoize pattern as pipeline- and storage-facility-registry-store. Both the panel and the DeckGL shortage-pins layer read through it. DeckGL layer: - DeckGLMap.createEnergyShortagePinsLayer: ScatterplotLayer placing one pin per active shortage at the country centroid (via getCountryCentroid from services/country-geometry). Stacking offset (~0.8° lon) when multiple shortages share a country so Nigeria's petrol + diesel don't render as a single dot. Confirmed pins 55km radius; watch 38km. Click dispatches 'energy:open-fuel-shortage-detail' — panel listens. - Tooltip branch on fuel-shortages-layer: country · product · short description · severity. - Layer registered in LAYER_REGISTRY, VARIANT_LAYER_ORDER.energy, ENERGY_MAP_LAYERS, ENERGY_MOBILE_MAP_LAYERS. MapLayers.fuelShortages is optional on the type so other variants' literals remain valid. Tests: - tests/fuel-shortage-registry-store.test.mts (4 tests): drain-once, no-data, RPC back-prop, pure-RPC-first path. - tests/bootstrap.test.mjs — fuelShortages removed from PENDING_CONSUMERS. Typecheck + typecheck:api clean. 39 tests pass (plus full suite in pre-push). * feat(energy): Day 13 — energy disruption event log + asset timeline drawer Ships the energy:disruptions:v1 registry that threads together pipelines and storage facilities: state transitions (sabotage, sanction, maintenance, mechanical, weather, commercial, war) keyed by assetId so any asset's drawer can render its history without a second registry lookup. Data + seeder: - scripts/data/energy-disruptions.json — 12 curated events spanning Nord Stream 1/2 sabotage, Druzhba sanctions, CPC force majeure, TurkStream maintenance, Yamal halt, Rehden trusteeship, Arctic LNG 2 sanction, ESPO drone strikes, BTC fire (historical), Sabine Pass Hurricane Beryl, Power of Siberia ramp. Each event links back to a seeded asset. - scripts/_energy-disruption-registry.mjs — validator enforces valid assetType/eventType/cause enums, http(s) sources, startAt ≤ endAt, MIN_EVENTS=8. - scripts/seed-energy-disruptions.mjs — runSeed entry (weekly cron). - Bundle entry at 7×DAY cadence. Bootstrap 4-file registration (cache-keys.ts + bootstrap.js + health.js + seed-health.js) — energyDisruptions in PENDING_CONSUMERS because panel drawers fetch lazily via RPC on drawer-open rather than hydrating from bootstrap directly. Proto + handler: - list_energy_disruptions.proto: ListEnergyDisruptions with assetId / assetType / ongoingOnly query facets. Returns events sorted newest-first. - list-energy-disruptions.ts projects raw → wire format, supports all three query facets. - Registered in handler.ts. gateway.ts: 'medium' cache tier. Shared timeline helper: - src/shared/disruption-timeline.ts — pure formatters (formatEventWindow, formatCapacityOffline, statusForEvent). No generated-type deps so PipelineStatusPanel + StorageFacilityMapPanel import the same helpers and render the timeline identically. Panel integration: - PipelineStatusPanel.loadDetail now fetches getPipelineDetail + listEnergyDisruptions({assetId, assetType:'pipeline'}) in parallel. Drawer gains "Disruption timeline (N)" section with event type, date window, capacity offline, cause chain, and short description per entry. - StorageFacilityMapPanel gets identical treatment with assetType='storage'. - Both reset detailEvents on closeDetail and on fresh click (stale-response safety). Tests: - tests/energy-disruptions-registry.test.mts (17 tests): schema, identity, enum coverage, evidence, negative inputs. - tests/bootstrap.test.mjs — energyDisruptions added to PENDING_CONSUMERS. Typecheck + typecheck:api clean. 51 tests pass locally (plus full suite in pre-push). * feat(energy): Day 14 — country drill-down Atlas exposure section Extends CountryDeepDivePanel's existing "Energy Profile" card with a mini Atlas-exposure section that surfaces per-country exposure to the new registries we shipped in Days 7-13. For each country: - Pipelines touching this country (from, to, or transit) — clickable rows that dispatch 'energy:open-pipeline-detail' so the PipelineStatusPanel drawer opens on the energy variant; no-op on other variants. - Storage facilities in this country — same loose-coupling pattern with 'energy:open-storage-facility-detail'. - Active fuel shortages in this country — severity breakdown line (N confirmed · M watch) plus clickable rows emitting 'energy:open-fuel-shortage-detail'. Silent absence: sections render only when the country has matching assets/events, so countries with no pipeline, storage, or shortage touchpoints see the existing energy-profile card unchanged. Lazy stores: reads go through the same shared drain-once stores (getCachedPipelineRegistries, getCachedStorageFacilityRegistry, getCachedFuelShortageRegistry) so CountryDeepDivePanel does NOT race with Atlas panels over the single-drain bootstrap cache. Dynamic import() keeps the three stores out of the panel's static import graph so non-energy variants can tree-shake them. Typecheck clean. No schema changes; purely additive UI read from already-shipped registries. * docs(energy): methodology page for energy disruption event log Fills the /docs/methodology/disruptions URL referenced by list_energy_disruptions.proto, scripts/_energy-disruption-registry.mjs, and the panel attribution footers. Explains scope (state transitions not daily noise), data shape, what counts as a disruption, classifier evolution path, RPC contract, and ties into the sibling pipeline + storage + shortage methodology pages. No code change; pure docs completion for Week 4 launch polish. * fix(energy): upstreamUnavailable only fires when Redis returned nothing Two handlers (list-storage-facilities + list-pipelines) conflated "empty filter result on a healthy registry" with "upstream unavailable". A caller who queried one facilityType/commodityType and legitimately got zero matches was told the upstream was down — which may push clients to error-state rendering or suppress caching instead of showing a valid empty list. list-storage-facilities.ts — upstreamUnavailable now only fires when `raw` is null (Redis miss). Zero filtered rows on a healthy registry returns upstreamUnavailable: false + empty array. Matches the sibling list-fuel-shortages handler and the wire contract in list_storage_facilities.proto. list-pipelines.ts — same bug, subtler shape. Now checks "requested at least one side AND received nothing" rather than "zero rows after collection". A filter that legitimately matches no gas/oil pipelines on a healthy registry now returns upstreamUnavailable: false. list-energy-disruptions.ts and list-fuel-shortages.ts already had the correct shape (only flag unavailable when raw is missing) — left as-is. Typecheck + typecheck:api clean. No tests added: the existing registry schema tests cover the projection/filter helpers, and the handler-level gating change is documented in code comments for future audits. * fix(energy): three Greptile findings on PR #3294 Two P1 filter bugs (resolved shortages rendered as active) and one P2 contract inconsistency on the disruptions handler. P1: DeckGLMap createEnergyShortagePinsLayer rendered every shortage in the registry as an active crisis pin — including entries where the classifier has written resolvedAt to mark the crisis over. Added a filter so only entries with a null/empty resolvedAt become map pins. Curated v1 data has resolvedAt=null everywhere so no visible change today, but the moment the classifier starts writing resolutions post-launch, resolved shortages would have appeared as ongoing. P1: CountryDeepDivePanel renderAtlasExposure had the same bug in the country drill-down — "N confirmed · M watch" counts included resolved entries, inflating the active-crisis line per country. Same one-line filter fix. P2: list-energy-disruptions.ts gated upstreamUnavailable on `!raw?.events` — a partial write (top-level object present but `events` property missing) fired the "upstream down" flag, inconsistent with the sibling handlers (list-pipelines, list-storage-facilities, list-fuel-shortages) that only fire on `!raw`. Rewrote to match: `!raw` → upstreamUnavailable, empty events → normal empty list. This also aligns with the contract documented on the upstream-unavailable- vs-empty-filter skill extracted from the earlier P2 review. Typecheck + typecheck:api clean. All three fixes are one-liner filter or gate changes; no test additions needed (registry tests still pass with v1 data since resolvedAt is null throughout).	2026-04-23 07:34:07 +04:00
Elie Habib	b8615dd106	fix(intelligence): read regional-snapshot latestKey as raw string (#3302 ) * fix(intelligence): read regional-snapshot latestKey as raw string Regional Intelligence panel rendered "No snapshot available yet" for every region despite the 6h cron writing per-region snapshots successfully. Root cause: writer/reader encoding mismatch. Writer (scripts/regional-snapshot/persist-snapshot.mjs:60) stores the snapshot_id pointer via `['SET', latestKey, snapshotId]` — a BARE string, not JSON.stringify'd. The seeder's own reader (line 97) reads it as-is and works. Vercel RPC handler used `getCachedJson(latestKey, true)`, which internally does `JSON.parse(data.result)`. `JSON.parse('mena-20260421T000000-steady')` throws; the try/catch silently returns null; handler returns {}; panel renders empty. Fix: new `getCachedRawString()` helper in server/_shared/redis.ts that reads a Redis key as-is with no JSON.parse. Handler uses it for the latestKey read (while still using getCachedJson for the snapshot-by-id payload, which IS JSON.stringify'd). No writer or backfill change needed. Regression guard: new structural test asserts the handler reads latestKey specifically via getCachedRawString so a future refactor can't silently revert to getCachedJson and re-break every region. Health.js monitors the summary key (intelligence:regional-snapshots: summary:v1), which stays green because summary writes succeed. Per-region health probes would be worth adding as a follow-up. * fix(redis): detect AbortSignal.timeout() as TimeoutError too Greptile P2 on PR #3302: AbortSignal.timeout() throws a DOMException with name='TimeoutError' on V8 runtimes (Vercel Edge included). The existing isTimeout check only matched name==='AbortError' — what you'd get from a manual controller.abort() — so the [REDIS-TIMEOUT] structured log never fired. Every redis fetch timeout silently fell through to the generic console.warn branch, eroding the Sentry drain we added specifically to catch these per docs/plans/chokepoint-rpc-payload-split.md. Fix both getCachedJson (pre-existing) and the new getCachedRawString by matching TimeoutError OR AbortError — covers both the current AbortSignal.timeout() path and any future switch to manual AbortController. Pre-existing copy in getCachedJson fixed in the same edit since it's the same file and the same observability hole. * test(redis): update isTimeout regex to match new TimeoutError\|AbortError check Pre-push hook caught the brittle static-analysis test in tests/get-chokepoint-history.test.mjs:83 that asserted the exact old single-name pattern. Update the regex (and description) to cover both TimeoutError and AbortError, matching the observability fix in the previous commit.	2026-04-22 22:58:31 +04:00
Sebastien Melki	58e42aadf9	chore(api): enforce sebuf contract + migrate drifting endpoints (#3207 ) (#3242 ) * chore(api): enforce sebuf contract via exceptions manifest (#3207) Adds api/api-route-exceptions.json as the single source of truth for non-proto /api/ endpoints, with scripts/enforce-sebuf-api-contract.mjs gating every PR via npm run lint:api-contract. Fixes the root-only blind spot in the prior allowlist (tests/edge-functions.test.mjs), which only scanned top-level .js files and missed nested paths and .ts endpoints — the gap that let api/supply-chain/v1/country-products.ts and friends drift under proto domain URL prefixes unchallenged. Checks both directions: every api/<domain>/v<N>/[rpc].ts must pair with a generated service_server.ts (so a deleted proto fails CI), and every generated service must have an HTTP gateway (no orphaned generated code). Manifest entries require category + reason + owner, with removal_issue mandatory for temporary categories (deferred, migration-pending) and forbidden for permanent ones. .github/CODEOWNERS pins the manifest to @SebastienMelki so new exceptions don't slip through review. The manifest only shrinks: migration-pending entries (19 today) will be removed as subsequent commits in this PR land each migration. refactor(maritime): migrate /api/ais-snapshot → maritime/v1.GetVesselSnapshot (#3207) The proto VesselSnapshot was carrying density + disruptions but the frontend also needed sequence, relay status, and candidate_reports to drive the position-callback system. Those only lived on the raw relay passthrough, so the client had to keep hitting /api/ais-snapshot whenever callbacks were registered and fall back to the proto RPC only when the relay URL was gone. This commit pushes all three missing fields through the proto contract and collapses the dual-fetch-path into one proto client call. Proto changes (proto/worldmonitor/maritime/v1/): - VesselSnapshot gains sequence, status, candidate_reports. - GetVesselSnapshotRequest gains include_candidates (query: include_candidates). Handler (server/worldmonitor/maritime/v1/get-vessel-snapshot.ts): - Forwards include_candidates to ?candidates=... on the relay. - Separate 5-min in-memory caches for the candidates=on and candidates=off variants; they have very different payload sizes and should not share a slot. - Per-request in-flight dedup preserved per-variant. Frontend (src/services/maritime/index.ts): - fetchSnapshotPayload now calls MaritimeServiceClient.getVesselSnapshot directly with includeCandidates threaded through. The raw-relay path, SNAPSHOT_PROXY_URL, DIRECT_RAILWAY_SNAPSHOT_URL and LOCAL_SNAPSHOT_FALLBACK are gone — production already routed via Vercel, the "direct" branch only ever fired on localhost, and the proto gateway covers both. - New toLegacyCandidateReport helper mirrors toDensityZone/toDisruptionEvent. api/ais-snapshot.js deleted; manifest entry removed. Only reduced the codegen scope to worldmonitor.maritime.v1 (buf generate --path) — regenerating the full tree drops // @ts-nocheck from every client/server file and surfaces pre-existing type errors across 30+ unrelated services, which is not in scope for this PR. Shape-diff vs legacy payload: - disruptions / density: proto carries the same fields, just with the GeoCoordinates wrapper and enum strings (remapped client-side via existing toDisruptionEvent / toDensityZone helpers). - sequence, status.{connected,vessels,messages}: now populated from the proto response — was hardcoded to 0/false in the prior proto fallback. - candidateReports: same shape; optional numeric fields come through as 0 instead of undefined, which the legacy consumer already handled. * refactor(sanctions): migrate /api/sanctions-entity-search → LookupSanctionEntity (#3207) The proto docstring already claimed "OFAC + OpenSanctions" coverage but the handler only fuzzy-matched a local OFAC Redis index — narrower than the legacy /api/sanctions-entity-search, which proxied OpenSanctions live (the source advertised in docs/api-proxies.mdx). Deleting the legacy without expanding the handler would have been a silent coverage regression for external consumers. Handler changes (server/worldmonitor/sanctions/v1/lookup-entity.ts): - Primary path: live search against api.opensanctions.org/search/default with an 8s timeout and the same User-Agent the legacy edge fn used. - Fallback path: the existing OFAC local fuzzy match, kept intact for when OpenSanctions is unreachable / rate-limiting. - Response source field flips between 'opensanctions' (happy path) and 'ofac' (fallback) so clients can tell which index answered. - Query validation tightened: rejects q > 200 chars (matches legacy cap). Rate limiting: - Added /api/sanctions/v1/lookup-entity to ENDPOINT_RATE_POLICIES at 30/min per IP — matches the legacy createIpRateLimiter budget. The gateway already enforces per-endpoint policies via checkEndpointRateLimit. Docs: - docs/api-proxies.mdx — dropped the /api/sanctions-entity-search row (plus the orphaned /api/ais-snapshot row left over from the previous commit in this PR). - docs/panels/sanctions-pressure.mdx — points at the new RPC URL and describes the OpenSanctions-primary / OFAC-fallback semantics. api/sanctions-entity-search.js deleted; manifest entry removed. * refactor(military): migrate /api/military-flights → ListMilitaryFlights (#3207) Legacy /api/military-flights read a pre-baked Redis blob written by the seed-military-flights cron and returned flights in a flat app-friendly shape (lat/lon, lowercase enums, lastSeenMs). The proto RPC takes a bbox, fetches OpenSky live, classifies server-side, and returns nested GeoCoordinates + MILITARY__TYPE_ enum strings + lastSeenAt — same data, different contract. fetchFromRedis in src/services/military-flights.ts was doing nothing sebuf-aware. Renamed it to fetchViaProto and rewrote to: - Instantiate MilitaryServiceClient against getRpcBaseUrl(). - Iterate MILITARY_QUERY_REGIONS (PACIFIC + WESTERN) in parallel — same regions the desktop OpenSky path and the seed cron already use, so dashboard coverage tracks the analytic pipeline. - Dedup by hexCode across regions. - Map proto → app shape via new mapProtoFlight helper plus three reverse enum maps (AIRCRAFT_TYPE_REVERSE, OPERATOR_REVERSE, CONFIDENCE_REVERSE). The seed cron (scripts/seed-military-flights.mjs) stays put: it feeds regional-snapshot mobility, cross-source signals, correlation, and the health freshness check (api/health.js: 'military:flights:v1'). None of those read the legacy HTTP endpoint; they read the Redis key directly. The proto handler uses its own per-bbox cache keys under the same prefix, so dashboard traffic no longer races the seed cron's blob — the two paths diverge by a small refresh lag, which is acceptable. Docs: dropped the /api/military-flights row from docs/api-proxies.mdx. api/military-flights.js deleted; manifest entry removed. Shape-diff vs legacy: - f.location.{latitude,longitude} → f.lat, f.lon - f.aircraftType: MILITARY_AIRCRAFT_TYPE_TANKER → 'tanker' via reverse map - f.operator: MILITARY_OPERATOR_USAF → 'usaf' via reverse map - f.confidence: MILITARY_CONFIDENCE_LOW → 'low' via reverse map - f.lastSeenAt (number) → f.lastSeen (Date) - f.enrichment → f.enriched (with field renames) - Extra fields registration / aircraftModel / origin / destination / firstSeenAt now flow through where proto populates them. * fix(supply-chain): thread includeCandidates through chokepoint status (#3207) Caught by tsconfig.api.json typecheck in the pre-push hook (not covered by the plain tsc --noEmit run that ran before I pushed the ais-snapshot commit). The chokepoint status handler calls getVesselSnapshot internally with a static no-auth request — now required to include the new includeCandidates bool from the proto extension. Passing false: server-internal callers don't need per-vessel reports. * test(maritime): update getVesselSnapshot cache assertions (#3207) The ais-snapshot migration replaced the single cachedSnapshot/cacheTimestamp pair with a per-variant cache so candidates-on and candidates-off payloads don't evict each other. Pre-push hook surfaced that tests/server-handlers still asserted the old variable names. Rewriting the assertions to match the new shape while preserving the invariants they actually guard: - Freshness check against slot TTL. - Cache read before relay call. - Per-slot in-flight dedup. - Stale-serve on relay failure (result ?? slot.snapshot). * chore(proto): restore // @ts-nocheck on regenerated maritime files (#3207) I ran 'buf generate --path worldmonitor/maritime/v1' to scope the proto regen to the one service I was changing (to avoid the toolchain drift that drops @ts-nocheck from 60+ unrelated files — separate issue). But the repo convention is the 'make generate' target, which runs buf and then sed-prepends '// @ts-nocheck' to every generated .ts file. My scoped command skipped the sed step. The proto-check CI enforces the sed output, so the two maritime files need the directive restored. * refactor(enrichment): decomm /api/enrichment/{company,signals} legacy edge fns (#3207) Both endpoints were already ported to IntelligenceService: - getCompanyEnrichment (/api/intelligence/v1/get-company-enrichment) - listCompanySignals (/api/intelligence/v1/list-company-signals) No frontend callers of the legacy /api/enrichment/* paths exist. Removes: - api/enrichment/company.js, signals.js, _domain.js - api-route-exceptions.json migration-pending entries (58 remain) - docs/api-proxies.mdx rows for /api/enrichment/{company,signals} - docs/architecture.mdx reference updated to the IntelligenceService RPCs Verified: typecheck, typecheck:api, lint:api-contract (89 files / 58 entries), lint:boundaries, tests/edge-functions.test.mjs (136 pass), tests/enrichment-caching.test.mjs (14 pass — still guards the intelligence/v1 handlers), make generate is zero-diff. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * refactor(leads): migrate /api/{contact,register-interest} → LeadsService (#3207) New leads/v1 sebuf service with two POST RPCs: - SubmitContact → /api/leads/v1/submit-contact - RegisterInterest → /api/leads/v1/register-interest Handler logic ported 1:1 from api/contact.js + api/register-interest.js: - Turnstile verification (desktop sources bypass, preserved) - Honeypot (website field) silently accepts without upstream calls - Free-email-domain gate on SubmitContact (422 ApiError) - validateEmail (disposable/offensive/typo-TLD/MX) on RegisterInterest - Convex writes via ConvexHttpClient (contactMessages:submit, registerInterest:register) - Resend notification + confirmation emails (HTML templates unchanged) Shared helpers moved to server/_shared/: - turnstile.ts (getClientIp + verifyTurnstile) - email-validation.ts (disposable/offensive/MX checks) Rate limits preserved via ENDPOINT_RATE_POLICIES: - submit-contact: 3/hour per IP (was in-memory 3/hr) - register-interest: 5/hour per IP (was in-memory 5/hr; desktop sources previously capped at 2/hr via shared in-memory map — now 5/hr like everyone else, accepting the small regression in exchange for Upstash-backed global limiting) Callers updated: - pro-test/src/App.tsx contact form → new submit-contact path - src-tauri/sidecar/local-api-server.mjs cloud-fallback rewrites /api/register-interest → /api/leads/v1/register-interest when proxying; keeps local path for older desktop builds - src/services/runtime.ts isKeyFreeApiTarget allows both old and new paths through the WORLDMONITOR_API_KEY-optional gate Tests: - tests/contact-handler.test.mjs rewritten to call submitContact handler directly; asserts on ValidationError / ApiError - tests/email-validation.test.mjs + tests/turnstile.test.mjs point at the new server/_shared/ modules Deleted: api/contact.js, api/register-interest.js, api/_ip-rate-limit.js, api/_turnstile.js, api/_email-validation.js, api/_turnstile.test.mjs. Manifest entries removed (58 → 56). Docs updated (api-platform, api-commerce, usage-rate-limits). Verified: npm run typecheck + typecheck:api + lint:api-contract (88 files / 56 entries) + lint:boundaries pass; full test:data (5852 tests) passes; make generate is zero-diff. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * chore(pro-test): rebuild bundle for leads/v1 contact form (#3207) Updates the enterprise contact form to POST to /api/leads/v1/submit-contact (old path /api/contact removed in the previous commit). Bundle is rebuilt from pro-test/src/App.tsx source change in `9ccd309d`. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(review): address HIGH review findings 1-3 (#3207) Three review findings from @koala73 on the sebuf-migration PR, all silent bugs that would have shipped to prod: ### 1. Sanctions rate-limit policy was dead code ENDPOINT_RATE_POLICIES keyed the 30/min budget under /api/sanctions/v1/lookup-entity, but the generated route (from the proto RPC LookupSanctionEntity) is /api/sanctions/v1/lookup-sanction-entity. hasEndpointRatePolicy / getEndpointRatelimit are exact-string pathname lookups, so the mismatch meant the endpoint fell through to the generic 600/min global limiter instead of the advertised 30/min. Net effect: the live OpenSanctions proxy endpoint (unauthenticated, external upstream) had 20x the intended rate budget. Fixed by renaming the policy key to match the generated route. ### 2. Lost stale-seed fallback on military-flights Legacy api/military-flights.js cascaded military:flights:v1 → military:flights:stale:v1 before returning empty. The new proto handler went straight to live OpenSky/relay and returned null on miss. Relay or OpenSky hiccup used to serve stale seeded data (24h TTL); under the new handler it showed an empty map. Both keys are still written by scripts/seed-military-flights.mjs on every run — fix just reads the stale key when the live fetch returns null, converts the seed's app-shape flights (flat lat/lon, lowercase enums, lastSeenMs) to the proto shape (nested GeoCoordinates, enum strings, lastSeenAt), and filters to the request bbox. Read via getRawJson (unprefixed) to match the seed cron's writes, which bypass the env-prefix system. ### 3. Hex-code casing mismatch broke getFlightByHex The seed cron writes hexCode: icao24.toUpperCase() (uppercase); src/services/military-flights.ts:getFlightByHex uppercases the lookup input: f.hexCode === hexCode.toUpperCase(). The new proto handler preserved OpenSky's lowercase icao24, and mapProtoFlight is a pass-through. getFlightByHex was silently returning undefined for every call after the migration. Fix: uppercase in the proto handler (live + stale paths), and document the invariant in a comment on MilitaryFlight.hex_code in military_flight.proto so future handlers don't re-break it. ### Verified - typecheck + typecheck:api clean - lint:api-contract (56 entries) / lint:boundaries clean - tests/edge-functions.test.mjs 130 pass - make generate zero-diff (openapi spec regenerated for proto comment) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(review): restore desktop 2/hr rate cap on register-interest (#3207) Addresses HIGH review finding #4 from @koala73. The legacy api/register-interest.js applied a nested 2/hr per-IP cap when `source === 'desktop-settings'`, on top of the generic 5/hr endpoint budget. The sebuf migration lost this — desktop-source requests now enjoy the full 5/hr cap. Since `source` is an unsigned client-supplied field, anyone sending `source: 'desktop-settings'` skips Turnstile AND gets 5/hr. Without the tighter cap the Turnstile bypass is cheaper to abuse. Added `checkScopedRateLimit` to `server/_shared/rate-limit.ts` — a reusable second-stage Upstash limiter keyed on an opaque scope string + caller identifier. Fail-open on Redis errors to match existing checkRateLimit / checkEndpointRateLimit semantics. Handlers that need per-subscope caps on top of the gateway-level endpoint budget use this helper. In register-interest: when `isDesktopSource`, call checkScopedRateLimit with scope `/api/leads/v1/register-interest#desktop`, limit=2, window=1h, IP as identifier. On exceeded → throw ApiError(429). ### What this does not fix This caps the blast radius of the Turnstile bypass but does not close it — an attacker sending `source: 'desktop-settings'` still skips Turnstile (just at 2/hr instead of 5/hr). The proper fix is a signed desktop-secret header that authenticates the bypass; filed as follow-up #3252. That requires coordinated Tauri build + Vercel env changes out of scope for #3207. ### Verified - typecheck + typecheck:api clean - lint:api-contract (56 entries) - tests/edge-functions.test.mjs + contact-handler.test.mjs (147 pass) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(review): MEDIUM + LOW + rate-limit-policy CI check (#3207) Closes out the remaining @koala73 review findings from #3242 that didn't already land in the HIGH-fix commits, plus the requested CI check that would have caught HIGH #1 (dead-code policy key) at review time. ### MEDIUM #5 — Turnstile missing-secret policy default Flip `verifyTurnstile`'s default `missingSecretPolicy` from `'allow'` to `'allow-in-development'`. Dev with no secret = pass (expected local); prod with no secret = reject + log. submit-contact was already explicitly overriding to `'allow-in-development'`; register-interest was silently getting `'allow'`. Safe default now means a future missing-secret misconfiguration in prod gets caught instead of silently letting bots through. Removed the now-redundant override in submit-contact. ### MEDIUM #6 — Silent enum fallbacks in maritime client `toDisruptionEvent` mapped `AIS_DISRUPTION_TYPE_UNSPECIFIED` / unknown enum values → `gap_spike` / `low` silently. Refactored to return null when either enum is unknown; caller filters nulls out of the array. Handler doesn't produce UNSPECIFIED today, but the `gap_spike` default would have mislabeled the first new enum value the proto ever adds — dropping unknowns is safer than shipping wrong labels. ### LOW — Copy drift in register-interest email Email template hardcoded `435+ Sources`; PR #3241 bumped marketing to `500+`. Bumped in the rewritten file to stay consistent. The `as any` on Convex mutation names carried over from legacy and filed as follow-up #3253. ### Rate-limit-policy coverage lint `scripts/enforce-rate-limit-policies.mjs` validates every key in `ENDPOINT_RATE_POLICIES` resolves to a proto-generated gateway route by cross-referencing `docs/api/.openapi.yaml`. Fails with the sanctions-entity-search incident referenced in the error message so future drift has a paper trail. Wired into package.json (`lint:rate-limit-policies`) and the pre-push hook alongside `lint:boundaries`. Smoke-tested both directions — clean repo passes (5 policies / 175 routes), seeded drift (the exact HIGH #1 typo) fails with the advertised remedy text. ### Verified - `lint:rate-limit-policies` ✓ - `typecheck` + `typecheck:api` ✓ - `lint:api-contract` ✓ (56 entries) - `lint:boundaries` ✓ - edge-functions + contact-handler tests (147 pass) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> refactor(commit 5): decomm /api/eia/* + migrate /api/satellites → IntelligenceService (#3207) Both targets turned out to be decomm-not-migration cases. The original plan called for two new services (economic/v1.GetEiaSeries + natural/v1.ListSatellitePositions) but research found neither was needed: ### /api/eia/[[...path]].js — pure decomm, zero consumers The "catch-all" is a misnomer — only two paths actually worked, /api/eia/health and /api/eia/petroleum, both Redis-only readers. Zero frontend callers in src/. Zero server-side readers. Nothing consumes the `energy:eia-petroleum:v1` key that seed-eia-petroleum.mjs writes daily. The EIA data the frontend actually uses goes through existing typed RPCs in economic/v1: GetEnergyPrices, GetCrudeInventories, GetNatGasStorage, GetEnergyCapacity. None of those touch /api/eia/. Building GetEiaSeries would have been dead code. Deleted the legacy file + its test (tests/api-eia-petroleum.test.mjs — it only covered the legacy endpoint, no behavior to preserve). Empty api/eia/ dir removed. Note for review:* the Redis seed cron keeps running daily and nothing consumes it. If that stays unused, seed-eia-petroleum.mjs should be retired too (separate PR). Out of scope for sebuf-migration. ### /api/satellites.js — Learning #2 strikes again IntelligenceService.ListSatellites already exists at /api/intelligence/v1/list-satellites, reads the same Redis key (intelligence:satellites:tle:v1), and supports an optional country filter the legacy didn't have. One frontend caller in src/services/satellites.ts needed to switch from `fetch(toApiUrl('/api/satellites'))` to the typed IntelligenceServiceClient.listSatellites. Shape diff was tiny — legacy `noradId` became proto `id` (handler line 36 already picks either), everything else identical. alt/velocity/inclination in the proto are ignored by the caller since it propagates positions client-side via satellite.js. Kept the client-side cache + failure cooldown + 20s timeout (still valid concerns at the caller level). ### Manifest + docs - api-route-exceptions.json: 56 → 54 entries (both removed) - docs/api-proxies.mdx: dropped the two rows from the Raw-data passthroughs table ### Verified - typecheck + typecheck:api ✓ - lint:api-contract (54 entries) / lint:boundaries / lint:rate-limit-policies ✓ - tests/edge-functions.test.mjs 127 pass (down from 130 — 3 tests were for the deleted eia endpoint) - make generate zero-diff (no proto changes) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * refactor(commit 6): migrate /api/supply-chain/v1/{country-products,multi-sector-cost-shock} → SupplyChainService (#3207) Both endpoints were hand-rolled TS handlers sitting under a proto URL prefix — the exact drift the manifest guardrail flagged. Promoted both to typed RPCs: - GetCountryProducts → /api/supply-chain/v1/get-country-products - GetMultiSectorCostShock → /api/supply-chain/v1/get-multi-sector-cost-shock Handlers preserve the existing semantics: PRO-gate via isCallerPremium(ctx.request), iso2 / chokepointId validation, raw bilateral-hs4 Redis read (skip env-prefix to match seeder writes), CHOKEPOINT_STATUS_KEY for war-risk tier, and the math from _multi-sector-shock.ts unchanged. Empty-data and non-PRO paths return the typed empty payload (no 403 — the sebuf gateway pattern is empty-payload-on-deny). Client wrapper switches from premiumFetch to client.getCountryProducts/ client.getMultiSectorCostShock. Legacy MultiSectorShock / MultiSectorShockResponse / CountryProductsResponse names remain as type aliases of the generated proto types so CountryBriefPanel + CountryDeepDivePanel callsites compile with zero churn. Manifest 54 → 52. Rate-limit gateway routes 175 → 177. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(gateway): add cache-tier entries for new supply-chain RPCs (#3207) Pre-push tests/route-cache-tier.test.mjs caught the missing entries. Both PRO-gated, request-varying — match the existing supply-chain PRO cohort (get-country-cost-shock, get-bypass-options, etc.) at slow-browser tier. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * refactor(commit 7): migrate /api/scenario/v1/{run,status,templates} → ScenarioService (#3207) Promote the three literal-filename scenario endpoints to a typed sebuf service with three RPCs: POST /api/scenario/v1/run-scenario (RunScenario) GET /api/scenario/v1/get-scenario-status (GetScenarioStatus) GET /api/scenario/v1/list-scenario-templates (ListScenarioTemplates) Preserves all security invariants from the legacy handlers: - 405 for wrong method (sebuf service-config method gate) - scenarioId validation against SCENARIO_TEMPLATES registry - iso2 regex ^[A-Z]{2}$ - JOB_ID_RE path-traversal guard on status - Per-IP 10/min rate limit (moved to gateway ENDPOINT_RATE_POLICIES) - Queue-depth backpressure (>100 → 429) - PRO gating via isCallerPremium - AbortSignal.timeout on every Redis pipeline (runRedisPipeline helper) Wire-level diffs vs legacy: - Per-user RL now enforced at the gateway (same 10/min/IP budget). - Rate-limit response omits Retry-After header; retryAfter is in the body per error-mapper.ts convention. - ListScenarioTemplates emits affectedHs2: [] when the registry entry is null (all-sectors sentinel); proto repeated cannot carry null. - RunScenario returns { jobId, status } (no statusUrl field — unused by SupplyChainPanel, drop from wire). Gateway wiring: - server/gateway.ts RPC_CACHE_TIER: list-scenario-templates → 'daily' (matches legacy max-age=3600); get-scenario-status → 'slow-browser' (premium short-circuit target, explicit entry required by tests/route-cache-tier.test.mjs). - src/shared/premium-paths.ts: swap old run/status for the new run-scenario/get-scenario-status paths. - api/scenario/v1/{run,status,templates}.ts deleted; 3 manifest exceptions removed (63 → 52 → 49 migration-pending). Client: - src/services/scenario/index.ts — typed client wrapper using premiumFetch (injects Clerk bearer / API key). - src/components/SupplyChainPanel.ts — polling loop swapped from premiumFetch strings to runScenario/getScenarioStatus. Hard 20s timeout on run preserved via AbortSignal.any. Tests: - tests/scenario-handler.test.mjs — 18 new handler-level tests covering every security invariant + the worker envelope coercion. - tests/edge-functions.test.mjs — scenario sections removed, replaced with a breadcrumb pointer to the new test file. Docs: api-scenarios.mdx, scenario-engine.mdx, usage-rate-limits.mdx, usage-errors.mdx, supply-chain.mdx refreshed with new paths. Verified: typecheck, typecheck:api, lint:api-contract (49 entries), lint:rate-limit-policies (6/180), lint:boundaries, route-cache-tier (parity), full edge-functions (117) + scenario-handler (18). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * refactor(commit 8): migrate /api/v2/shipping/{route-intelligence,webhooks} → ShippingV2Service (#3207) Partner-facing endpoints promoted to a typed sebuf service. Wire shape preserved byte-for-byte (camelCase field names, ISO-8601 fetchedAt, the same subscriberId/secret formats, the same SET + SADD + EXPIRE 30-day Redis pipeline). Partner URLs /api/v2/shipping/* are unchanged. RPCs landed: - GET /route-intelligence → RouteIntelligence (PRO, slow-browser) - POST /webhooks → RegisterWebhook (PRO) - GET /webhooks → ListWebhooks (PRO, slow-browser) The existing path-parameter URLs remain on the legacy edge-function layout because sebuf's HTTP annotations don't currently model path params (grep proto/*/.proto for `path: "{…}"` returns zero). Those endpoints are split into two Vercel dynamic-route files under api/v2/shipping/webhooks/, behaviorally identical to the previous hybrid file but cleanly separated: - GET /webhooks/{subscriberId} → [subscriberId].ts - POST /webhooks/{subscriberId}/rotate-secret → [subscriberId]/[action].ts - POST /webhooks/{subscriberId}/reactivate → [subscriberId]/[action].ts Both get manifest entries under `migration-pending` pointing at #3207. Other changes - scripts/enforce-sebuf-api-contract.mjs: extended GATEWAY_RE to accept api/v{N}/{domain}/[rpc].ts (version-first) alongside the canonical api/{domain}/v{N}/[rpc].ts; first-use of the reversed ordering is shipping/v2 because that's the partner contract. - vite.config.ts: dev-server sebuf interceptor regex extended to match both layouts; shipping/v2 import + allRoutes entry added. - server/gateway.ts: RPC_CACHE_TIER entries for /api/v2/shipping/ route-intelligence + /webhooks (slow-browser; premium-gated endpoints short-circuit to slow-browser but the entries are required by tests/route-cache-tier.test.mjs). - src/shared/premium-paths.ts: route-intelligence + webhooks added. - tests/shipping-v2-handler.test.mjs: 18 handler-level tests covering PRO gate, iso2/cargoType/hs2 coercion, SSRF guards (http://, RFC1918, cloud metadata, IMDS), chokepoint whitelist, alertThreshold range, secret/subscriberId format, pipeline shape + 30-day TTL, cross-tenant owner isolation, `secret` omission from list response. Manifest delta - Removed: api/v2/shipping/route-intelligence.ts, api/v2/shipping/webhooks.ts - Added: api/v2/shipping/webhooks/[subscriberId].ts (migration-pending) - Added: api/v2/shipping/webhooks/[subscriberId]/[action].ts (migration-pending) - Added: api/internal/brief-why-matters.ts (internal-helper) — regression surface from the #3248 main merge, which introduced the file without a manifest entry. Filed here to keep the lint green; not strictly in scope for commit 8 but unblocking. Net result: 49 → 47 `migration-pending` entries (one net-removal even though webhook path-params stay pending, because two files collapsed into two dynamic routes). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(review HIGH 1): SupplyChainServiceClient must use premiumFetch (#3207) Signed-in browser pro users were silently hitting 401 on 8 supply-chain premium endpoints (country-products, multi-sector-cost-shock, country-chokepoint-index, bypass-options, country-cost-shock, sector-dependency, route-explorer-lane, route-impact). The shared client was constructed with globalThis.fetch, so no Clerk bearer or X-WorldMonitor-Key was injected. The gateway's validateApiKey runs with forceKey=true for PREMIUM_RPC_PATHS and 401s before isCallerPremium is consulted. The generated client's try/catch collapses the 401 into an empty-fallback return, leaving panels blank with no visible error. Fix is one line at the client constructor: swap globalThis.fetch for premiumFetch. The same pattern is already in use for insider-transactions, stock-analysis, stock-backtest, scenario, trade (premiumClient) — this was an omission on this client, not a new pattern. premiumFetch no-ops safely when no credentials are available, so the 5 non-premium methods on this client (shippingRates, chokepointStatus, chokepointHistory, criticalMinerals, shippingStress) continue to work unchanged. This also fixes two panels that were pre-existing latently broken on main (chokepoint-index, bypass-options, etc. — predating #3207, not regressions from it). Commit 6 expanded the surface by routing two more methods through the same buggy client; this commit fixes the class. From koala73 review (#3242 second-pass, HIGH new #1): > Exact class PR #3233 fixed for RegionalIntelligenceBoard / > DeductionPanel / trade / country-intel. Supply-chain was not in > #3233's scope. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(review HIGH 2): restore 400 on input-shape errors for 2 supply-chain handlers (#3207) Commit 6 collapsed all non-happy paths into empty-200 on `get-country-products` and `get-multi-sector-cost-shock`, including caller-bug cases that legacy returned 400 for: - get-country-products: malformed iso2 → empty 200 (was 400) - get-multi-sector-cost-shock: malformed iso2 / missing chokepointId / unknown chokepointId → empty 200 (was 400) The commit message for 6 called out the 403-for-non-pro → empty-200 shift ("sebuf gateway pattern is empty-payload-on-deny") but not the 400 shift. They're different classes: - Empty-payload-200 for PRO-deny: intentional contract change, already documented and applied across the service. Generated clients treat "you lack PRO" as "no data" — fine. - Empty-payload-200 for malformed input: caller bug silently masked. External API consumers can't distinguish "bad wiring" from "genuinely no data", test harnesses lose the signal, bad calling code doesn't surface in Sentry. Fix: `throw new ValidationError(violations)` on the 3 input-shape branches. The generated sebuf server maps ValidationError → HTTP 400 (see src/generated/server/.../service_server.ts and leads/v1 which already uses this pattern). PRO-gate deny stays as empty-200 — that contract shift was intentional and is preserved. Regression tests added at tests/supply-chain-validation.test.mjs (8 cases) pinning the three-way contract: - bad input → 400 (ValidationError) - PRO-gate deny on valid input → 200 empty - valid PRO input, no data in Redis → 200 empty (unchanged) From koala73 review (#3242 second-pass, HIGH new #2). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(review HIGH 3): restore statusUrl on RunScenarioResponse + document 202→200 wire break (#3207) Commit 7 silently shifted /api/scenario/v1/run-scenario's response contract in two ways that the commit message covered only partially: 1. HTTP 202 Accepted → HTTP 200 OK 2. Dropped `statusUrl` string from the response body The `statusUrl` drop was mentioned as "unused by SupplyChainPanel" but not framed as a contract change. The 202 → 200 shift was not mentioned at all. This is a same-version (v1 → v1) migration, so external callers that key off either signal — `response.status === 202` or `response.body.statusUrl` — silently branch incorrectly. Evaluated options: (a) sebuf per-RPC status-code config — not available. sebuf's HttpConfig only models `path` and `method`; no status annotation. (b) Bump to scenario/v2 — judged heavier than the break itself for a single status-code shift. No in-repo caller uses 202 or statusUrl; the docs-level impact is containable. (c) Accept the break, document explicitly, partially restore. Took option (c): - Restored `statusUrl` in the proto (new field `string status_url = 3` on RunScenarioResponse). Server computes `/api/scenario/v1/get-scenario-status?jobId=<encoded job_id>` and populates it on every successful enqueue. External callers that followed this URL keep working unchanged. - 202 → 200 is not recoverable inside the sebuf generator, so it is called out explicitly in two places: - docs/api-scenarios.mdx now includes a prominent `<Warning>` block documenting the v1→v1 contract shift + the suggested migration (branch on response body shape, not HTTP status). - RunScenarioResponse proto comment explains why 200 is the new success status on enqueue. OpenAPI bundle regenerated to reflect the restored statusUrl field. - Regression test added in tests/scenario-handler.test.mjs pinning `statusUrl` to the exact URL-encoded shape — locks the invariant so a future proto rename or handler refactor can't silently drop it again. From koala73 review (#3242 second-pass, HIGH new #3). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(review HIGH 1/2): close webhook tenant-isolation gap on shipping/v2 (#3207) Koala flagged this as a merge blocker in PR #3242 review. server/worldmonitor/shipping/v2/{register-webhook,list-webhooks}.ts migrated without reinstating validateApiKey(req, { forceKey: true }), diverging from both the sibling api/v2/shipping/webhooks/[subscriberId] routes and the documented "X-WorldMonitor-Key required" contract in docs/api-shipping-v2.mdx. Attack surface: the gateway accepts Clerk bearer auth as a pro signal. A Clerk-authenticated pro user with no X-WorldMonitor-Key reaches the handler, callerFingerprint() falls back to 'anon', and every such caller collapses into a shared webhook:owner:anon:v1 bucket. The defense-in-depth ownerTag !== ownerHash check in list-webhooks.ts doesn't catch it because both sides equal 'anon' — every Clerk-session holder could enumerate / overwrite every other Clerk-session pro tenant's registered webhook URLs. Fix: reinstate validateApiKey(ctx.request, { forceKey: true }) at the top of each handler, throwing ApiError(401) when absent. Matches the sibling routes exactly and the published partner contract. Tests: - tests/shipping-v2-handler.test.mjs: two existing "non-PRO → 403" tests for register/list were using makeCtx() with no key, which now fails at the 401 layer first. Renamed to "no API key → 401 (tenant-isolation gate)" with a comment explaining the failure mode being tested. 18/18 pass. Verified: typecheck:api, lint:api-contract (no change), lint:boundaries, lint:rate-limit-policies, test:data (6005/6005). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(review HIGH 2/2): restore v1 path aliases on scenario + supply-chain (#3207) Koala flagged this as a merge blocker in PR #3242 review. Commits 6 + 7 of #3207 renamed five documented v1 URLs to the sebuf method-derived paths and deleted the legacy edge-function files: POST /api/scenario/v1/run → run-scenario GET /api/scenario/v1/status → get-scenario-status GET /api/scenario/v1/templates → list-scenario-templates GET /api/supply-chain/v1/country-products → get-country-products GET /api/supply-chain/v1/multi-sector-cost-shock → get-multi-sector-cost-shock server/router.ts is an exact static-match table (Map keyed on `METHOD PATH`), so any external caller — docs, partner scripts, grep-the- internet — hitting the old documented URL would 404 on first request after merge. Commit 8 (shipping/v2) preserved partner URLs byte-for- byte; the scenario + supply-chain renames missed that discipline. Fix: add five thin alias edge functions that rewrite the pathname to the canonical sebuf path and delegate to the domain [rpc].ts gateway via a new server/alias-rewrite.ts helper. Premium gating, rate limits, entitlement checks, and cache-tier lookups all fire on the canonical path — aliases are pure URL rewrites, not a duplicate handler pipeline. api/scenario/v1/{run,status,templates}.ts api/supply-chain/v1/{country-products,multi-sector-cost-shock}.ts Vite dev parity: file-based routing at api/ is a Vercel concern, so the dev middleware (vite.config.ts) gets a matching V1_ALIASES rewrite map before the router dispatch. Manifest: 5 new entries under `deferred` with removal_issue=#3282 (tracking their retirement at the next v1→v2 break). lint:api-contract stays green (89 files checked, 55 manifest entries validated). Docs: - docs/api-scenarios.mdx: migration callout at the top with the full old→new URL table and a link to the retirement issue. - CHANGELOG.md + docs/changelog.mdx: Changed entry documenting the rename + alias compat + the 202→200 shift (from commit `23c821a1`). Verified: typecheck:api, lint:api-contract, lint:rate-limit-policies, lint:boundaries, test:data (6005/6005). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-22 09:55:59 +03:00
Elie Habib	502bd4472c	docs(resilience): sync methodology/proto/widget to 6-domain + 3-pillar reality (#3264 ) Brings every user-facing surface into alignment with the live resilience scorer. Zero behavior change: overall_score is still the 6-domain weighted aggregate, schemaVersion is still 2.0 default, and every existing test continues to pass. Surfaces touched: - proto + OpenAPI: rewrote the ResiliencePillar + schema_version descriptions. 2.0 is correctly documented as default; shaped-but-empty language removed. - Widget: added missing recovery: 'Recovery' label (was rendering literal lowercase recovery before), retitled footer data-version chip from Data to Seed date so it is clear the value reflects the static seed bundle not every live input, rewrote help tooltip for 6 domains and 3 pillars and called out the 0.25 recovery weight. - Methodology doc: domains-and-weights table now carries all 6 rows with actual code weights (0.17/0.15/0.11/0.19/0.13/0.25), Recovery section header weight corrected from 1.0 to 0.25, new Pillar-combined score activation (pending) section with the measured Spearman 0.9935, top-5 movers, and the activation checklist. - documentation.mdx + features.mdx: product blurbs updated from 5 domains and 13 dimensions to 6 domains and 19 dimensions grouped into 3 pillars. - Tests: recovery-label regression pin, Seed date label pin, clarified pillar-schema degenerate-input semantics. New scaffolding for defensibility: - docs/snapshots/resilience-ranking-2026-04-21.json frozen published tables artifact with methodology metadata and commit SHA. - docs/snapshots/resilience-pillar-sensitivity-2026-04-21.json live Redis capture (52-country sample) combining sensitivity stability with the current-vs-proposed Spearman comparison. - scripts/freeze-resilience-ranking.mjs refresh script. - scripts/compare-resilience-current-vs-proposed.mjs comparison script. - tests/resilience-ranking-snapshot.test.mts 13 assertions auto discovered from any resilience-ranking-YYYY-MM-DD.json in snapshots. Verification: npm run typecheck:all clean, 390/390 resilience tests pass. Follow-up: pillar-combined score activation. The sensitivity artifact shows rank-preservation Spearman 0.9935 and no ceiling effects, which clears the methodological bar. Blocker is messaging because every country drops ~13 points under the penalty, so activation PR ships with re-anchored release-gate bands, refreshed frozen ranking, and a v2.0 methodology note.	2026-04-21 22:37:27 +04:00
Elie Habib	2f19d96357	feat(brief): route whyMatters through internal analyst-context endpoint (#3248 ) * feat(brief): route whyMatters through internal analyst-context endpoint The brief's "why this is important" callout currently calls Gemini on only {headline, source, threatLevel, category, country} with no live state. The LLM can't know whether a ceasefire is on day 2 or day 50, that IMF flagged >90% gas dependency in UAE/Qatar/Bahrain, or what today's forecasts look like. Output is generic prose instead of the situational analysis WMAnalyst produces when given live context. This PR adds an internal Vercel edge endpoint that reuses a trimmed variant of the analyst context (country-brief, risk scores, top-3 forecasts, macro signals, market data — no GDELT, no digest-search) and ships it through a one-sentence LLM call with the existing WHY_MATTERS_SYSTEM prompt. The endpoint owns its own Upstash cache (v3 prefix, 6h TTL), supports a shadow mode that runs both paths in parallel for offline diffing, and is auth'd via RELAY_SHARED_SECRET. Three-layer graceful degradation (endpoint → legacy Gemini-direct → stub) keeps the brief shipping on any failure. Env knobs: - BRIEF_WHY_MATTERS_PRIMARY=analyst\|gemini (default: analyst; typo → gemini) - BRIEF_WHY_MATTERS_SHADOW=0\|1 (default: 1; only '0' disables) - BRIEF_WHY_MATTERS_SHADOW_SAMPLE_PCT=0..100 (default: 100) - BRIEF_WHY_MATTERS_ENDPOINT_URL (Railway, optional override) Cache keys: - brief:llm:whymatters:v3:{hash16} — envelope {whyMatters, producedBy, at}, 6h TTL. Endpoint-owned. - brief:llm:whymatters:shadow:v1:{hash16} — {analyst, gemini, chosen, at}, 7d TTL. Fire-and-forget. - brief:llm:whymatters:v2:{hash16} — legacy. Cron's fallback path still reads/writes during the rollout window; expires in ≤24h. Tests: 6022 pass (existing 5915 + 12 core + 36 endpoint + misc). typecheck + typecheck:api + biome on changed files clean. Plan (Codex-approved after 4 rounds): docs/plans/2026-04-21-001-feat-brief-why-matters-analyst-endpoint-plan.md * fix(brief): address /ce:review round 1 findings on PR #3248 Fixes 5 findings from multi-agent review, 2 of them P1: - #241 P1: `.gitignore !api/internal/*` was too broad — it re-included `.env`, `.env.local`, and any future secret file dropped into that directory. Narrowed to explicit source extensions (`.ts`, `.js`, `.mjs`) so parent `.env` / secrets rules stay in effect inside api/internal/. - #242 P1: `Dockerfile.digest-notifications` did not COPY `shared/brief-llm-core.js` + `.d.ts`. Cron would have crashed at container start with ERR_MODULE_NOT_FOUND. Added alongside brief-envelope + brief-filter COPY lines. - #243 P2: Cron dropped the endpoint's source/producedBy ground-truth signal, violating PR #3247's own round-3 memory (feedback_gate_on_ground_truth_not_configured_state.md). Added structured log at the call site: `[brief-llm] whyMatters source=<src> producedBy=<pb> hash=<h>`. Endpoint response now includes `hash` so log + shadow-record pairs can be cross-referenced. - #244 P2: Defense-in-depth prompt-injection hardening. Story fields flowed verbatim into both LLM prompts, bypassing the repo's sanitizeForPrompt convention. Added sanitizeStoryFields helper and applied in both analyst and gemini paths. - #245 P2: Removed redundant `validate` option from callLlmReasoning. With only openrouter configured in prod, a parse-reject walked the provider chain, then fell through to the other path (same provider), then the cron's own fallback (same model) — 3x billing on one reject. Post-call parseWhyMatters check already handles rejection cleanly. Deferred to P3 follow-ups (todos 246-248): singleflight, v2 sunset, misc polish (country-normalize LOC, JSDoc pruning, shadow waitUntil, auto-sync mirror, context-assembly caching). Tests: 6022 pass. typecheck + typecheck:api clean. * fix(brief-why-matters): ctx.waitUntil for shadow write + sanitize legacy fallback Two P2 findings on PR #3248: 1. Shadow record was fire-and-forget without ctx.waitUntil on an Edge function. Vercel can terminate the isolate after response return, so the background redisPipeline write completes unreliably — i.e. the rollout-validation signal the shadow keys were supposed to provide was flaky in production. Fix: accept an optional EdgeContext 2nd arg. Build the shadow promise up front (so it starts executing immediately) then register it with ctx.waitUntil when present. Falls back to plain unawaited execution when ctx is absent (local harness / tests). 2. scripts/lib/brief-llm.mjs legacy fallback path called buildWhyMattersPrompt(story) on raw fields with no sanitization. The analyst endpoint sanitizes before its own prompt build, but the fallback is exactly what runs when the endpoint misses / errors — so hostile headlines / sources reached the LLM verbatim on that path. Fix: local sanitizeStoryForPrompt wrapper imports sanitizeForPrompt from server/_shared/llm-sanitize.js (existing pattern — see scripts/seed-digest-notifications.mjs:41). Wraps story fields before buildWhyMattersPrompt. Cache key unchanged (hash is over raw story), so cache parity with the analyst endpoint's v3 entries is preserved. Regression guard: new test asserts the fallback prompt strips "ignore previous instructions", "### Assistant:" line prefixes, and `<\|im_start\|>` tokens when injection-crafted fields arrive. Typecheck + typecheck:api clean. 6023 / 6023 data tests pass. * fix(digest-cron): COPY server/_shared/llm-sanitize into digest-notifications image Reviewer P1 on PR #3248: my previous commit (`4eee22083`) added `import sanitizeForPrompt from server/_shared/llm-sanitize.js` to scripts/lib/brief-llm.mjs, but Dockerfile.digest-notifications cherry- picks server/_shared/* files and doesn't copy llm-sanitize. Import is top-level/static — the container would crash at module load with ERR_MODULE_NOT_FOUND the moment seed-digest-notifications.mjs pulls in scripts/lib/brief-llm.mjs. Not just on fallback — every startup. Fix: add `COPY server/_shared/llm-sanitize.js server/_shared/llm-sanitize.d.ts` next to the existing brief-render COPY line. Module is pure string manipulation with zero transitive imports — nothing else needs to land. Cites feedback_validation_docker_ship_full_scripts_dir.md in the comment next to the COPY; the cherry-pick convention keeps biting when new cross-dir imports land in scripts/lib/ or scripts/shared/. Can't regression-test at build time from this branch without a docker-build CI job, but the symptom is deterministic — local runs remain green (they resolve against the live filesystem); only the container crashes. Post-merge, Railway redeploy of seed-digest- notifications should show a clean `Starting Container` log line instead of the MODULE_NOT_FOUND crash my prior commit would have caused.	2026-04-21 14:03:27 +04:00
Elie Habib	89c179e412	fix(brief): cover greeting was hardcoded "Good evening" regardless of issue time (#3254 ) * fix(brief): cover greeting was hardcoded "Good evening" regardless of issue time Reported: a brief viewed at 13:02 local time showed "Good evening" on the cover (slide 1) but "Good afternoon." on the digest greeting page (slide 2). Cause: `server/_shared/brief-render.js:renderCover` had the string `'Good evening'` hardcoded in the cover's mono-cased salutation slot. The digest greeting page (slide 2) renders the time-of-day-correct value from `envelope.data.digest.greeting`, which is computed by `shared/brief-filter.js:174-179` from `localHour` in the user's TZ (< 12 → morning, < 18 → afternoon, else → evening). So any brief viewed outside the literal evening showed an inconsistent pair. Fix: thread `digest.greeting` into `renderCover`; a small `coverGreeting()` helper strips the trailing period so the cover's no-punctuation mono style is preserved. On unexpected/missing values it falls back to a generic "Hello" rather than silently re-hardcoding a specific time of day. Tests: 5 regression cases in `tests/brief-magazine-render.test.mjs` cover afternoon/morning/evening parity, period stripping, and HTML escape (defense-in-depth). 60 total in that file pass. Full test:data 5921 pass. typecheck + typecheck:api + biome clean. * chore(brief): fix orphaned JSDoc on coverGreeting / renderCover Greptile flagged: the original `renderCover` JSDoc block stayed above `coverGreeting` when the helper was inserted, so the @param shape was misattributed to the wrong function and `renderCover` was left undocumented (plus the new `greeting` field was unlisted). Moved the opts-shape JSDoc to immediately above `renderCover` and added `greeting: string` to the param type. `coverGreeting` keeps its own prose comment. No runtime change.	2026-04-21 13:46:21 +04:00
Elie Habib	14c1314629	fix(scoring): scope "critical" to geopolitical events, not domestic tragedies (#3221 ) The weight rebalance (PR #3144) amplified a prompt gap: domestic mass shootings (e.g. "8 children killed in Louisiana") scored 88 because the LLM classified them as "critical" (mass-casualty 10+ killed) and the 55% severity weight pushed them into the critical gate. But WorldMonitor is a geopolitical monitor — domestic tragedies are terrible but not geopolitically destabilizing. Prompt change (both ais-relay.cjs + classify-event.ts): - "critical" now explicitly requires GEOPOLITICAL scope: "events that destabilize international order, threaten cross-border security, or disrupt global systems" - Domestic mass-casualty events (mass shootings, industrial accidents) moved to "high" — still important, but not critical-sensitivity alerts - Added counterexamples: "8 children killed in mass shooting in Louisiana → domestic mass-casualty → high" and "23 killed in fireworks factory explosion → industrial accident → high" - Retained: "700 killed in Sudan drone strikes → geopolitical mass- casualty in active civil war → critical" Classify cache: v2→v3 (bust stale entries that lack geopolitical scope). Shadow-log: v4→v5 (clean dataset for recalibration under the scoped prompt). 🤖 Generated with Claude Opus 4.6 via Claude Code Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-20 08:40:29 +04:00
Elie Habib	1581b2dd70	fix(scoring): upgrade LLM classifier prompt + keywords + cache v2 (#3218 ) * fix(scoring): upgrade LLM classifier prompt + keywords + cache v2 PR B of the scoring recalibration plan (docs/plans/2026-04-17-002). Builds on PR A (weight rebalance, PR #3144) which achieved Pearson 0.669. This PR targets the remaining noise in the 50-69 band where editorials, tutorials, and domestic crime score alongside real news. LLM prompt upgrade (both writers): - scripts/ais-relay.cjs CLASSIFY_SYSTEM_PROMPT: added per-level guidelines, content-type distinction (editorial/opinion/tutorial → info, domestic crime → info, mass-casualty → critical), and concrete counterexamples. - server/worldmonitor/intelligence/v1/classify-event.ts: same guidelines added to align the second cache writer. Classify cache bump: - classify:sebuf:v1: → classify:sebuf:v2: in all three locations (ais-relay.cjs classifyCacheKey, list-feed-digest.ts enrichWithAiCache, _shared.ts CLASSIFY_CACHE_PREFIX). Old v1 entries expire naturally (24h TTL). All items reclassified within 15 min of Railway deploy. Keyword additions (_classifier.ts): - HIGH: 'sanctions imposed', 'sanctions package', 'new sanctions' (phrase patterns — no false positives on 'sanctioned 10 individuals') - MEDIUM: promoted 'ceasefire' from LOW to reduce prompt/keyword misalignment during cold-cache window Shadow-log v4: - Clean dataset for post-prompt-change recalibration. v3 rolls off via 7-day TTL. Deploy order: Railway first (seedClassify prewarms v2 cache immediately), then Vercel. First ~15 min of v4 may carry stale digest-cached scores. 🤖 Generated with Claude Opus 4.6 via Claude Code + Compound Engineering v2.49.0 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(scoring): align classify-event prompt + remove dead keywords + update v4 docs Review findings on PR #3218: P1: classify-event.ts prompt was missing 2 counterexamples and the "Focus" line present in the relay prompt. Both writers share classify:sebuf:v2 cache, so differing prompts mean nondeterministic classification depending on which path writes first. Now both prompts have identical level guidelines and counterexamples (format differs: array vs single object, but classification logic is aligned). P2: Removed 3 dead phrase-pattern keywords (sanctions imposed/package/ new) — the existing 'sanctions' entry already substring-matches all of them and maps to the same (high, economic). Just dead code that misled readers into thinking they added coverage. P2: Updated stale v3 references in cache-keys.ts (doc block + exported constant) and shadow-score-report.mjs header to v4. --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-20 08:05:13 +04:00
Elie Habib	4853645d53	fix(brief): switch carousel to @vercel/og on edge runtime (#3210 ) * fix(brief): switch carousel to @vercel/og on edge runtime Every attempt to ship the Phase 8 Telegram carousel on Vercel's Node serverless runtime has failed at cold start: - PR #3174 direct satori + @resvg/resvg-wasm: Vercel edge bundler refused the `?url` asset import required by resvg-wasm. - PR #3174 (fix) direct satori + @resvg/resvg-js native binding: Node runtime accepted it, but Vercel's nft tracer does not follow @resvg/resvg-js/js-binding.js's conditional `require('@resvg/resvg-js-<platform>-<arch>-<libc>')` pattern, so the linux-x64-gnu peer package was never bundled. Cold start threw MODULE_NOT_FOUND, isolate crashed, FUNCTION_INVOCATION_FAILED on every request including OPTIONS, and Telegram reported WEBPAGE_CURL_FAILED with no other signal. - PR #3204 added `vercel.json` `functions.includeFiles` to force the binding in, but (a) the initial key was a literal path that Vercel micromatch read as a character class (PR #3206 fixed), (b) even with the corrected `api/brief/carousel/*` wildcard, the function still 500'd across the board. The `functions.includeFiles` path appears honored in the deployment manifest but not at runtime for this particular native-binding pattern. Fix: swap the renderer to @vercel/og's ImageResponse, which is Vercel's first-party wrapper around satori + resvg-wasm with Vercel-native bundling. Runs on Edge runtime — matches every other API route in the project. No native binding, no includeFiles, no nft tracing surprises. Cold start ~300ms, warm ~30ms. Changes: - server/_shared/brief-carousel-render.ts: replace renderCarouselPng (Uint8Array) with renderCarouselImageResponse (ImageResponse). Drop ensureLibs + satori + @resvg/resvg-js dynamic-import dance. Keep layout builders (buildCover/buildThreads/buildStory) and font loading unchanged — the Satori object trees are wire-compatible with ImageResponse. - api/brief/carousel/[userId]/[issueDate]/[page].ts: flip `runtime: 'nodejs'` -> `runtime: 'edge'`. Delegate rendering to the renderer's ImageResponse and return it directly; error path still 503 no-store so CDN + Telegram don't pin a bad render. - vercel.json: drop the now-useless `functions.includeFiles` block. - package.json: drop direct `@resvg/resvg-js` and `satori` deps (both now bundled inside @vercel/og). - tests/deploy-config.test.mjs: replace the native-binding regression guards with an assertion that no `functions` block exists (with a comment pointing at the skill documenting the micromatch gotcha for future routes). - tests/brief-carousel.test.mjs: updated comment references. Verified: - typecheck + typecheck:api clean - test:data 5814/5814 pass - node -e test: @vercel/og imports cleanly in Node (tests that reach through the renderer file no longer depend on native bindings) Post-deploy validation: curl -I -H "User-Agent: TelegramBot (like TwitterBot)" \ "https://www.worldmonitor.app/api/brief/carousel/<uid>/<slot>/0" # Expect: HTTP/2 403 (no token) or 200 (valid token) # NOT: HTTP/2 500 FUNCTION_INVOCATION_FAILED Then tail Railway digest logs on the next tick; the `[digest] Telegram carousel 400 ... WEBPAGE_CURL_FAILED` line should stop appearing, and the 3-image preview should actually land on Telegram. Add renderer smoke test + fix Cache-Control duplication Reviewer flagged residual risk: no dedicated carousel-route smoke test for the @vercel/og path. Adds one, and catches a real bug in the process. Findings during test-writing: 1. @vercel/og's ImageResponse runs CLEANLY in Node via tsx — the comment in brief-carousel.test.mjs saying "we can't test the render in Node" was true for direct satori + @resvg/resvg-wasm but no longer holds after PR #3210. Pure Node render works end-to-end: satori tree-parse, jsdelivr font fetch, resvg-wasm init, PNG output. ~850ms first call, ~20ms warm. 2. ImageResponse sets its own default `Cache-Control: public, immutable, no-transform, max-age=31536000`. Passing Cache-Control via the constructor's headers option APPENDS rather than overrides, producing a duplicated comma-joined value like `public, immutable, no-transform, max-age=31536000, public, max-age=60` on the Response. The route handler was doing exactly this via extraHeaders. Fix: drop our Cache-Control override and rely on @vercel/og's 1-year immutable default — envelope is only immutable for its 7d Redis TTL so the effective ceiling is 7d anyway (after that the route 404s before render). Changes: - tests/brief-carousel.test.mjs: 6 new assertions under `renderCarouselImageResponse`: * renders cover / threads / story pages, each returning a valid PNG (magic bytes + size range) * rejects a structurally empty envelope * threads non-cache extraHeaders onto the Response * pins @vercel/og's Cache-Control default so it survives caller-supplied Cache-Control overrides (regression guard for the bug fixed in this commit) - api/brief/carousel/[userId]/[issueDate]/[page].ts: remove the stacked Cache-Control; lean on @vercel/og default. Drop the now- unused `PAGE_CACHE_TTL` constant. Comment explains why. Verified: - test:data 5820/5820 pass (was 5814, +6 smoke) - typecheck + typecheck:api clean - Render smoke: cover 825ms / threads 23ms / story 16ms first run (wasm init dominates first render)	2026-04-19 15:18:12 +04:00
Elie Habib	38e6892995	fix(brief): per-run slot URL so same-day digests link to distinct briefs (#3205 ) * fix(brief): per-run slot URL so same-day digests link to distinct briefs Digest emails at 8am and 1pm on the same day pointed to byte-identical magazine URLs because the URL was keyed on YYYY-MM-DD in the user tz. Each compose run overwrote the single daily envelope in place, and the composer rolling 24h story window meant afternoon output often looked identical to morning. Readers clicking an older email got whatever the latest cron happened to write. Slot format is now YYYY-MM-DD-HHMM (local tz, per compose run). The magazine URL, carousel URLs, and Redis key all carry the slot, and each digest dispatch gets its own frozen envelope that lives out the 7d TTL. envelope.data.date stays YYYY-MM-DD for rendering "19 April 2026". The digest cron also writes a brief:latest:{userId} pointer (7d TTL, overwritten each compose) so the dashboard panel and share-url endpoint can locate the most recent brief without knowing the slot. The previous date-probing strategy does not work once keys carry HHMM. No back-compat for the old YYYY-MM-DD format: the verifier rejects it, the composer only ever writes the new shape, and any in-flight notifications signed under the old format will 403 on click. Acceptable at the rollout boundary per product decision. * fix(brief): carve middleware bot allowlist to accept slot-format carousel path BRIEF_CAROUSEL_PATH_RE in middleware.ts was still matching only the pre-slot YYYY-MM-DD segment, so every slot-based carousel URL emitted by the digest cron (YYYY-MM-DD-HHMM) would miss the social allowlist and fall into the generic bot gate. Telegram/Slack/Discord/LinkedIn image fetchers would 403 on sendMediaGroup, breaking previews for the new digest links. CI missed this because tests/middleware-bot-gate.test.mts still exercised the old /YYYY-MM-DD/ path shape. Swap the fixture to the slot format and add a regression asserting the pre-slot shape is now rejected, so legacy links cannot silently leak the allowlist after the rollout. * fix(brief): preserve caller-requested slot + correct no-brief share-url error Two contract bugs in the slot rollout that silently misled callers: 1. GET /api/latest-brief?slot=X where X has no envelope was returning { status: 'composing', issueDate: <today UTC> } — which reads as "today's brief is composing" instead of "the specific slot you asked about doesn't exist". A caller probing a known historical slot would get a completely unrelated "today" signal. Now we echo the requested slot back (issueSlot + issueDate derived from its date portion) when the caller supplied ?slot=, and keep the UTC-today placeholder only for the no-param path. 2. POST /api/brief/share-url with no slot and no latest-pointer was falling into the generic invalid_slot_shape 400 branch. That is not an input-shape problem; it is "no brief exists yet for this user". Return 404 brief_not_found — the same code the existing-envelope check returns — so callers get one coherent contract: either the brief exists and is shareable, or it doesn't and you get 404.	2026-04-19 14:15:59 +04:00
Elie Habib	3c47c1b222	fix(supply-chain): split chokepoint transit data + close silent zero-state cache (#3185 ) * fix(supply-chain): split chokepoint transit data + close silent zero-state cache Production supply-chain panel was rendering 13 empty chokepoints because the getChokepointStatus RPC silently cached zero-state for 5 minutes: 1. supply_chain:transit-summaries:v1 grew to ~500 KB (180d × 13 × 14 fields of history per chokepoint). 2. REDIS_OP_TIMEOUT_MS is 1.5 s. Vercel Sydney edge → Upstash for a 500 KB GET consistently exceeded the budget; getCachedJson caught the AbortError and returned null. 3. The 500 KB portwatch fallback read hit the same timeout. 4. summaries = {} → every summaries[cp.id] was undefined → 13 chokepoints got the zero-state default → cached as a non-null success response for REDIS_CACHE_TTL (5 min) instead of NEG_SENTINEL (120 s). Fix (one PR, per docs/plans/chokepoint-rpc-payload-split.md): - ais-relay.cjs: split seedTransitSummaries output. - supply_chain:transit-summaries:v1 — compact (~30 KB, no history). - supply_chain:transit-summaries:history:v1:{id} — per chokepoint (~35 KB each, 13 keys). Both under the 1.5 s Redis read budget. - New RPC GetChokepointHistory: lazy-loaded on card expand. - get-chokepoint-status.ts: drop the 500 KB portwatch/corridorrisk/ chokepoint_transits fallback reads. Treat a null transit-summaries read as upstreamUnavailable=true so cachedFetchJson writes NEG_SENTINEL (2 min) instead of a 5-min zero-state pin. Omit history from the response (proto field stays declared; empty array). - server/_shared/redis.ts: tag AbortError timeouts with [REDIS-TIMEOUT] key=… timeoutMs=… so log drains / Sentry-Vercel integration pick up large-payload timeouts instead of them being silently swallowed. - SupplyChainPanel.ts + MapPopup.ts: lazy-fetch history on card expand via fetchChokepointHistory; session-scoped cache; graceful "History unavailable" on empty/error. PRO gating on the map popup unchanged. - Gateway: cache-tier entry for /get-chokepoint-history (slow). - Tests: regression guards for upstreamUnavailable gate + per-id key shape + handler wiring + proto query annotations. Audit included in plan: no other RPC consumer read stacks >200 KB besides displacement:summary:v1:2026 (724 KB, same risk, flagged for follow-up PR). wildfire:fires:v1 at 1.7 MB loads via bootstrap (3 s timeout, different path) — monitor but out of scope. Expected impact: - supply_chain:chokepoints:v4 payload drops from ~508 KB to <100 KB. - supply_chain:transit-summaries:v1 drops from ~502 KB to <50 KB. - RPC Redis reads stay well under 1.5 s in the hot path. - Silent zero-state pinning is now impossible: null reads → 2-min neg cache → self-heal on next relay tick. * fix(supply-chain): address PR #3185 review — stop caching empty/error + fix partial coverage Two P1 regressions caught in review: 1. Client cache poisoning on empty/error (MapPopup.ts, SupplyChainPanel.ts) Empty-array is truthy in JS, so MapPopup's `!cached && !inflight` branch never fired once we cached []. Neither `cached && cached.length` fired either — popup stuck on "Loading transit history..." for the session. SupplyChainPanel had the explicit `cached && !cached.length` branch but still never retried, so the same transient became session-sticky there too. Fix: cache ONLY non-empty successful responses. Empty/error show the "History unavailable" placeholder but leave the cache untouched, so the next re-expand retries. The /get-chokepoint-history gateway tier is "slow" (5-min CF edge cache) → retries stay cheap. 2. Partial portwatch coverage treated as healthy (ais-relay.cjs) seedTransitSummaries iterated Object.entries(pw), so if seed-portwatch dropped N of 13 chokepoints (ArcGIS reject/empty), summaries had <13 keys. get-chokepoint-status upstreamUnavailable fires only on fully-empty summaries, so the N missing chokepoints fell through to zero-state rows that got pinned in cache for 5 minutes. Fix: iterate CANONICAL_IDS (Object.keys(CHOKEPOINT_THREAT_LEVELS)) and fill zero-state for any ID missing from pw. Shape is consistently 13 keys. Track pwCovered → envelope + seed-meta recordCount reflect real upstream coverage (not shape size), so health.js can distinguish 13/13 healthy from 10/13 partial. Warn-log on shortfall. Tests: new regression guards - panel must NOT cache empty arrays (historyCache.set with []). - writer must iterate CANONICAL_IDS, not Object.entries(pw). - seed-meta recordCount binds to pwCovered. 5718/5718 data tests pass. typecheck + typecheck:api clean.	2026-04-18 23:14:00 +04:00
Elie Habib	55ac431c3f	feat(brief): public share mirror + in-magazine Share button (#3183 ) * feat(brief): public share mirror + in-magazine Share button Adds the growth-vector piece listed under Future Considerations in the original brief plan (line 399): a shareable public URL and a one-click Share button on the reader's magazine. Problem: the per-user magazine at /api/brief/{userId}/{issueDate} is HMAC-signed to a specific reader. You cannot share the URL you are looking at, because the recipient either 403s (bad token) or reads your personalised issue against your userId. Result: no way to share the daily brief, no way for readers to drive discovery. Opening a growth loop requires a separate public surface. Approach: deterministic HMAC-derived short hash per {userId, issueDate} backed by a pointer key in Redis. New files - server/_shared/brief-share-url.ts Web Crypto HMAC helper. deriveShareHash returns 12 base64url chars (72 bits) from (userId, issueDate) using BRIEF_SHARE_SECRET. Pointer encode/decode helpers and a shape check. Distinct from the per-user BRIEF_URL_SIGNING_SECRET so a leak of one does not automatically unmask the other. - api/brief/share-url.ts (edge, Clerk auth, Pro gated) POST /api/brief/share-url?date=YYYY-MM-DD Idempotently writes brief:public:{hash} pointer with the same 7 day TTL as the underlying brief, then returns {shareUrl, hash, issueDate}. 404 if the per-user brief is missing. 503 on Upstash failure. Accepts an optional refCode in the JSON body for referral attribution. - api/brief/public/[hash].ts (edge, unauth) GET /api/brief/public/{hash}?ref={code} Reads pointer, reads the real brief envelope, renders with publicMode=true. Emits X-Robots-Tag: noindex,nofollow so shared briefs never get enumerated by search engines. 404 on any missing part (bad hash shape, missing pointer, missing envelope) with a neutral error page. 503 on Upstash failure. Renderer changes (server/_shared/brief-render.js) - Signature extended: renderBriefMagazine(envelope, options?) - options.publicMode: redacts user.name and whyMatters before any HTML emission; swaps the back cover to a Subscribe CTA; prepends a Subscribe strip across the top of the deck; omits the Share button + share script; adds a noindex meta tag. - options.refCode: appended as ?ref= to /pro links on public views. - Non-public views gain a sticky .wm-share pill in the top-right chrome. Inline SHARE_SCRIPT handles the click flow: POST /api/ brief/share-url then navigator.share with clipboard fallback and a prompt() ancient-browser fallback. User-visible feedback via data-state on the button (sharing / copied / error). No change to the envelope contract, no LLM calls, no composer-side work required. - Validation runs on the full unredacted envelope first, so the public path can never accept a shape the private path would reject. Tests - tests/brief-share-url.test.mts (18 assertions): determinism, secret sensitivity, userId/date sensitivity, shape validation, URL composition with/without refCode, trailing-slash handling on baseUrl, pointer encode/decode round-trip. - tests/brief-magazine-render.test.mjs (+13 assertions): Share button carries the issue date; share script emitted once; share-url endpoint wired; publicMode strips the button+script, replaces whyMatters, emits noindex meta, prepends Subscribe strip, passes refCode through with escaping, swaps the back cover, does not leak the user name, preserves story headlines, options-less call matches the empty-options call byte for byte. - Full typecheck/lint/edge-bundle/test:data/edge-functions suite all green: 5704/5704 data tests, 171/171 edge-function tests, 0 lint errors. Env vars (new) - BRIEF_SHARE_SECRET: 64+ random hex chars, Vercel (edge) only. NOT needed by the Railway composer because pointer writes are lazy (on share, not on compose). * fix(brief): public share round-trip + magazine Share button without auth Two P1 findings on #3183 review. 1) Pointer wire format: share-url.ts wrote the pointer as a raw colon-delimited string via SET. The public route reads via readRawJsonFromUpstash which ALWAYS JSON.parses. A bare non-JSON string throws at parse, the route returned 503 instead of resolving. Fix: JSON.stringify on both write sites. Regression test locks the wire format. 2) Share button auth unreachable from a standalone magazine tab: inline script needed window.WM_CLERK_JWT which is never set, endpoint hard-requires Bearer, fallback to credentials:include fails. Fix: derive share URL server-side in the per-user route (same inputs share-url uses), embed as data-share-url, click handler now reads dataset and invokes navigator.share directly. No network, no auth, works in any tab. The /api/brief/share-url endpoint stays in place for other callers (dashboard panel) with its Clerk auth intact and its pointer write now in the correct format. QA: typecheck clean, 5708/5708 data tests, 45/45 magazine, 20/20 share-url, edge bundle OK, lint exit 0. * fix(brief): address remaining review findings on #3183 P0-2 (comment-only): public/[hash].ts inline comment incorrectly described readRawJsonFromUpstash parse-failure behaviour. The helper rethrows on JSON.parse failure, it does not return null. Rewrote the comment to match reality (JSON-encoded wire format, parse-to-string round-trip, intentional 503-on-bug-value as the loud failure mode). The actual wire-format fix was in prior commit `045771d55`. P2 (consistency): publicStripHtml href was built via template literal + encodeURIComponent without the final escapeHtml wrap that renderBackCover uses. Safe in practice (encodeURIComponent handles all HTML-special chars + route boundary restricts refCode to [A-Za-z0-9_-]) but inconsistent. Unified by extracting publicStripHref and escaping on interpolation, matching the sibling function. QA: typecheck clean, 45/45 magazine tests pass, lint exit 0.	2026-04-18 22:46:22 +04:00
Elie Habib	81536cb395	feat(brief): source links, LLM descriptions, strip suffix (envelope v2) (#3181 ) * feat(brief): source links, LLM descriptions, strip publisher suffix (envelope v2) Three coordinated fixes to the magazine content pipeline. 1. Headlines were ending with " - AP News" / " \| Reuters" etc. because the composer passed RSS titles through verbatim. Added stripHeadlineSuffix() in brief-compose.mjs, conservative case- insensitive match only when the trailing token equals primarySource, so a real subtitle that happens to contain a dash still survives. 2. Story descriptions were the headline verbatim. Added generateStoryDescription to brief-llm.mjs, plumbed into enrichBriefEnvelopeWithLLM: one additional LLM call per story, cached 24h on a v1 key covering headline, source, severity, category, country. Cache hits are revalidated via parseStoryDescription so a bad row cannot flow to the envelope. Falls through to the cleaned headline on any failure. 3. Source attribution was plain text, no outgoing link. Bumped BRIEF_ENVELOPE_VERSION to 2, added BriefStory.sourceUrl. The composer now plumbs story:track:v1.link through digestStoryToUpstreamTopStory, UpstreamTopStory.primaryLink, filterTopStories, BriefStory.sourceUrl. The renderer wraps the Source line in an anchor with target=_blank, rel=noopener noreferrer, and UTM params (utm_source=worldmonitor, utm_medium=brief, utm_campaign=<issueDate>, utm_content=story- <rank>). UTM appending is idempotent, publisher-attributed URLs keep their own utm_source. Envelope validation gains a validateSourceUrl step (https/http only, no userinfo credentials, parseable absolute URL). Stories without a valid upstream link are dropped by filterTopStories rather than shipping with an unlinked source. Tests: 30 renderer tests to 38; new assertions cover UTM presence on every anchor, HTML-escaping of ampersands in hrefs, pre-existing UTM preservation, and all four validator rejection modes. New composer tests cover suffix stripping, link plumb-through, and v2 drop-on-no- link behaviour. New LLM tests for generateStoryDescription cover cache hit/miss, revalidation of bad rows, 24h TTL, and null-on- failure. * fix(brief): v1 back-compat window on renderer + consolidate story hash helper Two P1/P2 review findings on #3181. P1 (v1 back-compat). Bumping BRIEF_ENVELOPE_VERSION 1 to 2 made every v1 envelope still resident in Redis under the 7-day TTL fail assertBriefEnvelope. The hosted /api/brief route would 404 "expired" and the /api/latest-brief preview would downgrade to "composing", breaking already-issued links from the preceding week. Fix: renderer now accepts SUPPORTED_ENVELOPE_VERSIONS = Set([1, 2]) on READ. BRIEF_ENVELOPE_VERSION stays at 2 and is the only version the composer ever writes. BriefStory.sourceUrl is required when version === 2 and absent on v1; when rendering a v1 story the source line degrades to plain text (no anchor), matching pre-v2 appearance. When the TTL window passes the set can shrink to [2] in a follow-up. P2 (hash dedup). hashStoryDescription was byte-identical to hashStory, inviting silent drift if one prompt gains a field the other forgets. Consolidated into hashBriefStory. Cache key separation remains via the distinct prefixes (brief:llm:whymatters:v2:/brief:llm:description:v1:). Tests: adds 3 v1 back-compat assertions (plain source line, field validation still runs, defensive sourceUrl check), updates the version-mismatch assertion to match the new supported-set message. 161/161 pass (was 158). Full test:data 5706/5706.	2026-04-18 21:49:17 +04:00
Elie Habib	8fc302abd9	fix(brief): mobile layout — stack story callout, floor digest typography (#3180 ) * fix(brief): mobile layout — stack story callout, floor digest typography On viewports <=640px the 55/45 story grid cramped both the headline and the "Why this is important" callout to ~45% width each, and several digest rules used raw vw units (blockquote 2vw, threads 1.55vw) that collapsed to ~7-8px on a 393px iPhone frame before the browser min clamped them to barely-readable. Appends a single @media (max-width: 640px) block to the renderer's STYLE_BLOCK: - .story becomes a flex column — callout stacks under the headline, no column squeeze. Headline goes full-width at 9.5vw. - Digest blockquote, threads, signals, and stat rows get max(Npx, Nvw) floors so they never render below ~15-17px regardless of viewport. - Running-head stacks on digest and the absolute page-number gets right-hand clearance so they stop overlapping. - Tags and source labels pinned to 11px (were scaling down with vw). CSS-only; no envelope, no HTML structure, no new classes. All 30 renderBriefMagazine tests still pass. * fix(brief): raise mobile digest px floors and running-head clearance Two P2 findings from PR review on #3180: 1. .digest .running-head padding-right: 18vw left essentially zero clearance from the absolute .page-number block on iPhone SE (375px) and common Android (360px). Bumped to 22vw (~79px at 360px) which accommodates "09 / 12" in IBM Plex Mono at the right:5vw offset with a one-vw safety margin. 2. Mobile overrides were lowering base-rule px floors (thread 17px to 15px, signal 18px to 15px). On viewports <375px this rendered digest body text smaller than desktop. Kept the px floors at or above the base rules so effective size only ever goes up on mobile.	2026-04-18 21:37:40 +04:00
Elie Habib	6f6102e5a7	feat(brief): swap sienna rust for two-strength WM mint (Option B palette) (#3178 ) * feat(brief): swap sienna rust for two-strength WM mint (Option B palette) The only off-brand color in the product was the brief's sienna rust (#8b3a1f) accent. Every other surface — /pro landing, dashboard, dashboard panels — uses the WM mint green (#4ade80). Swapping the brief's accent to the brand mint makes the magazine read as a sibling of /pro rather than a separate editorial product, while keeping the magazine-grade serif typography and even/odd page inversion intact. Implementation (user picked Option B from brief-palette-playground.html): --sienna : #8b3a1f -> #3ab567 muted mint for LIGHT pages (readable on #fafafa without the bright-mint glare of a pure-brand swap) --mint : + #4ade80 bright WM mint for DARK pages (matches /pro exactly) --cream : #f1e9d8 -> #fafafa unified with --paper; one crisp white --cream-ink: #1a1612 -> #0a0a0a crisper contrast on the new paper Accent placement (unchanged structurally — only colors swapped): - Digest running heads, labels, blockquote rule, stats dividers, end-marker rule, signal/thread tags: all muted mint on light - Story source line: newly mint (was unstyled bone/ink at 0.6 opacity); two-strength — muted on light stories, bright on dark - Logo ekg dot: mint on every page so the brand 'signal' pulse threads through the whole magazine No layout changes. No HTML structure changes. Only color constants + a ~20-line CSS addition for story-source + ekg-dot accents. 165/165 brief tests pass (renderer contract unchanged — envelope shape identical, only computed styles differ). Both tsconfigs typecheck clean. * fix(brief): darken light-page mint to pass WCAG AA + fix digest ekg-dot Two P2 findings on PR #3178 review. 1. Digest ekg-dot used bright #4ade80 on a #fafafa background, contradicting the code comment that said 'light pages use the muted mint'. The rule was grouped with .cover and .story.dark (both ink backgrounds) when it should have been grouped with .story.light (paper background). Regrouped. 2. #3ab567 on #fafafa tests at ~2.31:1 — fails WCAG AA 4.5:1 for every text size and fails the 3:1 large-text floor. The PR called this a rollback trigger; contrast math says it would fail every meaningful text usage (mono running heads, source lines, labels, footer captions). Swapped --sienna from #3ab567 to #1f7a3f — tested at ~4.90:1 on #fafafa, passes AA for normal text. Kept the variable name '--sienna' for backwards compat (every .digest rule references it). The hue stays recognisably mint- family (green dominant) so the brand relationship with #4ade80 on dark pages is still clear to a reader. Dark-page mint is unchanged — #4ade80 on #0a0a0a is ~11.4:1, passes AAA. Playground (brief-palette-playground.html) updated to match so future iterations work against the accessible value. 165/165 brief tests pass. Both tsconfigs typecheck clean.	2026-04-18 20:50:16 +04:00
Elie Habib	b5824d0512	feat(brief): Phase 9 / Todo #223 — share button + referral attribution (#3175 ) * feat(brief): Phase 9 / Todo #223 — share button + referral attribution Adds a Share button to the dashboard Brief panel so PRO users can spread WorldMonitor virally. Built on the existing referral plumbing (registrations.referralCode + referredBy fields; api/register-interest already passes referredBy through) — this PR fills in the last mile: a stable referral code for signed-in Clerk users, a share URL, and a client-side share sheet. Files: server/_shared/referral-code.ts (new) Deterministic 8-char hex code: HMAC(BRIEF_URL_SIGNING_SECRET, 'referral:v1:' + userId). Same Clerk userId always produces the same code. No DB write on login, no schema migration, stable for the life of the account. api/referral/me.ts (new) GET -> { code, shareUrl, invitedCount }. Bearer-auth via Clerk. Reuses BRIEF_URL_SIGNING_SECRET to avoid another Railway env var. Stats fail gracefully to 0 on Convex outage. convex/registerInterest.ts + convex/http.ts New internal query getReferralStatsByCode({referralCode}) counts registrations rows that named this code as their referredBy. Exposed via POST /relay/referral-stats (RELAY_SHARED_SECRET auth). src/services/referral.ts (new) getReferralProfile: 5-min cache, profile is effectively immutable shareReferral: Web Share API primary (mobile native sheet), clipboard fallback on desktop. Returns 'shared'/'copied'/'blocked' /'error'. AbortError is treated as 'blocked', not failure. clearReferralCache for account-switch hygiene. src/components/LatestBriefPanel.ts + src/styles/panels.css New share row below the brief cover card. Button disabled until /api/referral/me resolves; if fetch fails the row removes itself. invitedCount > 0 renders as 'N invited' next to the button. Referral cache invalidated alongside Clerk token cache on account switch (otherwise user B would see user A's share link for 5 min). Tests: 10 new cases in tests/brief-referral-code.test.mjs - getReferralCodeForUser: hex shape, determinism, uniqueness, secret-rotation invalidates, input guards - buildShareUrl: path shape, trailing-slash trim, URL-encoding 153/153 brief + deploy tests pass. Both tsconfigs typecheck clean. Attribution flow (already working end-to-end): 1. Share button -> worldmonitor.app/pro?ref={code} 2. /pro landing page already reads ?ref= and passes to /api/register-interest as referredBy 3. convex registerInterest:register increments the referrer's referralCount and stores referredBy on the new row 4. /api/referral/me reads the count back via the relay query 5. 'N invited' updates on next 5-min cache refresh Scope boundaries (deferred): - Convex conversion tracking (invited -> PRO subscribed). Needs a join from registrations.referredBy to subscriptions.userId via email. Surface 'N converted' in a follow-up. - Referral-credit / reward system: viral loop works today, reward logic is a separate product decision. * fix(brief): address three P2 review findings on #3175 - api/referral/me.ts JSDoc said '503 if REFERRAL_SIGNING_SECRET is not configured' but the handler actually reads BRIEF_URL_SIGNING_SECRET. Updated the docstring so an operator chasing a 503 doesn't look for an env var that doesn't exist. - server/_shared/referral-code.ts carried a RESERVED_CODES Set to avoid collisions with URL-path keywords ('index', 'robots', 'admin'). The guard is dead code: the code alphabet is [0-9a-f] (hex output of the HMAC) so none of those non-hex keywords can ever appear. Removed the Set + the while loop; left a comment explaining why it was unnecessary so nobody re-adds it. - src/components/LatestBriefPanel.ts passed disabled: 'true' (string) to the h() helper. DOM-utils' h() calls setAttribute for unknown props, which does disable the button — but it's inconsistent with the later .disabled = false property write. Fixed to the boolean disabled: true so the attribute and the IDL property agree. 10/10 referral-code tests pass. Both tsconfigs typecheck clean. * fix(brief): address two review findings on #3175 — drop misleading count + fix user-agnostic cache P1: invitedCount wired to the wrong attribution store. The share URL is /pro?ref=<code>. On /pro the 'ref' feeds Dodopayments checkout metadata (affonso_referral), NOT registrations.referredBy. /api/referral/me counted only the waitlist path, so the panel would show 0 invited for anyone who converted direct-to-checkout — misleading. Rather than ship a count that measures only one of two attribution paths (and the less-common one at that), the count is removed entirely. The share button itself still works. A proper metric requires unifying the waitlist + Dodo-metadata paths into a single attribution store, which is a follow-up. Changes: - api/referral/me.ts: response shape is { code, shareUrl } — no invitedCount / convertedCount - convex/registerInterest.ts: removed getReferralStatsByCode internal query - convex/http.ts: removed /relay/referral-stats route - src/services/referral.ts: ReferralProfile interface no longer has invitedCount; fetch call unchanged in behaviour - src/components/LatestBriefPanel.ts: dropped the 'N invited' render branch P2: referral cache was user-agnostic. Module-global _cached had no userId key, so a stale cache primed by user A would hand user B user A's share link for up to 5 min after an account switch — if no panel is mounted at the transition moment to call clearReferralCache(). Per the reviewer's point, this is a real race. Fix: two-part. (a) Cache entry carries the userId it was computed for; reads check the current Clerk userId and only accept hits when they match. Mismatch → drop + re-fetch. (b) src/services/referral.ts self-subscribes to auth-state at module load (ensureAuthSubscription). On any id transition _cached is dropped. Module-level subscription means the invalidation works even when no panel is currently mounted. (c) Belt-and-suspenders: post-fetch, re-check the current user before caching. Protects against account switches that happen mid-flight between 'read cache → ask network → write cache'. Panel's local clearReferralCache() call removed — module now self-invalidates. 10/10 referral-code tests pass. Both tsconfigs typecheck clean. * fix(referral): address P1 review finding — share codes now actually credit the sharer The earlier head generated 8-char Clerk-derived HMAC codes for the share button, but the waitlist register mutation only looked up registrations.by_referral_code (6-char email-generated codes). Codes issued by the share button NEVER resolved to a sharer — the 'referral attribution' half of the feature was non-functional. Fix (schema-level, honest attribution path): convex/schema.ts - userReferralCodes { userId, code, createdAt } + by_code, by_user - userReferralCredits { referrerUserId, refereeEmail, createdAt } + by_referrer, by_referrer_email convex/registerInterest.ts - register mutation: after the existing registrations.by_referral_code lookup, falls through to userReferralCodes.by_code. On match, inserts a userReferralCredits row (the Clerk user has no registrations row to increment, so credit needs its own table). Dedupes by (referrer, refereeEmail) so returning visitors can't double-credit. - new internalMutation registerUserReferralCode({userId, code}) idempotent binding of a code to a userId. Collisions logged and ignored (keeps first writer). convex/http.ts - new POST /relay/register-referral-code (RELAY_SHARED_SECRET auth) that runs the mutation above. api/referral/me.ts - signature gains a ctx.waitUntil handle - after generating the user's code, fire-and-forget POSTs to /relay/register-referral-code so the binding is live by the time anyone clicks a shared link. Idempotent — a failure just means the NEXT call re-registers. Still deferred: display of 'N credited' / 'N converted' in the LatestBriefPanel. The waitlist side now resolves correctly, but the Dodopayments checkout path (/pro?ref=<code> → affonso_referral) is tracked in Dodo, not Convex. Surfacing a unified count requires a separate follow-up to pull Dodo metadata into Convex. Regression tests (3 new cases in tests/brief-referral-code.test.mjs): - register mutation extends to userReferralCodes + inserts credits - schema declares both tables with the right indexes - /api/referral/me registers the binding via waitUntil 13/13 referral tests pass. Both tsconfigs typecheck clean. * fix(referral): address two P1 review findings — checkout attribution + dead-link prevention P1: share URL didn't credit on the /pro?ref= checkout path. The earlier PR wired Clerk codes into the waitlist path (/api/register-interest -> userReferralCodes -> userReferralCredits) but a visitor landing on /pro?ref=<code> and going straight to Dodo checkout forwarded the code only into Dodo metadata (affonso_referral). Nothing on our side credited the sharer. Fix: convex/payments/subscriptionHelpers.ts handleSubscriptionActive now reads data.metadata.affonso_referral when inserting a NEW subscription row. If the code resolves in userReferralCodes, a userReferralCredits row crediting the sharer is inserted (deduped by (referrer, refereeEmail) so webhook replays don't double-credit). The credit only lands on first-activation — the else-branch of the existing/new split guards against replays. P1: /api/referral/me returned 200 + share link even when the (code, userId) binding failed. ctx.waitUntil(registerReferralCodeInConvex(...)) ran the binding asynchronously, swallowing missing env + non-2xx + network errors. Users got a share URL that the waitlist lookup could never resolve — dead link. Fix: registerReferralCodeInConvex is now BLOCKING (throws on any failure) and the handler awaits it before returning. On failure the endpoint responds 503 service_unavailable rather than a 200 with a non-functional URL. Mutation is idempotent so client retries are safe. Regression tests (2 updated/new in tests/brief-referral-code.test.mjs): - asserts the binding is awaited, not ctx.waitUntil'd; asserts the failure path returns 503 - asserts subscriptionHelpers reads affonso_referral, resolves via userReferralCodes.by_code, inserts a userReferralCredits row, and dedupes by (referrer, refereeEmail) 14/14 referral tests pass. Both tsconfigs typecheck clean. Net effect: /pro?ref=<code> visitors who convert (direct checkout) now credit the sharer on webhook receipt, same as waitlist signups. The share button is no longer a dead-end UI.	2026-04-18 20:39:55 +04:00
Elie Habib	122204f691	feat(brief): Phase 8 — Telegram carousel images via Satori + resvg-wasm (#3174 ) * feat(brief): Phase 8 — Telegram carousel images via Satori + resvg-wasm Implements the Phase 8 carousel renderer (Option B): server-side PNG generation in a Vercel edge function using Satori (JSX to SVG) + @resvg/resvg-wasm (SVG to PNG). Zero new Railway infra, zero Chromium, same edge runtime that already serves the magazine HTML. Files: server/_shared/brief-carousel-render.ts (new) Pure renderer: (BriefEnvelope, CarouselPage) -> Uint8Array PNG. Three layouts (cover/threads/story), 1200x630 OG size. Satori + resvg + WASM are lazy-imported so Node tests don't trip over '?url' asset imports and the 800KB wasm doesn't ship in every bundle. Font: Noto Serif regular, fetched once from Google Fonts and memoised on the edge isolate. api/brief/carousel/[userId]/[issueDate]/[page].ts (new) Public edge function reusing the magazine route's HMAC token — same signer, same (userId, issueDate) binding, so one token unlocks magazine HTML AND all three carousel images. Returns image/png with 7d immutable cache headers. 404 on invalid page index, 403 on bad token, 404 on Redis miss, 503 on missing signing secret. Render failure falls back to a 1x1 transparent PNG so Telegram's sendMediaGroup doesn't 500 the brief. scripts/seed-digest-notifications.mjs carouselUrlsFrom(magazineUrl) derives the 3 signed carousel URLs from the already-signed magazine URL. sendTelegramBriefCarousel calls Telegram's sendMediaGroup with those URLs + short caption. Runs before the existing sendTelegram(text) so the carousel is the header and the text the body — long-form stories remain forwardable as text. Best-effort: carousel failure doesn't block text delivery. package.json + package-lock.json satori ^0.10.14 + @resvg/resvg-wasm ^2.6.2. Tests (tests/brief-carousel.test.mjs, 9 cases): - pageFromIndex mapping + out-of-range - carouselUrlsFrom: valid URL, localhost origin preserved, missing token, wrong path, invalid issueDate, garbage input - Drift guard: cron must still declare the same helper + template string. If it drifts, test fails with a pointer to move the impl into a shared module. PNG render itself isn't unit-tested — Satori + WASM need a browser/edge runtime. Covered by smoke validation step in the deploy monitoring plan. Both tsconfigs typecheck clean. 152/152 brief tests pass. Scope boundaries (deferred): - Slack + Discord image attachments (different payload shapes) - notification-relay.cjs brief_ready dispatch (real-time route) - Redis caching of rendered PNG (edge Cache-Control is enough for MVP) * fix(brief): address two P1 review findings on Phase 8 carousel P1-A: 200 placeholder PNG cached 7d on render failure. Route config said runtime: 'edge' but a comment contradicted it claiming Node semantics. More importantly, any render/init failure (WASM load, Satori, Google Fonts) was converted to a 1x1 transparent PNG returned with Cache-Control: public, max-age=7d, immutable. Telegram's media fetcher and Vercel's CDN would cache that blank for the full brief TTL per chat message — one cold-start mismatch = every reader of that brief sees blank carousel previews for a week. Fix: deleted errorPng(). Render failure now returns 503 with Cache-Control: no-store. sendMediaGroup fails cleanly for that carousel (the digest cron already treats it as best-effort and still sends the long-form text message), next cron tick re-renders from a fresh isolate. Self-healing across ticks. Contradictory comment about Node runtime removed. P1-B: Google Fonts as silent hard dependency. The renderer claimed 'safe embedded/fallback path' in comments but no fallback existed. loadFont() fetches Noto Serif from gstatic.com and rethrows on any failure. Combined with P1-A's old 200-cache-7d path, a transient CDN blip would lock in a blank carousel for a week. Fix: updated comments to honestly declare the CDN dependency plus document the self-healing semantics now that P1-A's fix no longer caches the failure. If Google Fonts reliability becomes a problem, swap the fetch for a bundled base64 TTF — noted as the escape hatch. Tests (tests/brief-carousel.test.mjs): 2 new regression cases. 11/11 carousel tests pass. Both tsconfigs typecheck clean locally. Note on currently-red CI: failures are NOT typecheck errors — npm ci dies fetching libvips for sharp (504 Gateway Time-out from GitHub releases). sharp is a transitive dep via @xenova/transformers, pre-existing, not touched by this PR. Transient infra flake. * fix(brief): switch carousel to Node + @resvg/resvg-js (fixes deploy block) Vercel edge bundler fails the carousel deploy with: 'Edge Function is referencing unsupported modules: @resvg/resvg-wasm/index_bg.wasm?url' The ?url asset-import syntax is a Vite-ism that Vercel's edge bundler doesn't resolve. Two ways out: find a Vercel-blessed edge WASM import incantation, or switch to Node runtime with the native @resvg/resvg-js binding. The second is simpler, faster per request, and avoids the whole WASM-in-edge-bundler rabbit hole. Changes: - package.json: @resvg/resvg-wasm -> @resvg/resvg-js ^2.6.2 - api/brief/carousel/.../[page].ts: runtime 'edge' -> 'nodejs20.x' - server/_shared/brief-carousel-render.ts: drop initWasm path, dynamic-import resvg-js in ensureLibs(). Satori and resvg load in parallel via Promise.all, shaving ~30ms off cold start. Also addresses the P2 finding from review: the old ensureLibsAndWasm had a concurrent-cold-start race where two callers could reach 'await initWasm()' simultaneously. Replaced the boolean flag with a shared _libsLoadPromise so concurrent callers await the same load. On failure the promise resets so the NEXT request retries rather than poisoning the isolate for its lifetime. Cold start ~700ms (Satori + resvg-js native init), warm ~40ms. Carousel images are not latency-critical — fetched by Telegram's media service, CDN-cached 7d. Both tsconfigs typecheck clean. 11/11 carousel tests pass. * fix(brief): carousel runtime = 'nodejs' (was 'nodejs20.x', rejected by Vercel) Vercel's functions config validator rejects 'nodejs20.x' at deploy time: unsupported "runtime" value in config: "nodejs20.x" (must be one of: ["edge","experimental-edge","nodejs"]) The Node version comes from the project's default (currently Node 20 via package.json engines + Vercel project settings), not from the runtime string. Use 'nodejs' — unversioned — and let the platform resolve it. 11/11 carousel tests pass. * fix(brief): swap carousel font from woff2 to woff (Satori can't parse woff2) Review on PR #3174: the FONT_URL was pointing at a gstatic.com woff2 file. Satori parses ttf / otf / woff v1 — NOT woff2. Every render was about to throw on font decode, the route would return 503, and the carousel would never deliver a single image. Fix: point FONT_URL at @fontsource's Noto Serif Latin 400 WOFF v1 via jsdelivr. WOFF v1 is a TrueType wrapper that Satori parses natively (verified: file says 'Web Open Font Format, TrueType, version 1.1'). Same cold-start semantics as before — one fetch per warm isolate, memoised. Regression test: asserts FONT_URL ends in ttf/otf/woff and explicitly rejects any .woff2 suffix. A future swap that silently reintroduces woff2 now fails CI loudly instead of shipping a permanently-broken renderer. 12/12 carousel tests pass. Both tsconfigs typecheck clean.	2026-04-18 20:27:41 +04:00
Elie Habib	de769ce8e1	fix(api): unblock Pro API clients at edge + accept x-api-key alias (#3155 ) * fix(api): unblock Pro API clients at edge + accept x-api-key alias Fixes #3146: Pro API subscriber getting 403 when calling from Railway. Two independent layers were blocking server-side callers: 1. Vercel Edge Middleware (middleware.ts) blocks any UA matching /bot\|curl\/\|python-requests\|go-http\|java\//, which killed every legitimate server-to-server API client before the gateway even saw the request. Add bypass: requests carrying an `x-worldmonitor-key` or `x-api-key` header that starts with `wm_` skip the UA gate. The prefix is a cheap client-side signal, not auth — downstream server/gateway.ts still hashes the key and validates against the Convex `userApiKeys` table + entitlement check. 2. Header name mismatch. Docs/gateway only accepted `X-WorldMonitor-Key`, but most API clients default to `x-api-key`. Accept both header names in: - api/_api-key.js (legacy static-key allowlist) - server/gateway.ts (user-issued Convex-backed keys) - server/_shared/premium-check.ts (isCallerPremium) Add `X-Api-Key` to CORS Allow-Headers in server/cors.ts and api/_cors.js so browser preflights succeed. Follow-up outside this PR (Cloudflare dashboard, not in repo): - Extend the "Allow api access with WM" custom WAF rule to also match `starts_with(http.request.headers["x-api-key"][0], "wm_")`, so CF Managed Rules don't block requests using the x-api-key header name. - Update the api-cors-preflight CF Worker's corsHeaders to include `X-Api-Key` (memory: cors-cloudflare-worker.md — Worker overrides repo CORS on api.worldmonitor.app). * fix(api): tighten middleware bypass shape + finish x-api-key alias coverage Addresses review findings on #3155: 1. middleware.ts bypass was too loose. "Starts with wm_" let any caller send X-Api-Key: wm_fake and skip the UA gate, shifting unauthenticated scraper load onto the gateway's Convex lookup. Tighten to the exact key format emitted by src/services/api-keys.ts:generateKey — `^wm_[a-f0-9]{40}$` (wm_ + 20 random bytes as hex). Still a cheap edge heuristic (no hash lookup in middleware), but raises spoofing from trivial prefix match to a specific 43-char shape. 2. Alias was incomplete on bespoke endpoints outside the shared gateway: - api/v2/shipping/route-intelligence.ts: async wm_ user-key fallback now reads X-Api-Key as well - api/v2/shipping/webhooks.ts: webhook ownership fingerprint now reads X-Api-Key as well (same key value → same SHA-256 → same ownerTag, so a user registering with either header can manage their webhook from the other) - api/widget-agent.ts: accept X-Api-Key in the auth read AND in the OPTIONS Allow-Headers list - api/chat-analyst.ts: add X-Api-Key to the OPTIONS Allow-Headers list (auth path goes through shared helpers already aliased)	2026-04-18 08:18:49 +04:00
Elie Habib	d46c170012	feat(brief): hosted magazine edge route + latest-brief preview RPC (Phase 2) (#3153 ) * feat(brief): hosted magazine edge route + latest-brief preview RPC (Phase 2) Phase 2 of the WorldMonitor Brief plan (docs/plans/2026-04-17-003). Adds the read path that every downstream delivery channel binds to. Phase 1 shipped the renderer + envelope contract; Phase 3 (composer) will write brief:{userId}:{issueDate} to Redis. Until then the route returns a neutral "expired" page for every request — intentional, safe to deploy ahead of the composer. Files: - server/_shared/brief-url.ts HMAC-SHA256 sign + verify helpers using Web Crypto (edge-compatible, no node:crypto). Rotation supported via optional prevSecret in verifyBriefToken. Constant-time comparison. Strict userId and YYYY-MM-DD shape validation before any crypto operation. Typed BriefUrlError with code enum for caller branches. - api/brief/[userId]/[issueDate].ts (Vercel edge) Auth model: the HMAC-signed ?t= token IS the credential. No Clerk session required — URLs are delivered over already-authenticated channels (push, email, dashboard panel). Flow: verify token → Redis GET brief:{userId}:{issueDate} → renderBriefMagazine → HTML. 403 on bad token, 404 on Redis miss, 503 on missing secret. Never echoes the userId in error pages. - api/latest-brief.ts (Clerk JWT + PRO gated, Vercel edge) Returns { issueDate, dateLong, greeting, threadCount, magazineUrl } or { status: 'composing', issueDate } when Redis is cold. Mirrors the auth + entitlement pattern in api/notify.ts. magazineUrl is freshly signed per request against the current BRIEF_URL_SIGNING_ SECRET so rotation takes effect immediately. - tests/brief-url.test.mjs 20 tests: round-trip, tampering, wrong user/date/secret, malformed token/inputs, rotation (accept prev secret + reject after removal), URL composition (trailing-slash trim, path encoding). Quality gates: typecheck + typecheck:api clean, biome lint clean, render tests still 30/30, new HMAC tests 20/20. PRE-MERGE: BRIEF_URL_SIGNING_SECRET must be set in Vercel before this deploys. The route returns 503 (not 500) with a server-side log when the secret is missing, so a mis-configured deploy is safe to roll. * fix(brief): address Phase 2 review — broken import + HEAD body + host reflection Addresses todos 212–217 from the /ce-review on PR #3153. P0 / P1 blockers: - todo 212 (P0): renderer import path was broken — pointed at shared/render-brief-magazine.js which no longer exists (Phase 1 review moved it to server/_shared/brief-render.js). Route would have 500'd on every successful token verification; @ts-expect-error silenced the error at compile time. Fixed the path and removed the now-unnecessary @ts-expect-error since brief-render.d.ts exists. New tests/brief-edge-route-smoke.test.mjs imports the handler so a future broken path cannot pass CI. - todo 213 (P1): HEAD returned the full HTML body (RFC 7231 violation). htmlResponse() now takes the request and emits null body when method is HEAD. - todo 214 (P1): publicBaseUrl reflected x-forwarded-host, allowing a signed magazineUrl to point at a non-canonical origin (preview deploy, forwarded-host spoof). Now pinned to WORLDMONITOR_PUBLIC_BASE_URL env var, falling back to the request origin only in dev. This env var joins BRIEF_URL_SIGNING_SECRET on the pre-merge Vercel checklist. P2 cleanups: - todo 215: dropped dead 'invalid_token_shape' enum member from BriefUrlError — never thrown (verifyBriefToken returns false on shape failure by design). - todo 216: consolidated EXPIRED_PAGE + FORBIDDEN_PAGE (and added UNAVAILABLE_PAGE) behind a single renderErrorPage(heading, body) helper with shared styles. - todo 217: extracted readRawJsonFromUpstash() into api/_upstash-json.js so api/brief and api/latest-brief share one 3-second-timeout raw-GET implementation. Dodges the readJsonFromUpstash unwrap that strips seed envelopes (our brief envelope is flat, not seed-framed). Also: - brief-url.ts header now documents the rotation runbook + the emergency kill-switch path (rotate SECRET without setting PREV). - distinguishable log tag for malformed-envelope vs Redis-miss (composer-bug grep vs expired-key grep). Deferred (todo 218, P2): token-in-URL log leak is a design-level issue that wants a cookie-based first-visit exchange or iat/exp in the signed payload. Out of scope for this fix commit. Tests: 55/55 (20 HMAC + 30 render + 5 edge-route smoke). Typecheck clean, biome lint clean. * fix(brief): distinguish infra failure from miss + walk tz boundary Addresses two additional P1 review findings on PR #3153. 1. readRawJsonFromUpstash collapsed every failure mode (missing creds, HTTP non-2xx, timeout, JSON parse failure, genuine miss) into a single null return. Both edge routes then treated null as empty state: api/brief → 404 "expired", api/latest-brief → 200 "composing". During any Upstash outage a user with a valid brief would see misleading empty-state UX. Helper now throws on every infrastructure/parse failure and returns null ONLY on a genuine miss (Upstash replied 200 with result: null). Callers: - api/brief catches → 503 UNAVAILABLE_PAGE - api/latest-brief catches → 503 { error: service_unavailable } 2. api/latest-brief probed today UTC only. For users ahead of or behind UTC, a ready brief could be invisible for 12-24h around the date boundary. Route now accepts an optional ?date=YYYY-MM-DD query param so the dashboard panel (or any client) can send its local date directly. When ?date= is absent, we walk today UTC → yesterday UTC and return the most recent hit; composing is only reported when both slots miss. Malformed ?date= returns 400. Tests added: - helper-level: missing creds → throws, HTTP error → throws, genuine miss → null - route-level: api/brief returns 503 (not 404) on Upstash HTTP error Final suite: 60/60 (20 HMAC + 30 render + 10 smoke). Typecheck + lint clean. * fix(brief): tomorrow-UTC probe + unified envelope validation Addresses two third-round review findings on PR #3153. 1. Walk-back missed the tomorrow-UTC slot. Users at positive tz offsets (UTC+1..UTC+14) store today's brief under tomorrow UTC; the previous [today, yesterday] probe never saw them. Fixed to walk [tomorrow, today, yesterday] UTC — covers the full tz range and naturally prefers the most recently composed slot. Also correctly echoes the caller's ?date= in composing responses instead of always echoing today UTC. 2. readBriefPreview validated only dateLong + digest.greeting + stories (array). The renderer's assertBriefEnvelope is much stricter (closed-key contract, version guard, cross-field surfaced === stories.length, per-story 7 required fields, etc.). A partial envelope could be reported "ready" by the preview and 404-expired on click. Exported assertBriefEnvelope from the renderer module, called it in the preview reader. An envelope that would render successfully is exactly an envelope that the preview reports as ready; any divergence is logged as a composer bug and surfaced as "composing". Tests: 62/62 (20 HMAC + 30 render + 12 smoke). New cases cover the shared-validator export and catch the "partial envelope slips past weak preview" regression. Typecheck + lint clean.	2026-04-18 07:28:49 +04:00
Elie Habib	66ca645571	feat(brief): WorldMonitor Brief magazine renderer + envelope contract (Phase 1) (#3150 ) * feat(brief): add WorldMonitor Brief magazine renderer + envelope contract Phase 1 of the WorldMonitor Brief plan (docs/plans/2026-04-17-003-feat- worldmonitor-brief-magazine-plan.md). Establishes the integration boundary between the future per-user composer and every consumer surface (hosted edge route, dashboard panel, email teaser, carousel, Tauri reader). - shared/brief-envelope.{d.ts,js}: BriefEnvelope + BRIEF_ENVELOPE_VERSION - shared/render-brief-magazine.{d.ts,js}: pure (envelope) -> HTML - tests/brief-magazine-render.test.mjs: 28 shape + chrome + leak tests Page sequence is derived from data (no hardcoded counts). Threads split into 03a/03b when more than six; Signals page is omitted when signals is empty; story palette alternates light/dark by index parity. Forbidden-field guard asserts importanceScore / primaryLink / pubDate / model + provider names never appear in rendered HTML. No runtime impact: purely additive, no consumers yet. * fix(brief): address code review findings on PR #3150 Addresses all seven review items from todos/205..211. P1 (merge blockers): - todo 205: forbidden-field test no longer matches free-text content like 'openai' / 'claude' / 'gemini' (false-positives on legitimate stories). Narrowed to JSON-key structural tokens ('"importanceScore":', '"_seed":', etc.) and added a sentinel- poisoning test that injects values into non-data envelope fields and asserts they never appear in output. - todo 206: drop 'moderate' from BriefThreatLevel union (synonym of 'medium'). Four-value ladder: critical \| high \| medium \| low. Added THREAT_LABELS map + HIGHLIGHTED_LEVELS set so display label and highlight rule live together instead of inline char-case + hardcoded comparison. - todo 207: replace the minimal validator with assertBriefEnvelope() that walks every required field (user.name/tz, date YYYY-MM-DD shape, digest.greeting/lead/numbers.{clusters,multiSource,surfaced}, threads array + per-element shape, signals array of strings, per-story required fields + threatLevel enum). Throws with field-path message on first miss. Adds nine negative tests covering common omissions. - todo 208: envelope carries version guard. assertBriefEnvelope throws when envelope.version !== BRIEF_ENVELOPE_VERSION with a message naming both observed and expected versions. P2 (should-fix, now included): - todo 209: drop the _seed wrapper and make BriefEnvelope a flat { version, issuedAt, data } shape. A per-user brief is not a global seed; reusing _seed invited mis-application of seed invariants (SEED_META pairing, TTL rotation). Locked down before Phase 3 composer bakes the shape in. - todo 210: move renderer from shared/render-brief-magazine.{js,d.ts} to server/_shared/brief-render.{js,d.ts}. The 740-line template doesn't belong on the shared/ mirror hot path (Vercel+Railway). Keep only the envelope contract in shared/. Import path updated in tests. - todo 211: logo SVG now defined once per document as a <symbol> and referenced via <use> at each placement (~8 refs/doc). Drops ~7KB from rendered output (~26% total size reduction on small inputs). Tests pass (26/26), typecheck clean, lint clean, mirror check (169/169) unaffected. * fix(brief): enforce closed-key contract + surfaced-count invariant Addresses two additional P1 review findings on PR #3150. 1. Validator rejects extra keys at every level (envelope root, data, user, digest, numbers, each thread, each story). Previously the forbidden-field rule was documented in the .d.ts and proven only at render time via a sentinel test — a producer could still persist importanceScore, primaryLink, pubDate, briefModel, _seed, fetchedAt, etc. into the Redis-resident envelope and every downstream consumer (edge route, dashboard panel preview, email teaser, carousel) would accept them. The validator is the only place the invariant can live. 2. Validator now asserts digest.numbers.surfaced === stories.length. The renderer uses both values — surfaced on the at-a-glance page, stories.length for the cover blurb and page count — so allowing them to disagree produced a self-contradictory brief that the old validator silently passed. Tests: 30/30 (was 26). New negative tests cover the strict-keys rejection at root / data / numbers / story[i] levels and the surfaced mismatch. The earlier sentinel-poisoning test is superseded — the strict-keys check catches the same class of bug earlier and harder.	2026-04-18 00:01:57 +04:00
Elie Habib	dcf73385ca	fix(scoring): rebalance formula weights severity 55%, corroboration 15% (#3144 ) * fix(scoring): rebalance formula weights severity 55%, corroboration 15% PR A of the scoring recalibration plan (docs/plans/2026-04-17-002). The v2 shadow-log recalibration (690 items, Pearson 0.413) showed the formula compresses scores into a narrow 30-70 range, making the 85 critical gate unreachable and the 65 high gate marginal. Root cause: corroboration at 30% weight penalizes breaking single-source news (the most important alerts) while severity at 40% doesn't separate critical from high enough. Weight change: BEFORE: severity 0.40 + sourceTier 0.20 + corroboration 0.30 + recency 0.10 AFTER: severity 0.55 + sourceTier 0.20 + corroboration 0.15 + recency 0.10 Expected effect: critical/tier1/fresh rises from 76 to 88 (clears 85 gate). critical/tier2/fresh rises from 71 to 83 (recommend lowering critical gate to 80 at activation time). high/tier2/fresh rises from 61 to 69 (clears 65 gate). The HIGH-CRITICAL gap widens from 10 to 14 points for same-tier items. Also: - Bumps shadow-log key from v2 to v3 for a clean recalibration dataset (v2 had old-weight scores that would contaminate the 48h soak) - Updates proto/news_item.proto formula comment to reflect new weights - Updates cache-keys.ts documentation No cache migration needed: the classify cache stores {level, category}, not scores. Scores are computed at read time from the stored level + the formula, so new digest requests immediately produce new scores. Gates remain OFF. After 48h of v3 data, re-run: node scripts/shadow-score-report.mjs node scripts/shadow-score-rank.mjs sample 25 🤖 Generated with Claude Opus 4.6 via Claude Code + Compound Engineering v2.49.0 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: regenerate proto OpenAPI docs for weight rebalance * fix(scoring): bump SHADOW_SCORE_LOG_KEY export to v3 The exported constant in cache-keys.ts was left at v2 while the relay's local constant was bumped to v3. Anyone importing the export (or grep- discovering it) would get a stale key. Architecture review flagged this. * fix(scoring): update test + stale comments for shadow-log v3 Review found the regression test still asserted v2 key, causing CI failure. Also fixed stale v1/v2 references in report script header, default-key comment, report title render, and shouldNotify docstring. --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-17 17:43:39 +04:00
Sebastien Melki	a4d9b0a5fa	feat(auth): user-facing API key management (create / list / revoke) (#3125 ) * feat(auth): user-facing API key management (create / list / revoke) Adds full-stack API key management so authenticated users can create, list, and revoke their own API keys from the Settings UI. Backend: - Convex `userApiKeys` table with SHA-256 key hash storage - Mutations: createApiKey, listApiKeys, revokeApiKey - Internal query validateKeyByHash + touchKeyLastUsed for gateway - HTTP endpoints: /api/api-keys (CRUD) + /api/internal-validate-api-key - Gateway middleware validates user-owned keys via Convex + Redis cache Frontend: - New "API Keys" tab in UnifiedSettings (visible when signed in) - Create form with copy-on-creation banner (key shown once) - List with prefix display, timestamps, and revoke action - Client-side key generation + hashing (plaintext never sent to DB) Closes #3116 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(api-keys): address PR review — cache invalidation, prefix validation, revoked-key guard - Invalidate Redis cache on key revocation so gateway rejects revoked keys immediately instead of waiting for 5-min TTL expiry (P1) - Enforce `wm_` prefix format with regex instead of loose length check (P2) - Skip `touchKeyLastUsed` for revoked keys to preserve clean audit trail (P2) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(api-keys): address consolidated PR review (P0–P3) P0: gate createApiKey on pro entitlement (tier >= 1); isCallerPremium now verifies key-owner tier instead of treating existence as premium. P1: wire wm_ user keys into the domain gateway auth path with async Convex-backed validation; user keys go through entitlement checks (only admin keys bypass). Lower cache TTL 300s → 60s and await revocation cache-bust instead of fire-and-forget. P2: remove dead HTTP create/list/revoke path from convex/http.ts; switch to cachedFetchJson (stampede protection, env-prefixed keys, standard NEG_SENTINEL); add tenancy check on cache-invalidation endpoint via new /api/internal-get-key-owner route; add 22 Convex tests covering tier gate, per-user limit, duplicate hash, ownership revoke guard, getKeyOwner, and touchKeyLastUsed debounce. P3: tighten keyPrefix regex to exactly 5 hex chars; debounce touchKeyLastUsed (5 min); surface PRO_REQUIRED in UI. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(api-keys): gate on apiAccess (not tier), wire wm_ keys through edge routes, harden error paths - Gate API key creation/validation on features.apiAccess instead of tier >= 1. Pro (tier 1, apiAccess=false) can no longer mint keys — only API_STARTER+. - Wire wm_ user keys through standalone edge routes (shipping/route-intelligence, shipping/webhooks) that were short-circuiting on validateApiKey before async Convex validation could run. - Restore fail-soft behavior in validateUserApiKey: transient Convex/network errors degrade to unauthorized instead of bubbling a 500. - Fail-closed on cache invalidation endpoint: ownership check errors now return 503 instead of silently proceeding (surfaces Convex outages in logs). - Tests updated: positive paths use api_starter (apiAccess=true), new test locks Pro-without-API-access rejection. 23 tests pass. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(webhooks): remove wm_ user key fallback from shipping webhooks Webhook ownership is keyed to SHA-256(apiKey) via callerFingerprint(), not to the user. With user-owned keys (up to 5 per user), this causes cross-key blindness (webhooks invisible when calling with a different key) and revoke-orphaning (revoking the creating key makes the webhook permanently unmanageable). User keys remain supported on the read-only route-intelligence endpoint. Webhook ownership migration to userId will follow in a separate PR. --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Elie Habib <elie.habib@gmail.com>	2026-04-17 07:20:39 +04:00
Elie Habib	bdfb415f8f	fix(resilience-ranking): return warmed scores from memory, skip lossy re-read (#3121 ) * fix(resilience-ranking): return warmed scores from memory, skip lossy re-read Upstash REST writes via /set aren't always visible to an immediately-following /pipeline GET in the same Vercel invocation (documented in PR #3057 / feedback_upstash_write_reread_race_in_handler.md). The ranking handler was warming 222 countries then re-reading them from Redis to compute a coverage ratio; that re-read could return 0 despite every SET succeeding, collapsing coverage to 0% < 75% and silently dropping the ranking publish. Consequence: `resilience:ranking:v9` missing, per-country score keys absent, health reports EMPTY_ON_DEMAND even while the seeder keeps writing a "fresh" meta. Fix: warmMissingResilienceScores now returns Map<cc, GetResilienceScoreResponse> with every successfully computed score. The handler merges those into cachedScores directly and drops the post-warm re-read. Coverage now reflects what was actually computed in-memory this invocation, not what Redis happens to surface after write lag. Adds a regression test that simulates pipeline-GET returning null for freshly-SET score keys; it fails against the old code (coverage=0, no ranking written) and passes with the fix (coverage=100%, ranking written). Slice A of the resilience-ranking recovery plan; Slice B (seeder meta truthfulness) follows. * fix(resilience-ranking): verify score-key persistence via pipeline SET response PR review P1: trusting every fulfilled ensureResilienceScoreCached() result as "cached" turned the read-lag fix into a write-failure false positive. cachedFetchJson's underlying setCachedJson only logs and swallows write errors, so a transient /set failure on resilience:score:v9:* would leave per-country scores absent while the ranking aggregate and its seed-meta got published on top of them — worse than the bug this PR was meant to fix. Fix: use the pipeline SET response as the authoritative persistence signal. - Extract the score builder into a pure `buildResilienceScore()` with no caching side-effects (appendHistory stays — it's part of the score semantics). - `ensureResilienceScoreCached()` still wraps it in cachedFetchJson so single-country RPC callers keep their log-and-return-anyway behavior. - `warmMissingResilienceScores()` now computes in-memory, persists all scores in one pipeline SET, and only returns countries whose command reported `result: OK`. Pipeline SET's response is synchronous with the write, so OK means actually stored — no ambiguity with read-after-write lag. - When runRedisPipeline returns fewer responses than commands (transport failure), return an empty map: no proof of persistence → coverage gate can't false-positive. Adds regression test that blocks pipeline SETs to score keys and asserts the ranking + meta are NOT published. Existing race-regression test still passes. * fix(resilience-ranking): preserve env key prefix on warm pipeline SET PR review P1: the pipeline SET added to verify score-key persistence was called with raw=true, bypassing the preview/dev key prefix (preview:<sha>:). Two coupled regressions: 1. Preview/dev deploys write unprefixed `resilience:score:v9:` keys, but all reads (getCachedResilienceScores, ensureResilienceScoreCached via setCachedJson/cachedFetchJson) look in the prefixed namespace. Warmed scores become invisible to the same preview on the next read. 2. Because production uses the empty prefix, preview writes land directly in the production-visible namespace, defeating the environment isolation guard in server/_shared/redis.ts. Fix: drop the raw=true flag so runRedisPipeline applies prefixKey on each command, symmetric with the reads. Adds __resetKeyPrefixCacheForTests in redis.ts so tests can exercise a non-empty prefix without relying on process-startup memoization order. Adds regression test that simulates VERCEL_ENV=preview + a commit SHA and asserts every score-key SET in the pipeline carries the preview:<sha>: prefix. Fails on old code (raw writes), passes now. installRedis gains an opt-in `keepVercelEnv` so the test can run under a forced env without being clobbered by the helper's default reset. test(resilience-ranking): snapshot + restore VERCEL_GIT_COMMIT_SHA in afterEach PR review P2: the preview-prefix test mutates process.env.VERCEL_GIT_COMMIT_SHA but the file's afterEach only restored VERCEL_ENV. A process started with a real preview SHA (e.g. CI) would have that value unconditionally deleted after the test ran, leaking changed state into later tests and producing different prefix behavior locally vs. CI. Fix: capture originalVercelSha at module load, restore it in afterEach, and invalidate the memoized key prefix after each test so the next one recomputes against the restored env. The preview-prefix test's finally block is no longer needed — the shared teardown handles it. Verified: suite still passes 11/11 under both `VERCEL_ENV=production` (unset) and `VERCEL_ENV=preview VERCEL_GIT_COMMIT_SHA=ci-original-sha` process environments.	2026-04-16 10:17:22 +04:00
Elie Habib	044598346e	feat(seed-contract): PR 2a — runSeed envelope dual-write + 91 seeders migrated (#3097 ) * feat(seed-contract): PR 2a — runSeed envelope dual-write + 91 seeders migrated Opt-in contract path in runSeed: when opts.declareRecords is provided, write {_seed, data} envelope to the canonical key alongside legacy seed-meta:* (dual-write). State machine: OK / OK_ZERO / RETRY with zeroIsValid opt. declareRecords throws or returns non-integer → hard fail (contract violation). extraKeys[] support per-key declareRecords; each extra key writes its own envelope. Legacy seeders (no declareRecords) entirely unchanged. Migrated all 91 scripts/seed-.mjs to contract mode. Each exports declareRecords returning the canonical record count, and passes schemaVersion: 1 + maxStaleMin (matched to api/health.js SEED_META, or 2.5x interval where no registry entry exists). Contract conformance reports 84/86 seeders with full descriptor (2 pre-existing warnings). Legacy seed-meta keys still written so unmigrated readers keep working; follow-up slices flip health.js + readers to envelope-first. Tests: 61/61 PR 1 tests still pass. Next slices for PR 2: - api/health.js registry collapse + 15 seed-bundle-.mjs canonicalKey wiring - reader migration (mcp, resilience, aviation, displacement, regional-snapshot) - direct writers — ais-relay.cjs, consumer-prices-core publish.ts - public-boundary stripSeedEnvelope + test migration Plan: docs/plans/2026-04-14-002-fix-runseed-zero-record-lockout-plan.md fix(seed-contract): unwrap envelopes in internal cross-seed readers After PR 2a enveloped 91 canonical keys as {_seed, data}, every script-side reader that returned the raw parsed JSON started silently handing callers the envelope instead of the bare payload. WoW baselines (bigmac, grocery-basket, fear-greed) saw undefined .countries / .composite; seed-climate-anomalies saw undefined .normals from climate:zone-normals:v1; seed-thermal-escalation saw undefined .fireDetections from wildfire:fires:v1; seed-forecasts' ~40-key pipeline batch returned envelopes for every input. Fix: route every script-side reader through unwrapEnvelope(...).data. Legacy bare-shape values pass through unchanged (unwrapEnvelope returns {_seed: null, data: raw} for any non-envelope shape). Changed: - scripts/_seed-utils.mjs: import unwrapEnvelope; redisGet, readSeedSnapshot, verifySeedKey all unwrap. Exported new readCanonicalValue() helper for cross-seed consumers. - 18 seed-.mjs scripts with local redisGet-style helpers or inline fetch patched to unwrap via the envelope source module (subagent sweep). - scripts/seed-forecasts.mjs pipeline batch: parse() unwraps each result. - scripts/seed-energy-spine.mjs redisMget: unwraps each result. Tests: - tests/seed-utils-envelope-reads.test.mjs: 7 new cases covering envelope + legacy + null paths for readSeedSnapshot and verifySeedKey. - Full seed suite: 67/67 pass (was 61, +6 new). Addresses both of user's P1 findings on PR #3097. feat(seed-contract): envelope-aware reads in server + api helpers Every RPC and public-boundary reader now automatically strips _seed from contract-mode canonical keys. Legacy bare-shape values pass through unchanged (unwrapEnvelope no-ops on non-envelope shapes). Changed helpers (one-place fix — unblocks ~60 call sites): - server/_shared/redis.ts: getRawJson, getCachedJson, getCachedJsonBatch unwrap by default. cachedFetchJson inherits via getCachedJson. - api/_upstash-json.js: readJsonFromUpstash unwraps (covers api/mcp.ts tool responses + all its canonical-key reads). - api/bootstrap.js: getCachedJsonBatch unwraps (public-boundary — clients never see envelope metadata). Left intentionally unchanged: - api/health.js / api/seed-health.js: read only seed-meta:* keys which remain bare-shape during dual-write. unwrapEnvelope already imported at the meta-read boundary (PR 1) as a defensive no-op. Tests: 67/67 seed tests pass. typecheck + typecheck:api clean. This is the blast-radius fix the PR #3097 review called out — external readers that would otherwise see {_seed, data} after the writer side migrated. * fix(test): strip export keyword in vm.runInContext'd seed source cross-source-signals-regulatory.test.mjs loads scripts/seed-cross-source-signals.mjs via vm.runInContext, which cannot parse ESM `export` syntax. PR 2a added `export function declareRecords` to every seeder, which broke this test's static-analysis approach. Fix: strip the `export` keyword from the declareRecords line in the preprocessed source string so the function body still evaluates as a plain declaration. Full test:data suite: 5307/5307 pass. typecheck + typecheck:api clean. * feat(seed-contract): consumer-prices publish.ts writes envelopes Wrap the 5 canonical keys written by consumer-prices-core/src/jobs/publish.ts (overview, movers:7d/30d, freshness, categories:7d/30d/90d, retailer-spread, basket-series) in {_seed, data} envelopes. Legacy seed-meta:<key> writes preserved for dual-write. Inlined a buildEnvelope helper (10 lines) rather than taking a cross-package dependency — consumer-prices-core is a standalone npm package. Documented the four-file parity contract (mjs source, ts mirror, js edge mirror, this copy). Contract fields: sourceVersion='consumer-prices-core-publish-v1', schemaVersion=1, state='OK' (recordCount>0) or 'OK_ZERO' (legitimate zero). Typecheck: no new errors in publish.ts. * fix(seed-contract): 3 more server-side readers unwrap envelopes Found during final audit: - server/worldmonitor/resilience/v1/_shared.ts: resilience score reader parsed cached GetResilienceScoreResponse raw. Contract-mode seed-resilience-scores now envelopes those keys. - server/worldmonitor/resilience/v1/get-resilience-ranking.ts: p05/p95 interval lookup parsed raw from seed-resilience-scores' extra-key path. - server/worldmonitor/infrastructure/v1/_shared.ts: mgetJson() used for count-source keys (wildfire:fires:v1, news:insights:v1) which are both contract-mode now. All three now unwrap via server/_shared/seed-envelope. Legacy shapes pass through unchanged. Typecheck clean. * feat(seed-contract): ais-relay.cjs direct writes produce envelopes 32 canonical-key write sites in scripts/ais-relay.cjs now produce {_seed, data} envelopes. Inlined buildEnvelope() (CJS module can't require ESM source) + envelopeWrite(key, data, ttlSeconds, meta) wrapper. Enveloped keys span market bootstrap, aviation, cyber-threats, theater-posture, weather-alerts, economic spending/fred/worldbank, tech-events, corridor-risk, usni-fleet, shipping-stress, social:reddit, wsb-tickers, pizzint, product-catalog, chokepoint transits, ucdp-events, satellites, oref. Left bare (not seeded data keys): seed-meta:* (dual-write legacy), classifyCacheKey LLM cache, notam:prev-closed-state internal state, wm:notif:scan-dedup flags. Updated tests/ucdp-seed-resilience.test.mjs regex to accept both upstashSet (pre-contract) and envelopeWrite (post-contract) call patterns. * feat(seed-contract): 15 bundle files add canonicalKey for envelope gate 54 bundle sections across 12 files now declare canonicalKey alongside the existing seedMetaKey. _bundle-runner.mjs (from PR 1) prefers canonicalKey when both are present — gates section runs on envelope._seed.fetchedAt read directly from the data key, eliminating the meta-outlives-data class of bugs. Files touched: - climate (5), derived-signals (2), ecb-eu (3), energy-sources (6), health (2), imf-extended (4), macro (10), market-backup (9), portwatch (4), relay-backup (2), resilience-recovery (5), static-ref (2) Skipped (14 sections, 3 whole bundles): multi-key writers, dynamic templated keys (displacement year-scoped), or non-runSeed orchestrators (regional brief cron, resilience-scores' 222-country publish, validation/ benchmark scripts). These continue to use seedMetaKey or their own gate. seedMetaKey preserved everywhere — dual-write. _bundle-runner.mjs falls back to legacy when canonicalKey is absent. All 15 bundles pass node --check. test:data: 5307/5307. typecheck:all: clean. * fix(seed-contract): 4 PR #3097 review P1s — transform/declareRecords mismatches + envelope leaks Addresses both P1 findings and the extra-key seed-meta leak surfaced in review: 1. runSeed helper-level invariant: seed-meta:* keys NEVER envelope. scripts/_seed-utils.mjs exports shouldEnvelopeKey(key) — returns false for any key starting with 'seed-meta:'. Both atomicPublish (canonical) and writeExtraKey (extras) gate the envelope wrap through this helper. Fixes seed-iea-oil-stocks' ANALYSIS_META_EXTRA_KEY silently getting enveloped, which broke health.js parsing the value as bare {fetchedAt, recordCount}. Also defends against any future manual writeExtraKey(..., envelopeMeta) call that happens to target a seed-meta:* key. 2. seed-token-panels canonical + extras fixed. publishTransform returns data.defi (the defi panel itself, shape {tokens}). Old declareRecords counted data.defi.tokens + data.ai.tokens + data.other.tokens on the transformed payload → 0 → RETRY path → canonical market:defi-tokens:v1 never wrote, and because runSeed returned before the extraKeys loop, market:ai-tokens:v1 + market:other-tokens:v1 stayed stale too. New: declareRecords counts data.tokens on the transformed shape. AI_KEY + OTHER_KEY extras reuse the same function (transforms return structurally identical panels). Added isMain guard so test imports don't fire runSeed. 3. api/product-catalog.js cached reader unwraps envelope. ais-relay.cjs now envelopes product-catalog:v2 via envelopeWrite(). The edge reader did raw JSON.parse(result) and returned {_seed, data} to clients, breaking the cached path. Fix: import unwrapEnvelope from ./_seed-envelope.js, apply after JSON.parse. One site — :238-241 is downstream of getFromCache(), so the single reader fix covers both. 4. Regression lock tests/seed-contract-transform-regressions.test.mjs (11 cases): - shouldEnvelopeKey invariant: seed-meta:* false, canonical true - Token-panels declareRecords works on transformed shape (canonical + both extras) - Explicit repro of pre-fix buggy signature returning 0 — guards against revert - resolveRecordCount accepts 0, rejects non-integer - Product-catalog envelope unwrap returns bare shape; legacy passes through Verification: - npm run test:data → 5318/5318 pass (was 5307 — 11 new regressions) - npm run typecheck:all → clean - node --check on every modified script iea-oil-stocks canonical declareRecords was NOT broken (user confirmed during review — buildIndex preserves .members); only its ANALYSIS_META_EXTRA_KEY was affected, now covered generically by commit 1's helper invariant. * fix(seed-contract): seed-token-panels validateFn also runs on post-transform shape Review finding: fixing declareRecords wasn't sufficient — atomicPublish() runs validateFn(publishData) on the transformed payload too. seed-token-panels' validate() checked data.defi/.ai/.other on the transformed {tokens} shape, returned false, and runSeed took the early skipped-write branch (before even reaching the declareRecords RETRY logic). Net effect: same as before the declareRecords fix — canonical + both extras stayed stale. Fix: validate() now checks the canonical defi panel directly (Array.isArray (data?.tokens) && has at least one t.price > 0). AI/OTHER panels are validated implicitly by their own extraKey declareRecords on write. Audited the other 9 seeders with publishTransform (bls-series, bis-extended, bis-data, gdelt-intel, trade-flows, iea-oil-stocks, jodi-gas, sanctions-pressure, forecasts): all validateFn's correctly target the post-transform shape. Only token-panels regressed. Added 4 regression tests (tests/seed-contract-transform-regressions.test.mjs): - validate accepts transformed panel with priced tokens - validate rejects all-zero-price tokens - validate rejects empty/missing tokens - Explicit pre-fix repro (buggy old signature fails on transformed shape) Verification: - npm run test:data → 5322/5322 pass (was 5318; +4 new) - npm run typecheck:all → clean - node --check clean * feat(seed-contract): add /api/seed-contract-probe validation endpoint Single machine-readable gate for 'is PR #3097 working in production'. Replaces the curl/jq ritual with one authenticated edge call that returns HTTP 200 ok:true or 503 + failing check list. What it validates: - 8 canonical keys have {_seed, data} envelopes with required data fields and minRecords floors (fsi-eu, zone-normals, 3 token panels + minRecords guard against token-panels RETRY regression, product-catalog, wildfire, earthquakes). - 2 seed-meta:* keys remain BARE (shouldEnvelopeKey invariant; guards against iea-oil-stocks ANALYSIS_META_EXTRA_KEY-class regressions). - /api/product-catalog + /api/bootstrap responses contain no '_seed' leak. Auth: x-probe-secret header must match RELAY_SHARED_SECRET (reuses existing Vercel↔Railway internal trust boundary). Probe logic is exported (checkProbe, checkPublicBoundary, DEFAULT_PROBES) for hermetic testing. tests/seed-contract-probe.test.mjs covers every branch: envelope pass/fail on field/records/shape, bare pass/fail on shape/field, missing/malformed JSON, Redis non-2xx, boundary seed-leak detection, DEFAULT_PROBES sanity (seed-meta invariant present, token-panels minRecords guard present). Usage: curl -H "x-probe-secret: $RELAY_SHARED_SECRET" \ https://api.worldmonitor.app/api/seed-contract-probe PR 3 will extend the probe with a stricter mode that asserts seed-meta:* keys are GONE (not just bare) once legacy dual-write is removed. Verification: - tests/seed-contract-probe.test.mjs → 15/15 pass - npm run test:data → 5338/5338 (was 5322; +16 new incl. conformance) - npm run typecheck:all → clean * fix(seed-contract): tighten probe — minRecords on AI/OTHER + cache-path source header Review P2 findings: the probe's stated guards were weaker than advertised. 1. market:ai-tokens:v1 + market:other-tokens:v1 probes claimed to guard the token-panels extra-key RETRY regression but only checked shape='envelope' + dataHas:['tokens']. If an extra-key declareRecords regressed to 0, both probes would still pass because checkProbe() only inspects _seed.recordCount when minRecords is set. Now both enforce minRecords: 1. 2. /api/product-catalog boundary check only asserted no '_seed' leak — which is also true for the static fallback path. A broken cached reader (getFromCache returning null or throwing) could serve fallback silently and still pass this probe. Now: - api/product-catalog.js emits X-Product-Catalog-Source: cache\|dodo\|fallback on the response (the json() helper gained an optional source param wired to each of the three branches). - checkPublicBoundary declaratively requires that header's value match 'cache' for /api/product-catalog, so a fallback-serve fails the probe with reason 'source:fallback!=cache' or 'source:missing!=cache'. Test updates (tests/seed-contract-probe.test.mjs): - Boundary check reworked to use a BOUNDARY_CHECKS config with optional requireSourceHeader per endpoint. - New cases: served-from-cache passes, served-from-fallback fails with source mismatch, missing header fails, seed-leak still takes precedence, bad status fails. - Token-panels sanity test now asserts minRecords≥1 on all 3 panels. Verification: - tests/seed-contract-probe.test.mjs → 17/17 pass (was 15, +2 net) - npm run test:data → 5340/5340 - npm run typecheck:all → clean	2026-04-15 09:16:27 +04:00
Elie Habib	dc10e47197	feat(seed-contract): PR 1 foundation — envelope + contract + conformance test (#3095 ) * feat(seed-contract): PR 1 foundation — envelope helpers + contract validators + static conformance test Adds the foundational pieces for the unified seed contract rollout described in docs/plans/2026-04-14-002-fix-runseed-zero-record-lockout-plan.md. Behavior- preserving by construction: legacy-shape Redis values unwrap as { _seed: null, data: raw } and pass through every helper unchanged. New files: - scripts/_seed-envelope-source.mjs — single source of truth for unwrapEnvelope, stripSeedEnvelope, buildEnvelope. - api/_seed-envelope.js — edge-safe mirror (AGENTS.md:80 forbids api/* importing from server/). - server/_shared/seed-envelope.ts — TS mirror with SeedMeta, SeedEnvelope, UnwrapResult types. - scripts/_seed-contract.mjs — SeedContractError + validateDescriptor (10 required fields, 10 optional, unknown-field rejection) + resolveRecordCount (non-negative integer or throw). - scripts/verify-seed-envelope-parity.mjs — diffs function bodies between the two JS copies; TS copy guarded by tsc. - tests/seed-envelope.test.mjs — 14 tests for the three helpers (null, legacy-passthrough, stringified JSON, round-trip). - tests/seed-contract.test.mjs — 25 tests for validateDescriptor/ resolveRecordCount + a soft-warn conformance scan that STATICALLY parses scripts/seed-.mjs (never dynamic import — several seeders process.exit() at module load). Currently logs 91 seeders awaiting declareRecords migration. Wiring (minimal, behavior-preserving): - api/health.js: imports unwrapEnvelope; routes readSeedMeta's parsed value through it. Legacy meta has no _seed wrapper → passes through unchanged. - scripts/_bundle-runner.mjs: readSectionFreshness prefers envelope at section.canonicalKey when present, falls back to the existing seed-meta:<key> read via section.seedMetaKey (unchanged path today since no bundle defines canonicalKey yet). No seeder modified. No writes changed. All 5279 existing data tests still green; both typechecks clean; parity verifier green; 39 new tests pass. PR 2 will migrate seeders, bundles, and readers to envelope semantics. PR 3 removes the legacy path and hard-fails the conformance test. fix(seed-contract): address PR #3095 review — metaTtlSeconds opt, bundle fallback, strict conformance mode Review findings applied: P1 — metaTtlSeconds missing from OPTIONAL_FIELDS whitelist. scripts/seed-jodi-gas.mjs:250 passes metaTtlSeconds to runSeed(); field is consumed by _seed-utils writeSeedMeta. Without it in the whitelist, PR 2's validateDescriptor wiring would throw 'unknown field' the moment jodi-gas migrates. Added with a 'removed in PR 3' note. P2 — Bundle canonicalKey short-circuit over-runs during migration. readSectionFreshness previously returned null if canonicalKey had no envelope yet, even when a legacy seed-meta key was also declared — making every cron re-run the section. Fixed to fall through to seedMetaKey on null envelope so the transition state is safe. P3 — Conformance soft-warn signal was invisible in CI. tests/seed-contract.test.mjs now emits a t.diagnostic summary line ('N/M seeders export declareRecords') visible on every run and gates hard-fail behind SEED_CONTRACT_STRICT=1 so PR 3 can flip to strict without more code. Nitpick — parity regex missed 'export async function'. Added '(?:async\s+)?' to scripts/verify-seed-envelope-parity.mjs function extraction regex. Verified: 39 tests green, parity verifier green, strict mode correctly hard-fails with 91 seeders missing (expected during PR 1). * fix(seed-contract): address review round 2 — NaN/empty-string validation, Error cause, parity CI wiring P2 — Non-finite ttlSeconds/maxStaleMin bypassed validation. `typeof NaN === 'number'` and `NaN > 0 === false` meant a NaN duration passed the old typeof+<=0 checks and would have poisoned TTLs once validateDescriptor is wired into runSeed. Now gated by Number.isFinite, which rejects NaN and ±Infinity. Tests added for NaN/Infinity on both fields. P2 — Empty/whitespace-only strings for domain/resource/canonicalKey/sourceVersion bypassed validation. Added .trim() === '' rejection + tests per field. This mattered because canonicalKey='' would have landed writes at the empty key and seed-meta under a blank resource namespace. P3 — SeedContractError silently dropped the Error v2 cause option. Constructor now forwards { cause } through super() so err.cause works with standard tooling (Node's default stack printer, Sentry chained-cause serialization). resolveRecordCount's manual err.cause = err assignment was replaced with the options-bag form. Test added for both constructor direct-use and the resolveRecordCount wrap path. P3 — Parity verifier was not on an automated path. Added tests/seed-envelope-parity.test.mjs which spawns scripts/verify-seed-envelope-parity.mjs via execFile; non-zero exit (drift) → test fails. Now runs as part of `npm run test:data` (tsx --test tests/.test.mjs). Drift injection confirmed: sed -i modifying api/_seed-envelope.js makes the test fail with 'Command failed' from execFile. 51 tests total (was 39). All green on clean tree. fix(seed-contract): conformance test checks full descriptor, not just declareRecords Previous conformance check green-lit any seeder that exported declareRecords, even if the runSeed(...) call-site omitted other validateDescriptor-required opts (validateFn, ttlSeconds, sourceVersion, schemaVersion, maxStaleMin). That would have produced a false readiness signal for PR 3's strict flip: test goes green, but wiring validateDescriptor() into runSeed in PR 2 would still throw at runtime across the fleet. Examples verified on the PR head: - scripts/seed-cot.mjs:188-192 — no sourceVersion/schemaVersion/maxStaleMin - scripts/seed-market-breadth.mjs:121-124 — same - scripts/seed-jodi-gas.mjs:248-253 — no schemaVersion/maxStaleMin Now the conformance test: 1. AST-lite extracts the runSeed(...) call site with balanced parens, tolerating strings and comments. 2. Checks every REQUIRED_OPTS_FIELDS entry (validateFn, declareRecords, ttlSeconds, sourceVersion, schemaVersion, maxStaleMin) is present as an object key in that call-site. 3. Emits a per-file diagnostic listing missing fields. 4. Migration signal is now accurate: 0/91 seeders fully satisfy the descriptor (was claiming 0/91 missing just declareRecords). Matches the underlying validateDescriptor behavior. Verified: strict mode (SEED_CONTRACT_STRICT=1) surfaces 'opt:schemaVersion, opt:maxStaleMin' as missing fields per seeder — actionable for PR 2 migration work. 51 tests total (unchanged count; behavior change is in which seeders the one conformance test considers migrated). * fix(seed-contract): strip comments/strings before parsing runSeed() call site The conformance scanner located the first 'runSeed(' substring in the raw source, which caught commented-out mentions upstream of the real call. Offending files where this produced false 'incomplete' diagnoses: - scripts/seed-bis-data.mjs:209 // runSeed() calls process.exit(0)… real call at :220 - scripts/seed-economy.mjs:788 header comment mentioning runSeed() real call at :891 Three files had the same pattern. Under strict mode these would have been false hard failures in PR 3 even when the real descriptor was migrated. Fix: - stripCommentsAndStrings(src) produces a view where block comments, line comments, and string/template literals are replaced with spaces (line feeds preserved). Indices stay aligned with the original source so extractRunSeedCall can match against the stripped view and then slice the original source for the real call body. - descriptorFieldsPresent() also runs its field-presence regex against the stripped call body so '// TODO: validateFn' inside the call doesn't fool the check. - hasRunSeedCall() uses the stripped view too, which correctly excludes 5 seeders that only mentioned runSeed in comments. Count dropped 91→86 real callers. Added 4 targeted tests covering: - runSeed() inside a line comment ahead of the real call - runSeed() inside a block comment - runSeed() inside a string literal ("don't call runSeed() directly") - descriptor field names inside an inline comment don't count as present Verified on the actual files: seed-bis-data.mjs first real runSeed( in stripped source is at line 220 (was line 209 before fix). 40 tests total, all green. * fix(seed-contract): parity verifier survives unbalanced braces in string/template literals Addresses Greptile P2 on PR #3095: the body extractor in scripts/verify-seed-envelope-parity.mjs counted raw { and } on every character. A future helper body that legitimately contains `const marker = '{'` would have pushed depth past zero at the literal brace and truncated the body — silently masking drift in the rest of the function. Extracted the scan into scanBalanced(source, start, open, close) which skips characters inside line comments, block comments, and string / template literals (with escape handling and template-literal ${} recursion for interpolation). Call sites in extractFunctions updated to use the new scanner for both the arg-list parens and the function body braces. Made extractFunctions and scanBalanced exported so the new test file can exercise them directly. Gated main() behind an isMain check so importing the module from tests doesn't trigger process.exit. New tests in tests/seed-envelope-parity.test.mjs: - extractFunctions tolerates unbalanced braces in string literals - same for template literals - same for braces inside block comments - same for braces inside line comments - scanBalanced respects backslash-escapes inside strings - scanBalanced recurses into template-literal ${} interpolation Also addresses the other two Greptile P2s which were already fixed in earlier commits on this branch: - Empty-string gap (`99646dd9a`): .trim()==='' rejection added - SeedContractError cause drop (`99646dd9a`): constructor forwards cause through super's options bag per Error v2 spec 61 tests green. Both typechecks clean.	2026-04-14 22:11:56 +04:00
Elie Habib	e32d9b631c	feat(market): Hyperliquid perp positioning flow as leading indicator (#3074 ) * feat(market): Hyperliquid perp positioning flow as leading indicator Adds a 4-component composite (funding × volume × OI × basis) "positioning stress" score for ~14 perps spanning crypto (BTC/ETH/SOL), tokenized gold (PAXG), commodity perps (WTI, Brent, Gold, Silver, Pt, Pd, Cu, NatGas), and FX perps (EUR, JPY). Polls Hyperliquid /info every 5min via Railway cron; publishes a single self-contained snapshot with embedded sparkline arrays (60 samples = 5h history). Surfaces as a new "Perp Flow" tab in CommoditiesPanel with separate Commodities / FX sections. Why: existing CFTC COT is weekly + US-centric; market quotes are price-only. Hyperliquid xyz: perps give 24/7 global positioning data that has been shown to lead spot moves on commodities and FX by minutes-to-hours. Implementation: - scripts/seed-hyperliquid-flow.mjs — pure scoring math, symbol whitelist, content-type + schema validation, prior-state read via readSeedSnapshot(), warmup contract (first run / post-outage zeroes vol/OI deltas), missing-symbol carry-forward, $500k/24h min-notional guard to suppress thin xyz: noise. TTL 2700s (9× cadence). - proto/worldmonitor/market/v1/get_hyperliquid_flow.proto + service.proto registration; make generate regenerated client/server bindings. - server/worldmonitor/market/v1/get-hyperliquid-flow.ts — getCachedJson reader matching get-cot-positioning.ts seeded-handler pattern. - server/gateway.ts cache-tier entry (medium). - api/health.js: hyperliquidFlow registered with maxStaleMin:15 (3× cadence) + transitional ON_DEMAND_KEYS gate for the first ~7 days of bake-in. - api/seed-health.js mirror with intervalMin:5. - scripts/seed-bundle-market-backup.mjs entry (NIXPACKS auto-redeploy on scripts/** watch). - src/components/MarketPanel.ts: CommoditiesPanel grows a Perp Flow tab + fetchHyperliquidFlow() RPC method; OI Δ1h derived from sparkOi tail. - src/App.ts: prime via primeVisiblePanelData() + recurring refresh via refreshScheduler.scheduleRefresh() at 5min cadence (panel does NOT own setInterval; matches the App.ts:1251 lifecycle convention). - 28 unit tests covering scoring parity, warmup flag, min-notional guard, schema rejection, missing-symbol carry-forward, post-outage cold start, sparkline cap, alert threshold. Tests: test:data 5169/5169, hyperliquid-flow-seed 28/28, route-cache-tier 5/5, typecheck + typecheck:api green. One pre-existing test:sidecar failure (cloud-fallback origin headers) is unrelated and reproduces on origin/main. * fix(hyperliquid-flow): address review feedback — volume baseline window, warmup lifecycle, error logging Two real correctness bugs and four review nits from PR #3074 review pass. Correctness fixes: 1. Volume baseline was anchored to the OLDEST 12 samples, not the newest. sparkVol is newest-at-tail (shiftAndAppend), so slice(0, 12) pinned the rolling mean to the first hour of data forever once len >= 12. Volume scoring would drift further from current conditions each poll. Switched to slice(-VOLUME_BASELINE_MIN_SAMPLES) so the baseline tracks the most recent window. Regression test added. 2. Warmup flag flipped to false on the second successful poll while volume scoring still needed 12+ samples to activate. UI told users warmup lasted ~1h but the badge disappeared after 5 min. Tied per-asset warmup to real baseline readiness (coldStart OR vol samples < 12 OR prior OI missing). Snapshot-level warmup = any asset still warming. Three new tests cover the persist-through-baseline-build, clear-once-ready, and missing-OI paths. Review nits: - Handler: bare catch swallowed Redis/parse errors; now logs err.message. - Panel: bare catch in fetchHyperliquidFlow hid RPC 500s; now logs. - MarketPanel.ts: deleted hand-rolled RawHyperliquidAsset; mapHyperliquidFlowResponse now takes GetHyperliquidFlowResponse from the generated client so proto drift fails compilation instead of silently. - Seeder: added @ts-check + JSDoc on computeAsset per type-safety rule. - validateUpstream: MAX_UPSTREAM_UNIVERSE=2000 cap bounds memory. - buildSnapshot: logs unknown xyz: perps upstream (once per run) so ops sees when Hyperliquid adds markets we could whitelist. Tests: 37/37 green; typecheck + typecheck:api clean. * fix(hyperliquid-flow): wire bootstrap hydration per AGENTS.md mandate Greptile review caught that AGENTS.md:187 mandates new data sources be wired into bootstrap hydration. Plan had deferred this on "lazy deep-dive signal" grounds, but the project convention is binding. - server/_shared/cache-keys.ts: add hyperliquidFlow to BOOTSTRAP_CACHE_KEYS + BOOTSTRAP_TIERS ('slow' — non-blocking, page-load-parallel). - api/bootstrap.js: add to inlined BOOTSTRAP_CACHE_KEYS + SLOW_KEYS so bootstrap.test.mjs canonical-mirror assertions pass. - src/components/MarketPanel.ts: - Import getHydratedData from @/services/bootstrap. - New mapHyperliquidFlowSeed() normalizes the raw seed-JSON shape (numeric fields) into HyperliquidFlowView. The RPC mapper handles the proto shape (string-encoded numbers); bootstrap emits the raw blob. - fetchHyperliquidFlow now reads hydrated data first, renders immediately, then refreshes from RPC — mirrors FearGreedPanel pattern. Tests: 72/72 green (bootstrap + cache-tier + hyperliquid-flow-seed).	2026-04-14 08:05:40 +04:00
Elie Habib	7aa8dd1bf2	fix(scoring): relay recomputes importanceScore post-LLM + shadow-log v2 + parity test (#3069 ) * fix(scoring): relay recomputes importanceScore post-LLM + shadow-log v2 + parity test Before this change, classified rss_alert events published by ais-relay carried a stale importanceScore: the digest computed it from the keyword-level threat before the LLM upgrade, and the relay republished that value unchanged. Shadow log (2,850 entries / 7 days) had Pearson 0.31 vs human rating with zero events reaching the ≥85 critical gate — the score being measured was the keyword fallback, not the AI classification. Fixes: - ais-relay.cjs: recompute importanceScore locally from the post-LLM level using an exact mirror of the digest formula (SEVERITY_SCORES, SCORE_WEIGHTS, SOURCE_TIERS, formula). Publish includes corroborationCount for downstream shadow-log enrichment. - notification-relay.cjs: delete the duplicate shadowLogScore() call that produced ~50% near-duplicate pairs. Move shadow log to v2 key with JSON-encoded members carrying severity, source, corroborationCount, variant — future calibration cycles get cleaner data. - shadow-score-{report,rank}.mjs: parse both v2 JSON and legacy v1 string members; default to v2, override via SHADOW_SCORE_KEY env. - _classifier.ts: narrow keyword additions — blockade, siege, sanction (singular), escalation → HIGH; evacuation orders (plural) → CRITICAL. - tests/importance-score-parity.test.mjs: extracts tier map and formula from both TS digest and CJS relay sources, asserts identical output across 12 sample cases. Catches any future drift. - tests/relay-importance-recompute.test.mjs + notification-relay-shadow-log .test.mjs: regression tests for the publish path and single-write discipline. Gates remain OFF. After deploy, collect 48h of fresh shadow:score-log:v2 data, re-run scripts/shadow-score-rank.mjs for calibration, then set final IMPORTANCE_SCORE_MIN / high / critical thresholds before enabling IMPORTANCE_SCORE_LIVE=1. See docs/internal/scoringDiagnostic.md (local) for full diagnosis. 🤖 Generated with Claude Sonnet 4.6 via Claude Code + Compound Engineering v2.49.0 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(scoring): PR #3069 review amendments — revert risky keywords + extract SOURCE_TIERS Addresses review findings on PR #3069 (todos/193 through 204). BLOCKING (P1): - Revert 5 keyword additions in _classifier.ts. Review showed `escalation`, `sanction`, `siege`, `blockade`, `evacuation orders` fire HIGH on `de-escalation`, `sanctioned`, `besieged`, `blockaded` (substring matches), and the plural `evacuation orders` is already covered by the singular. Classifier work will land in a separate PR with phrase-based rules. - Delete dead `digestImportanceScore` field from relay allTitles metadata (written in two places, read nowhere). IMPORTANT (P2): - Extract SOURCE_TIERS to shared/source-tiers.{json,cjs} using the existing shared/rss-allowed-domains.* precedent. Dockerfile.relay already `COPY shared/` (whole dir), so no infra change. Deletes 255-line inline duplicate from ais-relay.cjs; TS digest imports the same JSON via resolveJsonModule. Tier-map parity is now structural. - Simplify parity test — tier extraction no longer needed. SEVERITY_SCORES + SCORE_WEIGHTS + scoring function parity retained across 12 cases plus an unknown-level defensiveness case. Deleted no-op regex replace (`.replace(/X/g, 'X')`). Fixed misleading recency docstring. - Pipeline shadow log: ZADD + ZREMRANGEBYSCORE + belt+suspenders EXPIRE now go in a single Upstash /pipeline POST (~50% RT reduction, no billing delta). - Bounded ZRANGE in shadow-score-report.mjs (20k cap + warn if reached). - Bump outbound webhook envelope v1→v2 to signal the new `corroborationCount` field on rss_alert payloads. - Restore rss_alert eventType gate at shadowLogScore caller (skip promise cost for non-rss events). - Align ais-relay scorer comment with reality: it has ONE intentional deviation from digest (`?? 0` on severity for defensiveness, returning 0 vs NaN on unknown levels). Documented + tested. P3: - Narrow loadEnv in scripts to only UPSTASH_REDIS_REST_* (was setting any UPPER_UNDERSCORE env var from .env.local). - Escape markdown specials in rating-sheet.md title embeds. Pre-existing activation blockers NOT fixed here (tracked as todos 196, 203): `/api/notify` accepts arbitrary importanceScore from authenticated Pro users, and notification channel bodies don't escape mrkdwn/Discord markup. Both must close before `IMPORTANCE_SCORE_LIVE=1`. Net: -614 lines (more deleted than added). 26 regression assertions pass. npm run typecheck, typecheck:api, test:data all pass. 🤖 Generated with Claude Sonnet 4.6 via Claude Code + Compound Engineering v2.49.0 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(scoring): mirror source-tiers.{json,cjs} into scripts/shared/ The `scripts-shared-mirror` enforcement in tests/edge-functions.test.mjs requires every .json and .cjs in shared/ to have a byte-identical copy in scripts/shared/ (Railway rootDirectory=scripts deploy bundle cannot reach repo-root shared/). Last commit added shared/source-tiers.{json,cjs} without mirroring them. CI caught it. * fix(scoring): revert webhook envelope to v1 + log shadow-log pipeline failures Two P1/P2 review findings on PR #3069: P1: Bumping webhook envelope to version: '2' was a unilateral breaking change — the other webhook producers (proactive-intelligence.mjs:407, seed-digest-notifications.mjs:736) still emit v1, so the same endpoint would receive mixed envelope versions per event type. A consumer validating `version === '1'` would break specifically on realtime rss_alert deliveries while proactive_brief and digest events kept working. Revert to '1' and document why — `corroborationCount` is an additive payload field, backwards-compatible for typical consumers; strict consumers using `additionalProperties: false` should be handled via a coordinated version bump across all producers in a separate PR. P2: The new shadow-log /pipeline write swallowed all errors silently (no resp.ok check, no per-command error inspection), a regression from the old upstashRest() path which at least logged non-2xx. Since the 48h recalibration cycle depends on shadow:score-log:v2 filling with clean data, a bad auth token or malformed pipeline body would leave operators staring at an empty ZSET with no signal. Now logs HTTP failures and per-command pipeline errors. * docs(scoring): fix stale v1 references + clarify two-copy source-tiers mirror Two follow-up review findings on PR #3069: P2 — Source-tier "single source of truth" comments were outdated. PR #3069 ships TWO JSON copies (shared/source-tiers.json for Vercel edge + main relay container, scripts/shared/source-tiers.json for Railway services using rootDirectory=scripts). Comments in server/_shared/source-tiers.ts and scripts/ais-relay.cjs now explicitly document the mirror setup and point at the two tests that enforce byte-identity: the existing scripts-shared-mirror test (tests/edge-functions.test.mjs:37-48) and a new explicit cross-check in tests/importance-score-parity.test.mjs. Adding the assertion here is belt-and-suspenders: if edge-functions.test.mjs is ever narrowed, the parity test still catches drift. P3 — Stale v1 references in shared metadata/docs. The actual writer moved to shadow:score-log:v2 in notification-relay.cjs, but server/_shared/cache-keys.ts:23-31 still documented v1 and exported the v1 string. No runtime impact (the export has zero importers — relay uses its own local const) but misleading. Updated the doc block to explain both v1 (legacy, self-pruning) and v2 (current), bumped the constant to v2, and added a comment that notification-relay.cjs is the owner. Header comment in scripts/shadow-score-report.mjs now documents the SHADOW_SCORE_KEY override path too. --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-13 21:53:21 +04:00
Elie Habib	cd5ed0d183	feat(seeds): BIS DSR + property prices (2 of 7) (#3048 ) * feat(seeds): BIS DSR + property prices (2 of 7) Ships 2 of 7 BIS dataflows flagged as genuinely new signals in #3026 — the rest are redundant with IMF/WB or are low-fit global aggregates. New seeder: scripts/seed-bis-extended.mjs - WS_DSR household debt service ratio (% income, quarterly) - WS_SPP residential property prices (real index, quarterly) - WS_CPP commercial property prices (real index, quarterly) Gold-standard pattern: atomic publish + writeExtraKey for extras, retry on missing startPeriod, TTL = 3 days (3× 12h cron), runSeed drives seed-meta:economic:bis-extended. Series selection scores dimension matches (PP_VALUATION=R / UNIT_MEASURE=628 for property, DSR_BORROWERS=P / DSR_ADJUST=A for DSR), then falls back to observation count. Wired into: - bootstrap (slow tier) + cache-keys.ts - api/health.js (STANDALONE_KEYS + SEED_META, maxStaleMin = 24h) - api/mcp.ts get_economic_data tool (_cacheKeys + _freshnessChecks) - resilience macroFiscal: new householdDebtService sub-metric (weight 0.05, currentAccountPct rebalanced 0.3 → 0.25) - Housing Cycle tile on CountryDeepDivePanel (Economic Indicators card) with euro-area (XM) fallback for EU member states - seed-bundle-macro Railway cron (BIS-Extended, 12h interval) Tests: tests/bis-extended-seed.test.mjs covers CSV parsing, series selection, quarter math + YoY. Updated resilience golden-value tests for the macroFiscal weight rebalance. Closes #3026 https://claude.ai/code/session_01DDo39mPD9N2fNHtUntHDqN * fix(resilience): unblock PR #3048 on #3046 stack - rebase onto #3046; final macroFiscal weights: govRevenue 0.40, currentAccount 0.20, debtGrowth 0.20, unemployment 0.15, householdDebtService 0.05 (=1.00) - add updateHousingCycle? stub to CountryBriefPanel interface so country-intel dispatch typechecks - add HR to EURO_AREA fallback set (joined euro 2023-01-01) - seed-bis-extended: extend SPP/CPP TTLs when DSR fetch returns empty so the rejected publish does not silently expire the still-good property keys - update resilience goldens for the 5-sub-metric macroFiscal blend * fix(country-brief): housing tile renders em-dash for null change values The new Housing cycle tile used `?? 0` to default qoqChange/yoyChange/change when missing, fabricating a flat "0.0%" label (with positive-trend styling) for countries with no prior comparable period. Fetch path and builders correctly return null; the panel was coercing it. formatPctTrend now accepts null\|undefined and returns an em-dash, matching how other cards surface unavailable metrics. Drop the `?? 0` fallbacks at the three housing call sites. * fix(seed-health): register economic:bis-extended seed-meta monitoring 12h Railway cron writes seed-meta:economic:bis-extended but it was missing from SEED_DOMAINS, so /api/seed-health never reported its freshness. intervalMin=720 matches maxStaleMin/2 (1440/2) from api/health.js. * fix(seed-bis-extended): decouple DSR/SPP/CPP so one fetch failure doesn't block the others Previously validate() required data.entries.length > 0 on the DSR slice after publishTransform pulled it out of the aggregate payload. If WS_DSR fetch failed but WS_SPP / WS_CPP succeeded, validate() rejected the publish → afterPublish() never ran → fresh SPP/CPP data was silently discarded and only the old snapshots got a TTL bump. This treats the three datasets as independent: - SPP and CPP are now published (or have their existing TTLs extended) as side-effects of fetchAll(), per-dataset. A failure in one never affects the others. - DSR continues to flow through runSeed's canonical-key path. When DSR is empty, publishTransform yields { entries: [] } so atomicPublish skips the canonical write (preserving the old DSR snapshot); runSeed's skipped branch extends its TTL and refreshes seed-meta. Shape B (one runSeed call, semantics changed) chosen over Shape A (three sequential runSeed calls) because runSeed owns the lock + process.exit lifecycle and can't be safely called three times in a row, and Shape B keeps the single aggregate seed-meta:economic:bis-extended key that health.js already monitors. Tests cover both failure modes: - DSR empty + SPP/CPP healthy → SPP/CPP written, DSR TTL extended - DSR healthy + SPP/CPP empty → DSR written, SPP/CPP TTLs extended * fix(health): per-dataset seed-meta for BIS DSR/SPP/CPP Health was pointing bisDsr / bisPropertyResidential / bisPropertyCommercial at the shared seed-meta:economic:bis-extended key, which runSeed refreshes on every run (including its validation-failed "skipped" branch). A DSR-only outage therefore left bisDsr reporting fresh in api/health.js while the resilience scorer consumed stale/missing economic:bis:dsr:v1 data. Write a dedicated seed-meta key per dataset ONLY when that dataset actually published fresh entries. The aggregate bis-extended key stays as a "seeder ran" signal in api/seed-health.js. * fix(seed-bis-extended): write DSR seed-meta only after atomicPublish succeeds Previously fetchAll() wrote seed-meta:economic:bis-dsr inline before runSeed/atomicPublish ran. If atomicPublish then failed (Redis hiccup, validate rejection, etc.), seed-meta was already bumped — health would report DSR fresh while the canonical key was stale. Move the DSR seed-meta write into a dsrAfterPublish callback passed to runSeed via the existing afterPublish hook, which fires only after a successful canonical publish. SPP/CPP paths already used this ordering inside publishDatasetIndependently; this brings DSR in line. Adds a regression test exercising dsrAfterPublish with mocked Upstash: populated DSR -> single SET on seed-meta key; null/empty DSR -> zero Redis calls. * fix(resilience): per-dataset BIS seed-meta keys in freshness overrides SOURCE_KEY_META_OVERRIDES previously collapsed economic:bis:dsr:v1 and both property-* sourceKeys onto the aggregate seed-meta:economic:bis-extended key. api/health.js (SEED_META) writes per-dataset keys (seed-meta:economic:bis-dsr / bis-property-residential / bis-property-commercial), so a DSR-only outage showed stale in /api/health but the resilience dimension freshness code still reported macroFiscal inputs as fresh. Map each BIS sourceKey to its dedicated seed-meta key to match health.js. The aggregate bis-extended key is still written by the seeder and read by api/seed-health.js as a "seeder ran" signal, so it is retained upstream. * fix(bis): prefer households in DSR + per-dataset freshness in MCP Greptile review catches on #3048: 1. buildDsr() was selecting DSR_BORROWERS='P' (private non-financial) while the UI labels it "Household DSR" and resilience scoring uses it as `householdDebtService`. Changed to 'H' (households). Countries without an H series now get dropped rather than silently mislabeled. 2. api/mcp.ts get_economic_data still read only the aggregate seed-meta:economic:bis-extended for freshness. If DSR goes stale while SPP/CPP keep publishing, MCP would report the BIS block as fresh even though one of its returned keys is stale. Swapped to the three per-dataset seed-meta keys (bis-dsr, bis-property-residential, bis-property-commercial), matching the fix already applied to /api/health and the resilience dimension-freshness pipeline. --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-04-13 15:05:44 +04:00
Elie Habib	f5d8ff9458	feat(seeds): Eurostat house prices + quarterly debt + industrial production (#3047 ) * feat(seeds): Eurostat house prices + quarterly debt + industrial production Adds three new Eurostat overlay seeders covering all 27 EU members plus EA20 and EU27_2020 aggregates (issue #3028): - prc_hpi_a (annual house price index, 10y sparkline, TTL 35d) key: economic:eurostat:house-prices:v1 complements BIS WS_SPP (#3026) for the Housing cycle tile - gov_10q_ggdebt (quarterly gov debt %GDP, 8q sparkline, TTL 14d) key: economic:eurostat:gov-debt-q:v1 upgrades National Debt card cadence from annual IMF to quarterly for EU - sts_inpr_m (monthly industrial production, 12m sparkline, TTL 5d) key: economic:eurostat:industrial-production:v1 feeds "Real economy pulse" sparkline on Economic Indicators card Shared JSON-stat parser in scripts/_eurostat-utils.mjs handles the EL/GR and EA20 geo quirks and returns full time series for sparklines. Wires each seeder into bootstrap (SLOW_KEYS), health registries (keys + seed-meta thresholds matched to cadence), macro seed bundle, cache-keys shared module, and the MCP tool registry (get_eu_housing_cycle, get_eu_quarterly_gov_debt, get_eu_industrial_production). MCP tool count updated to 31. Tests cover JSON-stat parsing, sparkline ordering, EU-only coverage gating (non-EU geos return null so panels never render blank tiles), validator thresholds, and registry wiring across all surfaces. https://claude.ai/code/session_01Tgm6gG5yUMRoc2LRAKvmza * fix(bootstrap): register new Eurostat keys in tiers, defer consumers Adds eurostatHousePrices/GovDebtQ/IndProd to BOOTSTRAP_TIERS ('slow') to match SLOW_KEYS in api/bootstrap.js, and lists them as PENDING_CONSUMERS in the hydration coverage test (panel wiring lands in follow-up). * fix(eurostat): raise seeder coverage thresholds to catch partial publishes The three Eurostat overlay seeders (house prices, quarterly gov debt, monthly industrial production) all validated with makeValidator(10) against a fixed 29-geo universe (EU27 + EA20 + EU27_2020). A bad run returning only 10-15 geos would pass validation and silently publish a snapshot missing most of the EU. Raise thresholds to near-complete coverage, with a small margin for geos with patchy reporting: - house prices (annual): 10 -> 24 - gov debt (quarterly): 10 -> 24 - industrial prod (monthly): 10 -> 22 (monthly is slightly patchier) Add a guard test that asserts every overlay seeder keeps its threshold >=22 so this regression can't reappear. * fix(seed-health): register 3 Eurostat seed-meta entries house-prices, gov-debt-q, industrial-production were wired in api/health.js SEED_META but missing from api/seed-health.js SEED_DOMAINS, so /api/seed-health would not surface their freshness. intervalMin = health.js maxStaleMin / 2 per convention. --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-04-13 13:00:14 +04:00
Elie Habib	71a6309503	feat(seeds): expand IMF WEO coverage — growth, labor, external themes (#3027 ) (#3046 ) * feat(seeds): expand IMF WEO coverage — growth, labor, external themes (#3027) Adds three new SDMX-3.0 seeders alongside the existing imf-macro seeder to surface 15+ additional WEO indicators across ~210 countries at zero incremental API cost. Bundled into seed-bundle-imf-extended.mjs on the same monthly Railway cron cadence. Seeders + Redis keys: - seed-imf-growth.mjs → economic:imf:growth:v1 NGDP_RPCH, NGDPDPC, NGDP_R, PPPPC, PPPGDP, NID_NGDP, NGSD_NGDP - seed-imf-labor.mjs → economic:imf:labor:v1 LUR (unemployment), LP (population) - seed-imf-external.mjs → economic:imf:external:v1 BX, BM, BCA, TM_RPCH, TX_RPCH (+ derived trade balance) - seed-imf-macro.mjs extended with PCPI, PCPIEPCH, GGX_NGDP, GGXONLB_NGDP All four seeders share the 35-day TTL (monthly WEO release) and ~210 country coverage via the same imfSdmxFetchIndicator helper. Wiring: - api/bootstrap.js, api/health.js, server/_shared/cache-keys.ts — register new keys, mark them slow-tier, add SEED_META freshness thresholds matching the imfMacro entry (70d = 2× monthly cadence) - server/worldmonitor/resilience/v1/_dimension-freshness.ts — override entries for the dash-vs-colon seed-meta key shape - _indicator-registry.ts — add LUR as a 4th macroFiscal sub-metric (enrichment tier, weight 0.15); rebalance govRevenuePct (0.5→0.4) and currentAccountPct (0.3→0.25) so weights still sum to 1.0 - _dimension-scorers.ts — read economic:imf:labor:v1 in scoreMacroFiscal, normalize LUR with goalposts 3% (best) → 25% (worst); null-tolerant so weightedBlend redistributes when labor data is unavailable - api/mcp.ts — new get_country_macro tool bundling all four IMF keys with a single freshness check; describes per-country fields including growth/inflation/labor/BOP for LLM-driven country screening - src/services/imf-country-data.ts — bootstrap-cached client + pure buildImfEconomicIndicators helper - src/app/country-intel.ts — async-fetch the IMF bundle on country selection and merge real GDP growth, CPI inflation, unemployment, and GDP/capita rows into the Economic Indicators card; bumps card cap from 3 → 6 rows to fit live signals + IMF context Tests: - tests/seed-imf-extended.test.mjs — 13 unit tests across the three new seeders' pure helpers (canonical keys, ISO3→ISO2 mapping, aggregate filtering, derived savings-investment gap & trade balance, validate thresholds) - tests/imf-country-data.test.mts — 6 tests for the panel rendering helper, including stagflation flag and high-unemployment trend - tests/resilience-dimension-scorers.test.mts — new LUR sub-metric test (tight vs slack labor); existing scoreMacroFiscal coverage assertions updated for the new 4-metric weight split - tests/helpers/resilience-fixtures.mts — labor fixture for NO/US/YE so the existing macroFiscal ordering test still resolves the LUR weight - tests/bootstrap.test.mjs — register imfGrowth/imfLabor/imfExternal as pending consumers (matching imfMacro) - tests/mcp.test.mjs — bump tools/list count 28 → 29 https://claude.ai/code/session_018enRzZuRqaMudKsLD5RLZv * fix(resilience): update macroFiscal goldens for LUR weight rebalance Recompute pinned fixture values after adding labor-unemployment as 4th macroFiscal sub-metric (weight rebalance in _indicator-registry). Also align seed-imf-external tradeBalance to a single reference year to avoid mixing ex/im values from different WEO vintages. * fix(seeds): tighten IMF coverage gates to reject partial snapshots IMF WEO growth/labor/external indicators report ~210 countries for healthy runs. Previous thresholds (150/100/150) let a bad IMF run overwrite a good snapshot with dozens of missing countries and still pass validation. Raise all three to >=190, matching the pattern of sibling seeders and leaving a ~20-country margin for indicators with slightly narrower reporting. Labor validator unions LUR + population (LP), so healthy coverage tracks LP (~210), not LUR (~100) — the old 100 threshold was based on a misread of the union logic. * fix(seed-health): register imf-growth/labor/external seed-meta keys Missing SEED_DOMAINS entries meant the 3 new IMF WEO seeders (growth, labor, external) had no /api/seed-health visibility. intervalMin=50400 matches health.js maxStaleMin/2 (100800/2) — same monthly WEO cadence as imf-macro. --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-04-13 12:51:35 +04:00
Elie Habib	3696aba2d1	fix(infra): sync health/bootstrap/cache-keys parity (4 keys + 6 DATA_KEYS + 5 SEED_META) (#3015 ) * fix(infra): sync health/bootstrap/cache-keys parity (4 BOOTSTRAP_CACHE_KEYS + 6 DATA_KEYS + 5 SEED_META) Audit found 4 bootstrap.js keys (consumerPrices) missing from cache-keys.ts BOOTSTRAP_CACHE_KEYS, 6 bootstrapped keys invisible to health DATA_KEYS monitoring (cryptoSectors, ddosAttacks, economicStress, insights, predictions, trafficAnomalies), and 5 bootstrapped keys with no SEED_META staleness detection (cryptoSectors, ddosAttacks, economicStress, marketImplications, trafficAnomalies). Keys without seed-meta writers (bisCredit, bisExchange, giving, minerals, serviceStatuses, temporalAnomalies) were verified as on-demand/derived and correctly skipped. fix(health): write seed-meta on empty DDoS/anomalies data Prevents false STALE_SEED alerts when Cloudflare returns zero events. Extracts writeSeedMeta() helper from writeExtraKeyWithMeta(). * fix(health): remove duplicate insights/predictions aliases, fix test regex P1: insights/predictions duplicate newsInsights/predictionMarkets P2: keyRe now captures non-versioned consumer-prices keys * fix(health): add ddosAttacks/trafficAnomalies to EMPTY_DATA_OK_KEYS Zero DDoS events or traffic anomalies is a valid quiet-period state, not a critical failure.	2026-04-12 20:38:08 +04:00
Elie Habib	793d7df9dc	feat(energy-crisis): add IEA 2026 Energy Crisis Policy Response Tracker panel and seeder (#3008 )	2026-04-12 15:09:54 +04:00
Elie Habib	c26ae6b827	feat(energy): add Oil Inventories panel with SVG charts (#3003 )	2026-04-12 14:04:31 +04:00
Elie Habib	c72251178c	feat(route-explorer): Sprint 4 — strategic-product impact tab + get-route-impact RPC (#2996 ) * feat(route-explorer): Sprint 4 — strategic-product impact tab Adds the Impact tab to the Route Explorer, powered by a new get-route-impact RPC that returns strategic-product trade data for any country pair. Backend: - New proto get_route_impact.proto with GetRouteImpact{Request,Response} + StrategicProduct message - New handler server/worldmonitor/supply-chain/v1/get-route-impact.ts: reads comtrade:bilateral-hs4:{iso2}:v1 store, computes lane value for selected HS2, top 5 strategic products by value with chokepoint exposure, resilience score (server-side from Redis), dependency flags - Cache key ROUTE_IMPACT_KEY in cache-keys.ts (NOT in BOOTSTRAP_KEYS) - Gateway + premium-paths registered as slow-browser premium RPC - Client wrapper fetchRouteImpact in supply-chain/index.ts Impact tab UI: - CountryImpactTab.ts: strategic products table (top 5 by value), lane value card for selected HS2, hs2InSeededUniverse banner when HS2 is not in the 14 seeded sectors, comtradeSource states (missing/empty/bilateral-hs4), drill-sideways on product row click - LeftRail.updateDependencyFlags: renders flags from Impact response with color-coded badges (compound_risk/single_source/diversifiable) Data flow: - fetchImpact fires in parallel with fetchResilience after lane data loads, generation-scoped - Impact response updates left-rail flags + resilience score - Drill-sideways: clicking a product row switches the explorer's HS2 and re-queries all tabs Server-side resilience: - get-route-impact reads resilience:score:v8:{iso2} from Redis directly so the data is available for future email briefs without client calls Plan: docs/plans/2026-04-11-001-feat-worldwide-route-explorer-plan.md * fix(route-explorer): real exposure score for flags + tabstrip sync on drill P1: computeDependencyFlags hardcoded primaryExposure=80 whenever any chokepoint existed, fabricating SINGLE_CORRIDOR_CRITICAL without using real exposure data. Replaced with computeRealExposureScore that uses the same route-cluster overlap logic as get-sector-dependency, computing the actual exposure percentage before comparing against the >80 threshold. P2: handleDrillSideways set state.tab=1 directly without going through setTab(), leaving the tabstrip visually and semantically on Impact while content showed Current. Now calls setTab(1) which updates both the tabstrip active state and aria-selected. * fix(route-explorer): guard resilience overwrite + normalize HS2 filter P1: fetchImpact could zero the left-rail resilience score when get-route-impact returned resilienceScore=0 (Redis miss fallback), overwriting a valid score set by the concurrent fetchResilience call. Now only applies the server-side score when it is actually > 0. P2: HS4-to-HS2 matching used a redundant dual-condition filter (hs4ToHs2 + startsWith) that masked a potential normalization bug. Simplified to normalize hs2 once via parseInt then use a single hs4ToHs2 comparison.	2026-04-12 10:25:13 +04:00
Elie Habib	822eef0fa6	feat(supply-chain): Sprint 1 — Route Explorer wrapper RPC (#2980 ) * feat(supply-chain): Sprint 1 — Route Explorer wrapper RPC Adds an internal wrapper around the vendor-only route-intelligence compute so the upcoming Route Explorer UI can call it from a browser PRO session instead of forcing an X-WorldMonitor-Key API gate. Backend: - New proto get-route-explorer-lane.proto with GetRouteExplorerLane{Request,Response} - New handler server/worldmonitor/supply-chain/v1/get-route-explorer-lane.ts - New static lookup tables _route-explorer-static-tables.ts: TRANSIT_DAYS_BY_ROUTE_ID, FREIGHT_USD_BY_CARGO_TYPE, BYPASS_CORRIDOR_GEOMETRY_BY_ID — covers all 5 land-bridge corridors plus every sea-alternative corridor with hand-curated coordinates - Wired into supply-chain handler.ts service dispatcher - Cache key ROUTE_EXPLORER_LANE_KEY in cache-keys.ts (NOT in BOOTSTRAP_KEYS) - Gateway entry: PREMIUM_RPC_PATHS + RPC_CACHE_TIER 'slow-browser' - Premium path entry in src/shared/premium-paths.ts so browser PRO auth attaches Response contract enriches route-intelligence with: - primaryRouteGeometry polyline from TRADE_ROUTES (lon/lat pairs) - fromPort/toPort coords on every bypass option so the client can call MapContainer.setBypassRoutes directly without geometry lookups - status: 'active' \| 'proposed' \| 'unavailable' derived from corridor notes to honestly label kra_canal_future and black_sea_western_ports - estTransitDaysRange + estFreightUsdPerTeuRange from static tables - noModeledLane: true when origin/destination clusters share no routes Client wrapper fetchRouteExplorerLane added to src/services/supply-chain/index.ts. Tests: tests/route-explorer-lane.test.mts — 30-query smoke matrix (10 country pairs × 3 HS2 codes), structural assertions only, no hard-coded transit/cost values. Test exposes a pure computeLane() function with an injectable status map so it does not need Redis. Gap report (from smoke run): 12 of 30 queries fall back to a synthetic primaryRouteId because the destination's port cluster has no shared route with the origin (US-JP, ZA-IN, CL-CN, TR-DE × 3 HS2 each). These pairs return noModeledLane:true; Sprint 3 will render an empty-state for them. Plan: docs/plans/2026-04-11-001-feat-worldwide-route-explorer-plan.md * fix(route-explorer): address PR #2980 review findings P1: bypass warRiskTier was hard-coded to WAR_RISK_TIER_NORMAL, dropping the live risk signal from chokepoint status. Now derived from the statusMap via the corridor's primaryChokepointId. P2: freight fallback in emptyResponse and client-side empty payload used a cargo-agnostic container range for all cargo types. Removed the ranges entirely from fallback/noModeledLane responses; they are only present when the lane is actually modeled. Suggestion: when noModeledLane is true, the response now returns empty primaryRouteId, empty geometry, empty exposures, empty bypasses, and omits transit/freight ranges. Previously it returned plausible-looking synthetic data from the origin's first route which could mislead the UI. Tests updated to assert the noModeledLane contract: empty fields when the flag is set, non-empty ranges only when the lane is modeled. * fix(route-explorer): cargo-aware route ranking + bypass waypoint risk P1: primary route selection was order-dependent, picking whichever shared route the origin cluster listed first. Mixed clusters like CN/JP could return an energy lane for a container request. Now ranks shared routes by cargo-category compatibility (container→container, tanker→energy, bulk→bulk, roro→container) before selecting. P1: bypass warRiskTier was copied from the primary chokepoint instead of derived from the corridor's own waypointChokepointIds. This overstated risk for alternatives like Cape of Good Hope whose waypoints may have a lower risk tier. Now uses max-tier across waypoint chokepoints, matching get-bypass-options.ts logic. Suggestion: placeholder corridors with addedTransitDays=0 (like gibraltar_no_bypass, cape_of_good_hope_is_bypass) are now filtered out. Previously they could surface as active alternatives. Regression tests added: - CN→JP tanker: asserts energy route is selected over container route - CN→DE with faked Suez=CRITICAL / Cape=NORMAL: asserts Cape bypass shows NORMAL, not CRITICAL - ES→EG: asserts zero-transit-day placeholders are excluded * fix(route-explorer): scope exposures to primary route + narrow placeholder filter P1: chokepointExposures and bypassOptions were computed from the full sharedRoutes set, mixing data from energy/container corridors into a single response. Now scoped to the cargo-ranked primaryRouteId only, matching the proto contract that exposures are "on the primary route." P2: the addedTransitDays === 0 filter was too broad and removed kra_canal_future (a proposed bypass with real modeling). Narrowed to an explicit PLACEHOLDER_CORRIDOR_IDS set (gibraltar_no_bypass, cape_of_good_hope_is_bypass) so proposed zero-day corridors survive and are surfaced with CORRIDOR_STATUS_PROPOSED. Regression tests: - chokepointExposures follow primaryRouteId (CN->JP container) - kra_canal_future appears as CORRIDOR_STATUS_PROPOSED for Malacca routes - placeholder filter still excludes explicit placeholders * fix(route-explorer): address PR #2980 review comments 1. Unavailable corridors without waypoints (e.g. black_sea_western_ports) now derive WAR_RISK_TIER_WAR_ZONE from their CORRIDOR_STATUS_UNAVAILABLE status, instead of returning WAR_RISK_TIER_UNSPECIFIED. Corridors with waypointChokepointIds still use max-tier across those waypoints. 2. Added fixture test with non-empty status map (suez=75/HIGH, malacca=30/ELEVATED) so disruptionScore and warRiskTier assertions are not trivially satisfied by the empty-map default path. 3. Documented the single-chokepoint bypass design gap in the test gap report: bypassOptions only cover the primary chokepoint; multi-chokepoint routes show exposure for all but bypass guidance for only the top one. Sprint 3 will decide whether to expand to top-N or add a UI hint.	2026-04-12 08:16:02 +04:00
Elie Habib	cf028b34b2	fix(resilience): address PR #2947 review (3 P2 nits on staleness classifier) (#2953 ) Addresses all three Greptile P2 comments on PR #2947: 1. Redundant `Math.max(0, ...)` guard (line 97). The defensive branch three lines above already rejects null, undefined, NaN, and future timestamps, so `nowMs - lastObservedAtMs` is guaranteed to be `>= 0` by the time execution reaches the age calculation. The `Math.max` was a misleading hint that a negative value could arrive. Removed, with a comment explaining why the branch above covers the case. 2. `ageMs` can be `Infinity`, worth documenting on the field. When the defensive branch returns, `ageMs` and `ageInCadenceUnits` are both `Number.POSITIVE_INFINITY`, but the field type was just `number`, so callers doing arithmetic or string formatting on the value (e.g. a "last seen X ago" label) would silently emit `Infinity` / `NaN` strings. Added JSDoc to both fields explaining the infinity contract and recommending `Number.isFinite` before arithmetic. 3. Missing `ageInCadenceUnits` assertion for defensive cases. The earlier test checked `ageMs === POSITIVE_INFINITY` only for the null branch, and never checked `ageInCadenceUnits` at all. Hardened the defensive-cases test to pin BOTH fields as `Number.POSITIVE_INFINITY` across every branch (null, undefined, NaN, future). Coverage went from 1 of 8 field/branch pairs to all 8. If a future regression drops `ageInCadenceUnits` from the defensive return, the test now fails loudly. Testing: - npx tsx --test tests/resilience-freshness.test.mts: 10/10 pass - npm run typecheck: clean Generated with Claude Opus 4.6 (1M context) via Claude Code + Compound Engineering v2.49.0 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-11 20:04:26 +04:00
Elie Habib	d35dc4110e	feat(resilience): staleness classifier foundation (T1.5) (#2947 ) Why this PR? Ships the foundation-only slice of Phase 1 T1.5 of the country- resilience reference-grade upgrade plan: a pure staleness classifier that maps a `lastObservedAt` timestamp and a source cadence to one of three staleness levels (fresh, aging, stale). This is the primitive that T1.6 (widget dimension confidence bar with freshness badge) and the later T1.5 scorer propagation pass both consume. Same pattern as the T1.7 foundation PR (#2944): define the type and the primitive in isolation with comprehensive tests, then land the consumer wiring in separate PRs so each unit is bounded and reviewable. What this PR commits: - New module `server/_shared/resilience-freshness.ts` (110 lines) exporting: - `ResilienceCadence` type union covering the 5 cadences the methodology document lists (realtime, daily, weekly, monthly, annual). - `StalenessLevel` type union: fresh / aging / stale. - `cadenceUnitMs(cadence)` helper returning a canonical duration per cadence: realtime = 1 hour, daily = 1 day, weekly = 7 days, monthly = 30 days, annual = 365 days. - `FRESH_MULTIPLIER` = 1.5 and `AGING_MULTIPLIER` = 3. A signal is fresh when age is strictly less than 1.5x its cadence unit, aging when strictly less than 3x, stale otherwise. - `classifyStaleness({ lastObservedAtMs, cadence, nowMs })` pure function returning `{ staleness, ageMs, ageInCadenceUnits }`. Null / undefined / NaN / future timestamps return stale with positive-infinity age. `nowMs` is accepted as a deterministic override for unit testing. - New test file `tests/resilience-freshness.test.mts` (170 lines, 10 tests covering cadence ordering, fresh/aging/stale classification across all 5 cadences, defensive handling of null/NaN/future timestamps, exact threshold boundaries, internal consistency, and classifier purity). What is deliberately NOT in this PR: - No changes to the 13 dimension scorers. Propagating `lastObservedAt` through each scorer and aggregating max age per dimension is the next slice of T1.5 and will consume this classifier as a pure import. - No schema changes (proto, OpenAPI, `ResilienceDimension` response type). The schema field `freshness: { lastObservedAt, staleness }` lands alongside the widget rendering in T1.6. - No widget rendering. T1.6 owns the per-dimension freshness badge UI and will call `classifyStaleness` at render time. Prerequisite PRs verified merged: - #2821 (baseline / stress engine) - #2847 (formula revert + RSF direction fix) - #2858 (seed direct scoring) Related in-flight Phase 1 PRs this session: - #2941 T1.1 regression test - #2943 T1.4 dataVersion widget wire - #2944 T1.7 imputation taxonomy foundation - #2945 T1.3 methodology mdx promotion - #2946 T1.8 methodology doc linter Testing: - npx tsx --test tests/resilience-freshness.test.mts: 10/10 pass - npm run typecheck: clean Generated with Claude Opus 4.6 (1M context) via Claude Code + Compound Engineering v2.49.0 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-11 19:48:11 +04:00
Elie Habib	46c35e6073	feat(breadth): add market breadth history chart (#2932 )	2026-04-11 17:54:26 +04:00
Elie Habib	d3836ba49b	feat(sentiment): add AAII investor sentiment survey (#2930 ) * feat(sentiment): add AAII investor sentiment survey Weekly bull/bear/neutral sentiment from AAII (1987-present). Shows current reading, bull-bear spread, and 52-week historical chart. Seeder fetches from AAII CSV, stores last 52 weeks in Redis. * fix(aaii): wire panel loading + mark fallback data explicitly * fix(aaii): keep panel live across refreshes + surface in health monitoring - fetchData now falls back to /api/bootstrap?keys=aaiiSentiment on refresh (getHydratedData is one-shot and returns undefined after the first read, causing a permanent spinner on hourly refresh) - Shows an error state with auto-retry when both hydrated and bootstrap-fetch miss, matching the WsbTickerScannerPanel pattern - Registered aaiiSentiment in api/health.js BOOTSTRAP_KEYS and api/seed-health.js SEED_DOMAINS so rollout failures and fallback-only operation are observable in the monitoring dashboards * fix(sentiment): handle BIFF8 SST trailing bytes and use UTC for AAII Thursday calc Two P2 greptile fixes from PR #2930 review: 1. BIFF8 SST parser was reading the rich-text run count (cRun, flags & 0x08) and extended-string size (cbExtRst, flags & 0x04) to advance past those header fields, but never skipped the trailing bytes AFTER the char data: 4 * cRun formatting-run bytes and cbExtRst ext-rst bytes. If any string before the column header was rich-text formatted, every subsequent SST entry parsed from the wrong offset, silently breaking XLS extraction and falling back to HTML scraping. 2. parseHtmlSentiment() computed last-Thursday via today.getDay() + setDate(today.getDate() - daysToThursday), both local-TZ-dependent. On Railway (non-UTC TZ) the inferred Thursday could drift by a day, causing the HTML-derived row to mismatch the XLS historical rows. Switched to getUTCDay() + Date.UTC() for TZ-stable arithmetic.	2026-04-11 17:05:39 +04:00
Elie Habib	d1cb0e3c10	feat(sectors): add P/E valuation benchmarking to sector heatmap (#2929 ) * feat(sectors): add P/E valuation benchmarking to sector heatmap Trailing/forward P/E, beta, and returns for 12 sector ETFs from Yahoo Finance. Horizontal bar chart color-coded by valuation level plus sortable table. Extends existing sector data pipeline. * fix(sectors): clear stale valuations on empty refresh + document cache behavior * fix(sectors): force valuation rollout for cached + breaker-persisted bootstraps - Bumped market:sectors bootstrap key v1 -> v2 so stale 24h slow-tier payloads without the new valuations field are invisible to returning users on next page load - Versioned the fetchSectors circuit-breaker (name -> "Sector Summary v2") so old localStorage/IndexedDB entries predating this PR cannot be returned as stale via the SWR path - shouldCache now requires the valuations field to be present on the cached response, not just a non-empty sectors array - loadMarkets no longer clears the valuations tab when a hydrated or fresh payload lacks the field; prior render is left intact, matching the finding's requirement - Defensive check: hydrated payloads without valuations fall through to a live fetch instead of rendering an empty valuations tab * fix(stocks): correct beta3Year source and null YTD color in sector P/E view - scripts/ais-relay.cjs: beta3Year lives on defaultKeyStatistics (ks), not summaryDetail (sd); the previous fallback was a silent no-op. - src/components/MarketPanel.ts: null ytdReturn now renders with var(--text-dim) instead of var(--red); the '--' placeholder no longer looks like a loss. Addresses greptile review on PR #2929.	2026-04-11 16:51:35 +04:00
Elie Habib	2decda6508	feat(wsb): add Reddit/WSB ticker scanner seeder + bootstrap registration (#2916 ) * feat(wsb): add Reddit/WSB ticker scanner seeder + bootstrap registration Seeder in ais-relay.cjs fetches from r/wallstreetbets, r/stocks, r/investing every 10min. Extracts ticker mentions, validates against known ticker set, aggregates by frequency and engagement, writes top 50 to intelligence:wsb-tickers:v1. 4-file bootstrap registration: cache-keys.ts, bootstrap.js, health.js with SEED_META maxStaleMin=30. * fix(wsb): remove duplicate CEO + fix avgUpvoteRatio divisor * fix(wsb): require ticker validation set + condition seed-meta on write + add seed-health 1. Skip seed when ticker validation set is empty (cold start/bootstrap miss) 2. Only write seed-meta after successful canonical write 3. Register in api/seed-health.js for dedicated monitoring * fix(wsb): case-insensitive $ticker matching + BRK.B dotted symbol support * fix(wsb): split $-prefixed vs bare ticker extraction + BRK.B→BRK-B normalization 1. $-prefixed tickers ($nvda, $BRK.B) skip whitelist validation (strong signal) — catches GME, AMC, PLTR etc. not in the narrow market watchlist 2. Bare uppercase tokens still validated against known set (high false-positive) 3. BRK.B normalized to BRK-B before validation (dot→dash) 4. Empty known set no longer skips seed — $-prefixed tickers still extracted * fix(wsb): skip bare-uppercase branch entirely when ticker set unavailable	2026-04-11 07:07:11 +04:00
Elie Habib	a742537ae5	feat(supply-chain): Sprint D — GetSectorDependency RPC + vendor route-intelligence API + webhooks (#2905 ) * feat(supply-chain): Sprint D — GetSectorDependency RPC + vendor route-intelligence API + webhooks * fix(supply-chain): move bypass-corridors + chokepoint-registry to server/_shared to fix api/ boundary violations * fix(supply-chain): webhooks — persist secret, fix sub-resource routing, add ownership check * fix(supply-chain): address PR #2905 review findings - Use SHA-256(apiKey) for ownerTag instead of last-12-chars (unambiguous ownership) - Implement GET /api/v2/shipping/webhooks list route via per-owner Redis Set index - Tighten SSRF: https-only, expanded metadata hostname blocklist, document DNS rebinding edge-runtime limitation - Fix get-sector-dependency.ts stale src/config/ imports → server/_shared/ (Greptile P1) * fix(supply-chain): getSectorDependency returns blank primaryChokepointId for landlocked countries computeExposures() previously mapped over all of CHOKEPOINT_REGISTRY even when nearestRouteIds was empty, producing a full array of score-0 entries in registry insertion order. The caller's exposures[0] then picked the first registry entry (Suez) as the "primary" chokepoint despite primaryChokepointExposure = 0. LI, AD, SM, BT and other landlocked countries were all silently assigned a fake chokepoint. Fix: guard at the top of computeExposures() -- return [] when input is empty so primaryChokepointId stays '' and primaryChokepointExposure stays 0.	2026-04-10 17:12:29 +04:00
Elie Habib	1af73975b9	feat(energy): SPR policy classification layer (#2881 ) * feat(energy): add SPR policy classification layer with 66-country registry Static JSON registry classifying strategic petroleum reserve regimes for 66 countries (all IEA members + major producers/consumers). Integrates into energy profile handler, shock model limitations, analyst context, spine seeder, and CDP UI. - scripts/data/spr-policies.json: 66-entry registry with regime, source, asOf - scripts/seed-spr-policies.mjs: seeder following chokepoint-baselines pattern - Proto fields 51-59 on GetCountryEnergyProfileResponse - Handler reads SPR registry from Redis, populates proto fields - Shock model adds fuel-mode-gated SPR limitations for non-IEA gov SPR - Analyst context refactored to accumulator pattern (IEA + SPR parts) - CDP UI: SPR badge for non-IEA government_spr, muted text for spare_capacity - Spine integration: SPR fields in shockInputs + hasSprPolicy coverage flag - Cache keys, health, bootstrap, seed-health registrations - Tests: registry shape, ISO2, regime enum, required entries, no estimatedFillPct * fix(energy): remove SPR from bootstrap (server-only); narrow SPR hasAny gate to renderable regimes * feat(energy): render "no known SPR" risk note for countries with regime=none * fix(energy): human-readable SPR regime labels; parallelize spine+registry reads in analyst	2026-04-09 22:16:24 +04:00
Elie Habib	23ed4eba44	fix(supply-chain): address all code review findings from PR #2873 (#2878 ) * fix(supply-chain): address all code review findings from PR #2873 - Rename costIncreasePct → supplyDeficitPct (semantic correction) - Add primaryChokepointWarRiskTier to GetBypassOptionsResponse - Consolidate ThreatLevel/threatLevelToWarRiskTier into _insurance-tier.ts - Replace inline CpEntry/ChokepointStatusCacheEntry with ChokepointInfo - Add outer cachedFetchJson wrapper (3 serial Redis reads → 1 on warm path) - Add hs2 validation guard matching sibling handler pattern - Extract CHOKEPOINT_STATUS_KEY constant; eliminate string literal duplication - Add SCORE_RISK_WEIGHT/SCORE_COST_WEIGHT named constants; clamp liveScore ≥ 0 - Add Math.max(0,...) to liveScore for sub-1.0 cost multiplier corridors - Fix closurePct: req.closurePct ?? 100 (was \|\| which falsy-coalesced zero) - Type fetchBypassOptions cargoType as CargoType (was implicit string) - Add exhaustiveness check to threatLevelToInsurancePremiumBps switch - Move TIER_RANK to module level in _insurance-tier.ts - Update WIDGET_PRO_SYSTEM_PROMPT with both new PRO RPCs * fix(supply-chain): fix supplyDeficitPct averaging and coverageDays sentinel - Remove .filter(d > 0) from productDeficits: zero-deficit products have demand and must stay in the denominator to avoid overstating the average - Clamp coverageDays = Math.max(0, effectiveCoverDays): prevents -1 net-exporter sentinel from leaking into the public API response - Update proto comment: document 0 for net exporters - Add test assertions for both contracts * chore(api-docs): regenerate OpenAPI docs for coverage_days comment update * refactor(supply-chain): use CHOKEPOINT_STATUS_KEY in chokepoint-status writer The key was extracted to cache-keys.ts in the previous commit but the primary writer (getChokepointStatus) and BOOTSTRAP_CACHE_KEYS still embedded the raw string literal. Import the constant at both sites to complete the refactor. * test: update supply-chain-v2 assertions for CHOKEPOINT_STATUS_KEY refactor Handler now imports CHOKEPOINT_STATUS_KEY as REDIS_CACHE_KEY from cache-keys.ts rather than defining a local constant. BOOTSTRAP_CACHE_KEYS also references the constant. Update source-string assertions to match the new patterns. * fix: keep BOOTSTRAP_CACHE_KEYS.chokepoints as string literal bootstrap.test.mjs enforces string-literal values in BOOTSTRAP_CACHE_KEYS via regex. CHOKEPOINT_STATUS_KEY is used in handler imports and is the primary dedup win; the static registry entry stays as-is per test contract.	2026-04-09 21:41:26 +04:00
Elie Habib	bd07829518	feat(supply-chain): Sprint 2 — bypass corridor intelligence + cost shock engine (#2873 ) * feat(supply-chain): Sprint 2 — bypass corridor intelligence + cost shock engine - src/config/bypass-corridors.ts: ~40 bypass corridors for all 13 chokepoints - server/supply-chain/v1/get-bypass-options.ts: PRO-gated RPC, live bypass scoring from chokepoint status cache - server/supply-chain/v1/get-country-cost-shock.ts: PRO-gated RPC, war risk premium BPS + energy coverage days (HS 27) - server/supply-chain/v1/_insurance-tier.ts: pure function, Lloyd's JWC threat → premium BPS - gateway.ts + premium-paths.ts: registered both RPCs as slow-browser + PRO-gated - src/services/supply-chain/index.ts: fetchBypassOptions + fetchCountryCostShock client methods - proto: GetBypassOptions + GetCountryCostShock messages + service registrations - tests/supply-chain-sprint2.test.mjs: 61 tests covering all new components Co-Authored-By: Claude Sonnet 4.6 (200K context) <noreply@anthropic.com> * fix(cost-shock): call computeEnergyShockScenario directly instead of reading wrong cache key The old code read from `energy:shock:${iso2}:${chokepointId}:v1` which never matches the actual v2 cache key written by compute-energy-shock.ts. Fix by calling computeEnergyShockScenario() directly (it handles v2 caching internally) and mapping effectiveCoverDays + crude product deficitPct to the response fields. * fix(cost-shock): average refined product deficitPct instead of looking for non-existent 'crude' product --------- Co-authored-by: Claude Sonnet 4.6 (200K context) <noreply@anthropic.com>	2026-04-09 20:15:41 +04:00

1 2 3 4

179 Commits