worldmonitor

mirror of https://github.com/koala73/worldmonitor.git synced 2026-04-25 17:14:57 +02:00

Files

Elie Habib 425507d15a fix(brief): category-gated context + RELEVANCE RULE to stop formulaic grounding (#3281 )

* fix(brief): category-gated context + RELEVANCE RULE to stop formulaic grounding

Shadow-diff of 15 v2 pairs (2026-04-22) showed the analyst pattern-
matching the loudest context numbers — VIX 19.50, top forecast
probability, MidEast FX stress 77 — into every story regardless of
editorial fit. A Rwanda humanitarian story about refugees cited VIX;
an aviation story cited a forecast probability.

Root cause: every story got the same 6-bundle context block, so the
LLM had markets / forecasts / macro in-hand and the "cite a specific
fact" instruction did the rest.

Two-layer fix:

1. STRUCTURAL — sectionsForCategory() maps the story's category to
an editorially-relevant subset of bundles. Humanitarian stories
don't see marketData / forecasts / macroSignals; diplomacy gets
riskScores only; market/energy gets markets+forecasts but drops
riskScores. The model physically cannot cite what it wasn't
given. Unknown categories fall back to all six (backcompat).

2. PROMPT — WHY_MATTERS_ANALYST_SYSTEM_V2 adds a RELEVANCE RULE
that explicitly permits grounding in headline/description
actors when no context fact is a clean fit, and bans dragging
off-topic market metrics into humanitarian/aviation/diplomacy
stories. The prompt footer (inline, per-call) restates the
same guardrail — models follow inline instructions more
reliably than system-prompt constraints on longer outputs.

Cache keys bumped to invalidate the formulaic v5 output: endpoint
v5 to v6, shadow v3 to v4. Adds 11 unit tests pinning the 5
policies + default fallback + humanitarian structural guarantee +
market policy does-see-markets + guardrail footer presence.

Observability: endpoint now logs policyLabel per call so operators
can confirm in Vercel logs that humanitarian/aviation stories are
NOT seeing marketData without dumping the full prompt.

* test(brief): address greptile P2 — sync MAX_BODY_BYTES + add parseWhyMattersV2 coverage

Greptile PR #3281 review raised two P2 test-quality issues:

1. Test-side MAX_BODY_BYTES mirror was still 4096 — the endpoint
was bumped to 8192 in PR #3269 (v2 output + description). With
the stale constant, a payload in the 4097–8192 range was
accepted by the real endpoint but looked oversize in the test
mirror, letting the body-cap invariant silently drift. Fixed
by syncing to 8192 + bumping the bloated fixture to 10_000
bytes so a future endpoint-cap bump doesn't silently
re-invalidate the assertion.

2. parseWhyMattersV2 (the only output-validation gate on the
analyst path) had no dedicated unit tests. Adds 11 targeted
cases covering: valid 2 and 3 sentence output, 100/500 char
bounds (incl. boundary assertions), all 6 banned preamble
phrases, section-label leaks (SITUATION/ANALYSIS/Watch),
markdown leakage (#, -, *, 1.), stub echo rejection, smart/
plain quote stripping, non-string defensive branch, and
whitespace-only strings.

Suite size: 50 to 61 tests, all green.

* fix(brief): add aviation policy to sectionsForCategory (PR #3281 review P1)

Reviewer caught that aviation was named in WHY_MATTERS_ANALYST_SYSTEM_V2's
RELEVANCE RULE as a category banned from off-topic market metrics, but
had no matching regex entry in CATEGORY_SECTION_POLICY. So 'Aviation
Incident' / 'Airspace Closure' / 'Plane Crash' / 'Drone Incursion' all
fell through to DEFAULT_SECTIONS and still got all 6 bundles including
marketData, forecasts, and macroSignals — exactly the VIX / forecast
probability pattern the PR claimed to structurally prevent.

Reproduced on HEAD before fix:
Aviation Incident -> default
Airspace Closure -> default
Plane Crash -> default
...etc.

Fix:
1. Adds aviation policy (same 3 bundles as humanitarian/diplomacy/
tech: worldBrief, countryBrief, riskScores).
2. Adds dedicated aviation-gating test with 6 category variants.
3. Adds meta-invariant test: every category named in the system
prompt's RELEVANCE RULE MUST have a structural policy entry,
asserting policyLabel !== 'default'. If someone adds a new
category name to the prompt in the future, this test fires
until they wire up a regex — prevents soft-guard drift.
4. Removes 'Aviation Incident' from the default-fall-through test
list (it now correctly matches aviation).

No cache bump needed — v6 was published to the feature branch only a
few minutes ago, no production entries have been written yet.

2026-04-22 08:21:01 +04:00

aviation/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

brief

fix(brief): switch carousel to @vercel/og on edge runtime (#3210 )

2026-04-19 15:18:12 +04:00

climate/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

conflict/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

consumer-prices/v1

feat(consumer-prices): add basket price monitoring domain (#1901 )

2026-03-20 17:08:22 +04:00

cyber/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

data

Proto-first API rebuild: sebuf contracts, handlers, gateway, and generated docs (#106 )

2026-02-21 03:39:56 +04:00

discord/oauth

feat(notifications): gate all endpoints behind PRO entitlement (#2852 )

2026-04-09 09:18:19 +04:00

displacement/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

economic/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

eia

feat(eia): gold-standard /api/eia/petroleum (Railway seed → Redis → Vercel reads only) (#3161 )

2026-04-18 14:40:00 +04:00

enrichment

refactor(enrichment): dedupe domain/org normalization helpers (#1707 )

2026-03-16 08:51:15 +04:00

forecast/v1

feat(forecast): AI Forecasts prediction module (#1579 )

2026-03-15 01:42:04 +04:00

giving/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

health/v1

feat(panels): Disease Outbreaks, Shipping Stress, Social Velocity, nuclear test site enrichment (#2375 )

2026-03-27 22:33:45 +04:00

imagery/v1

feat(map): add NOTAM overlay + satellite imagery integration (#1356 )

2026-03-10 07:19:02 +04:00

infrastructure/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

intelligence/v1

fix(llm): pin LLM edge functions to US/EU regions to prevent geo-block 403s (#2541 )

2026-03-30 11:08:14 +04:00

internal

fix(brief): category-gated context + RELEVANCE RULE to stop formulaic grounding (#3281 )

2026-04-22 08:21:01 +04:00

maritime/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

market/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

military/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

natural/v1

feat: move EONET/GDACS to server-side with Redis caching (#983 )

2026-03-04 15:02:03 +04:00

news/v1

fix(llm): pin LLM edge functions to US/EU regions to prevent geo-block 403s (#2541 )

2026-03-30 11:08:14 +04:00

oauth

feat(mcp): live airspace + maritime tools; fix OAuth consent UI (#2442 )

2026-03-28 23:59:47 +04:00

positive-events/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

prediction/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

radiation/v1

feat: add Radiation Watch with seeded anomaly intelligence, map layers, and country exposure (#1735 )

2026-03-17 09:18:06 +04:00

referral

fix(referral): stop /api/referral/me 503s on prod homepage (#3186 )

2026-04-18 23:32:48 +04:00

research/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

resilience/v1

feat(resilience): add service proto and stub handlers (#2657 )

2026-04-04 08:04:46 +04:00

sanctions/v1

feat(sanctions): add OFAC sanctions pressure intelligence (#1739 )

2026-03-17 11:52:32 +04:00

scenario/v1

fix(supply-chain): popup-keyed history re-query + dataAvailable flag (#3187 )

2026-04-18 23:38:33 +04:00

seismology/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

skills

fix(intelligence): analytical frameworks follow-up — P1 security + P2 correctness fixes (#2386 )

2026-03-28 01:10:02 +04:00

slack/oauth

feat(notifications): gate all endpoints behind PRO entitlement (#2852 )

2026-04-09 09:18:19 +04:00

supply-chain

fix(country-brief): display bugs — slugs, self-imports, N/A, HS labels (#3032 )

2026-04-12 22:41:44 +04:00

thermal/v1

Add thermal escalation seeded service (#1747 )

2026-03-17 14:24:26 +04:00

trade/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

unrest/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

v2/shipping

fix(api): unblock Pro API clients at edge + accept x-api-key alias (#3155 )

2026-04-18 08:18:49 +04:00

webcam/v1

feat(webcams): add webcam map layer with Windy API integration (#1540 ) (#1540 )

2026-03-14 09:34:54 +04:00

wildfire/v1

perf(api): split monolithic edge function into per-domain functions (#753 )

2026-03-02 14:13:34 +04:00

youtube

fix: resolve YouTube 'sign in to confirm' bot-check in embed panels (#1284 )

2026-03-10 07:00:07 +04:00

_api-key.js

fix(api): unblock Pro API clients at edge + accept x-api-key alias (#3155 )

2026-04-18 08:18:49 +04:00

_cors.js

fix(api): unblock Pro API clients at edge + accept x-api-key alias (#3155 )

2026-04-18 08:18:49 +04:00

_cors.test.mjs

Fix Tauri desktop runtime reliability and settings UX

2026-02-13 23:05:51 +04:00

_crypto.js

feat(mcp): full OAuth 2.1 compliance — Authorization Code + PKCE + DCR (#2432 )

2026-03-28 18:40:53 +04:00

_email-validation.js

feat(email): add deliverability guards to reduce waitlist bounces (#2819 )

2026-04-08 11:21:40 +04:00

_github-release.js

refactor: dedupe github latest release fetch wiring (#1704 )

2026-03-16 08:59:33 +04:00

_ip-rate-limit.js

refactor(api): dedupe in-memory IP rate limiter (#1740 )

2026-03-17 08:32:49 +04:00

_json-response.js

Triage security alerts (#1903 )

2026-03-20 12:37:24 +04:00

_oauth-token.js

refactor: consolidate Upstash helpers and extract DeckGL color config (#2465 )

2026-03-29 10:38:43 +04:00

_product-fallback-prices.js

fix(catalog): update prices to match Dodo catalog via API (#2678 )

2026-04-04 15:05:00 +04:00

_rate-limit.js

feat(mcp): OAuth 2.0 Authorization Server for claude.ai connector (#2418 )

2026-03-28 14:53:32 +04:00

_relay.js

fix(relay): add authorization guard to api/_relay.js and dedupe military baseUrl call (#1992 )

2026-03-21 16:37:43 +04:00

_rss-allowed-domains.js

feat(feeds): add IRNA, Mehr, Jerusalem Post, Ynetnews to middleeast (#3236 )

2026-04-20 19:07:09 +04:00

_seed-envelope.js

feat(seed-contract): PR 1 foundation — envelope + contract + conformance test (#3095 )

2026-04-14 22:11:56 +04:00

_sentry-edge.js

fix(edge): fire-and-forget Sentry, 2s timeout, response check (#2559 )

2026-03-31 07:28:35 +04:00

_turnstile.js

refactor: share edge Turnstile helper (#1628 )

2026-03-15 09:21:22 +04:00

_turnstile.test.mjs

refactor: share edge Turnstile helper (#1628 )

2026-03-15 09:21:22 +04:00

_upstash-json.js

feat(brief): hosted magazine edge route + latest-brief preview RPC (Phase 2) (#3153 )

2026-04-18 07:28:49 +04:00

ais-snapshot.js

refactor(api): extract shared relay helper into _relay.js (#782 )

2026-03-02 19:28:31 +04:00

bootstrap.js

feat(seed-contract): PR 2a — runSeed envelope dual-write + 91 seeders migrated (#3097 )

2026-04-15 09:16:27 +04:00

cache-purge.js

refactor: consolidate Upstash helpers and extract DeckGL color config (#2465 )

2026-03-29 10:38:43 +04:00

chat-analyst.ts

fix(api): unblock Pro API clients at edge + accept x-api-key alias (#3155 )

2026-04-18 08:18:49 +04:00

contact.js

fix: update contact email to elie@worldmonitor.app (#2623 )

2026-04-02 20:13:33 +04:00

create-checkout.ts

[codex] guard duplicate subscription checkout (#3162 )

2026-04-18 15:19:34 +04:00

customer-portal.ts

[codex] guard duplicate subscription checkout (#3162 )

2026-04-18 15:19:34 +04:00

download.js

refactor: dedupe github latest release fetch wiring (#1704 )

2026-03-16 08:59:33 +04:00

fwdstart.js

refactor: dedupe edge api json response assembly (#1702 )

2026-03-16 11:52:56 +04:00

geo.js

refactor: dedupe edge api json response assembly (#1702 )

2026-03-16 11:52:56 +04:00

gpsjam.js

refactor: dedupe edge api json response assembly (#1702 )

2026-03-16 11:52:56 +04:00

health.js

feat(resilience): flag-gated pillar-combined score activation (default off) (#3267 )

2026-04-22 06:52:07 +04:00

invalidate-user-api-key-cache.ts

feat(auth): user-facing API key management (create / list / revoke) (#3125 )

2026-04-17 07:20:39 +04:00

latest-brief.ts

fix(brief): per-run slot URL so same-day digests link to distinct briefs (#3205 )

2026-04-19 14:15:59 +04:00

loaders-xml-wms-regression.test.mjs

Build/runtime hardening and dependency security updates (#286 )

2026-02-24 08:21:03 +00:00

mcp-proxy.js

fix(mcp): SSE transport support for /sse MCP servers (#1848 )

2026-03-19 03:42:48 +04:00

mcp.ts

fix(seeds): upstream API drift — SPDR XLSX + IMF IRFCL + IMF-External BX/BM drop (#3076 )

2026-04-14 08:19:47 +04:00

military-flights.js

refactor: dedupe edge api json response assembly (#1702 )

2026-03-16 11:52:56 +04:00

notification-channels.ts

feat(notifications): Phase 6 — web-push channel for PWA notifications (#3173 )

2026-04-18 20:27:08 +04:00

notify.ts

fix(security): strip importanceScore from /api/notify payload + scope fan-out by userId (#3143 )

2026-04-17 11:14:25 +04:00

og-story.js

fix(security): prevent Host header injection in story.js (#2102 )

2026-03-23 08:44:25 +04:00

og-story.test.mjs

fix(api): sanitize og-story level input (#219 )

2026-02-22 03:19:01 +04:00

opensky.js

refactor(api): extract shared relay helper into _relay.js (#782 )

2026-03-02 19:28:31 +04:00

oref-alerts.js

refactor: dedupe edge api json response assembly (#1702 )

2026-03-16 11:52:56 +04:00

polymarket.js

refactor(api): extract shared relay helper into _relay.js (#782 )

2026-03-02 19:28:31 +04:00

product-catalog.js

feat(seed-contract): PR 2a — runSeed envelope dual-write + 91 seeders migrated (#3097 )

2026-04-15 09:16:27 +04:00

fix(emails): update transactional email copy — 22 → 30+ services (#3203 )

2026-04-19 13:15:17 +04:00

reverse-geocode.js

refactor: consolidate Upstash helpers and extract DeckGL color config (#2465 )

2026-03-29 10:38:43 +04:00

rss-proxy.js

Triage security alerts (#1903 )

2026-03-20 12:37:24 +04:00

sanctions-entity-search.js

feat(sanctions): entity lookup index + OpenSanctions search (#2042 ) (#2085 )

2026-03-23 19:38:11 +04:00

satellites.js

refactor: dedupe edge api json response assembly (#1702 )

2026-03-16 11:52:56 +04:00

seed-contract-probe.ts

fix(seed-contract-probe): send Origin header so /api/bootstrap boundary check doesn't 401 (#3100 )

2026-04-15 15:34:38 +04:00

seed-health.js

feat(eia): gold-standard /api/eia/petroleum (Railway seed → Redis → Vercel reads only) (#3161 )

2026-04-18 14:40:00 +04:00

story.js

fix(security): prevent Host header injection in story.js (#2102 )

2026-03-23 08:44:25 +04:00

telegram-feed.js

fix(api): remove double URL-encoding in telegram-feed relay (#2108 )

2026-03-23 09:14:40 +04:00

user-prefs.ts

feat(prefs): Phase 2 — frontend preferences sync (#2507 )

2026-03-29 16:46:12 +04:00

version.js

refactor: dedupe edge api json response assembly (#1702 )

2026-03-16 11:52:56 +04:00

widget-agent.ts

fix(api): unblock Pro API clients at edge + accept x-api-key alias (#3155 )

2026-04-18 08:18:49 +04:00