Files
worldmonitor/scripts/data
Elie Habib ce797da3a4 chore(energy-atlas): backfill productClass on oil pipelines + enforce enum (#3383)
* chore(energy-atlas): backfill productClass on all oil pipelines + enforce enum

Prior state: 12/75 oil pipelines carried a `productClass: "crude"` tag;
63/75 did not. The field had zero consumers anywhere in the codebase
(no validator, no server handler, no frontend reader) — orphan metadata
from partial curation. Inconsistency spotted during the energy-data
audit after the Energy Atlas PR chain landed.

Changes:

1. Backfill all 63 missing entries with one of three values based on
   the pipeline's name/operator/route:
   - `crude` (70 total): crude-oil trunks, gathering lines, export
     systems. Covers Druzhba, Enbridge Mainline, Keystone-XL, CPC,
     BTC, ESPO, Sumed, Forties, Brent, OCP, OCENSA, EACOP, LAPSSET,
     etc.
   - `products` (4 total): explicit refined-product pipelines —
     Abqaiq-Yanbu Products Line, Vadinar-Kandla, Yangzi-Hefei-Hangzhou,
     Tuxpan-Mexico City.
   - `mixed` (1 total): Salina Cruz-Minatitlán, the only dual-use
     crude/products bridge in the set.

2. Promote productClass from orphan metadata to a schema invariant:
   - Oil pipelines MUST declare one of {crude, products, mixed}.
   - Gas pipelines MUST NOT carry the field (commodity IS its own
     class there).
   - Enforced in scripts/_pipeline-registry.mjs::validateRegistry.

3. Five new test assertions in tests/pipelines-registry.test.mts
   cover both the data invariant (every oil entry has a valid value;
   no gas entry has one) and the validator behavior (rejects missing,
   rejects unknown enum value, rejects gas-with-productClass).

File formatting: the oil registry mixes two styles — multi-line (each
field on its own line) and compact (several fields packed onto one
line). The insertion preserves the local style for each entry by
reusing the whitespace that follows `"commodityType": "oil",`.

No runtime consumers yet; this lands the data hygiene so future
downstream work (crude-vs-products split on the map, refined-product
shock calcs) can rely on the field being present and valid.

* fix(energy-atlas): import VALID_OIL_PRODUCT_CLASSES in tests instead of redefining

Greptile P2 on #3383: the test file defined its own inline
`const VALID = new Set(['crude', 'products', 'mixed'])`, mirroring the
registry's `VALID_OIL_PRODUCT_CLASSES`. If a future PR adds a new class
(e.g. `condensate`) to the registry, the inline copy wouldn't update —
the data test would start reporting valid pipelines as failing before
the validator rejects them, creating a confusing diagnostic gap.

Export `VALID_OIL_PRODUCT_CLASSES` from `scripts/_pipeline-registry.mjs`
and import it in the test. Single source of truth; no drift possible.
2026-04-24 23:36:51 +04:00
..