mirror of
https://github.com/koala73/worldmonitor.git
synced 2026-04-25 17:14:57 +02:00
fix(grocery-basket): fix JP sites, add TRY/EGP/INR floors, evict bad routes (#2324)
The bilateral outlier gate (previous commit) catches bad prices AFTER the
fact. This commit prevents them from being accepted in the first place and
forces re-scraping on the very next seed run.
Japan sites (grocery-basket.json):
Before: kakaku.com, price.com — price comparison aggregators that show
per-gram/per-serving prices in LLM summaries, not per-kg shelf prices.
price.com is not even Japan-specific.
After: seiyu.co.jp, life.co.jp, aeon-net.com — actual Japanese supermarket
chains with clear per-unit JPY shelf prices.
CURRENCY_MIN additions (seed-grocery-basket.mjs):
TRY: 10 — Turkish shelf prices are always ≥ 10 TRY; the bad values
(2.75–5.61 TRY) were per-100g sub-unit matches.
EGP: 5 — Egyptian shelf prices are ≥ 5 EGP; 2.95/3.25 EGP were fractional.
INR: 12 — Indian shelf prices are ≥ 12 INR; 10 INR potatoes was stale.
One-time eviction (_migration:bad-prices-v1):
All JP routes — stale since sites changed; forces fresh searches on new domains.
TR: sugar, eggs, milk, oil — confirmed sub-unit price scrapes.
EG: salt, bread, milk — confirmed too-cheap scrapes.
IN: potatoes, milk — confirmed too-cheap scrapes.
Next seed run re-fetches all evicted items from EXA/Firecrawl from scratch.
This commit is contained in:
@@ -114,8 +114,9 @@
|
||||
"currency": "JPY",
|
||||
"flag": "🇯🇵",
|
||||
"sites": [
|
||||
"kakaku.com",
|
||||
"price.com"
|
||||
"seiyu.co.jp",
|
||||
"life.co.jp",
|
||||
"aeon-net.com"
|
||||
]
|
||||
},
|
||||
{
|
||||
|
||||
Reference in New Issue
Block a user