mirror of
https://github.com/koala73/worldmonitor.git
synced 2026-04-25 17:14:57 +02:00
| # | Variant | Category | Feed Name | Status | Newest Date | Error | URL | Replacement | Notes |
|---|---|---|---|---|---|---|---|---|---|
| 32 | full | europe | Corriere della Sera | DEAD | | HTTP 404 | https://xml2.corriereobjects.it/rss/incipit.xml | https://www.corriere.it/rss/homepage.xml | Direct RSS 69 items verified |
| 36 | full | europe | De Telegraaf | DEAD | | HTTP 403 | https://www.telegraaf.nl/rss | https://news.google.com/rss/search?q=site:telegraaf.nl+when:1d&hl=nl&gl=NL&ceid=NL:nl | Direct feed blocks crawlers; Google News 100 items |
| 38 | full | europe | Dagens Nyheter | DEAD | | HTTP 404 | https://www.dn.se/rss/senaste-nytt/ | https://www.dn.se/rss/ | Drop /senaste-nytt/ suffix; 116 items verified |
| 65 | full | middleeast | L'Orient-Le Jour | DEAD | | HTTP 403 | https://www.lorientlejour.com/rss | https://news.google.com/rss/search?q=site:lorientlejour.com+when:1d&hl=fr&gl=LB&ceid=LB:fr | Direct 403; Google News 74 items French |
| 132 | full | latam | O Globo | DEAD | | HTTP 404 | https://oglobo.globo.com/rss/top_noticias/ | https://news.google.com/rss/search?q=site:oglobo.globo.com+when:1d&hl=pt-BR&gl=BR&ceid=BR:pt-419 | Direct RSS empty shell; Google News 100 items |
| 133 | full | latam | Folha de S.Paulo | DEAD | | fetch failed | https://feeds.folha.uol.com.br/emcimadahora/rss091.xml | KEEP | Transient failure; 100 items on retest |
| 136 | full | latam | El Universal | DEAD | | HTTP 404 | https://www.eluniversal.com.mx/rss.xml | https://news.google.com/rss/search?q=site:eluniversal.com.mx+when:1d&hl=es-419&gl=MX&ceid=MX:es-419 | All direct paths 404; Google News 100 items |
| 139 | full | latam | Animal Político | DEAD | | HTTP 404 | https://animalpolitico.com/feed/ | https://news.google.com/rss/search?q=site:animalpolitico.com+when:1d&hl=es-419&gl=MX&ceid=MX:es-419 | Direct 404; Google News 98 items |
| 140 | full | latam | Proceso | DEAD | | HTTP 404 | https://www.proceso.com.mx/feed/ | https://news.google.com/rss/search?q=site:proceso.com.mx+when:1d&hl=es-419&gl=MX&ceid=MX:es-419 | Direct 404; Google News 100 items |
| 141 | full | latam | Milenio | DEAD | | HTTP 404 | https://www.milenio.com/rss | https://news.google.com/rss/search?q=site:milenio.com+when:1d&hl=es-419&gl=MX&ceid=MX:es-419 | All direct paths 404; Google News 100 items |
| 161 | full | asia | Bangkok Post | DEAD | | HTTP 451 | https://www.bangkokpost.com/rss | https://news.google.com/rss/search?q=site:bangkokpost.com+when:1d&hl=en-US&gl=US&ceid=US:en | Geo-blocked 451; Google News 42 items |
| 377 | happy | science | ScienceDaily | DEAD | | Timeout | https://www.sciencedaily.com/rss/top.xml | https://www.sciencedaily.com/rss/all.xml | top.xml empty; all.xml has 40 items verified |
| 388 | intel | inspiring | Breaking Defense | DEAD | | HTTP 403 | https://breakingdefense.com/feed/ | KEEP | Works with proper User-Agent; 15 items verified |
| 402 | intel | inspiring | RAND | DEAD | | HTTP 404 | https://www.rand.org/rss/all.xml | https://news.google.com/rss/search?q=site:rand.org+when:7d&hl=en-US&gl=US&ceid=US:en | Direct 403; Google News 50 items |
| 406 | intel | inspiring | NTI | DEAD | | HTTP 403 | https://www.nti.org/rss/ | https://news.google.com/rss/search?q=site:nti.org+when:30d&hl=en-US&gl=US&ceid=US:en | Direct feed empty; Google News 30d window 27 items |
| 415 | intel | inspiring | Bellingcat | DEAD | | fetch failed | https://www.bellingcat.com/feed/ | https://news.google.com/rss/search?q=site:bellingcat.com+when:30d&hl=en-US&gl=US&ceid=US:en | SSL handshake fails; Google News 30d 19 items (low pub freq) |
| 23 | full | europe | DW News [es] | EMPTY | | No dates found | https://rss.dw.com/xml/rss-es-all | https://news.google.com/rss/search?q=site:dw.com/es&hl=es-419&gl=MX&ceid=MX:es-419 | DW deprecated es RSS endpoint; Google News 100 items |
| 28 | full | europe | Bild | EMPTY | | No dates found | https://www.bild.de/feed/alles.xml | KEEP (parser fix) | Feed works; dates use CET/CEST timezone abbreviation not RFC 2822 |
| 110 | full | crisis | CrisisWatch | EMPTY | | No dates found | https://www.crisisgroup.org/rss | KEEP (parser fix) | Feed works; Drupal date format: "Wednesday, February 25, 2026 - 21:07" |
| 111 | full | crisis | IAEA | EMPTY | | No dates found | https://www.iaea.org/feeds/topnews | KEEP (parser fix) | Feed works; 2-digit year: "Thu, 26 Feb 26" needs expansion to 2026 |
| 116 | full | africa | News24 | EMPTY | | No dates found | https://feeds.capi24.com/v1/Search/articles/news24/Africa/rss | https://feeds.news24.com/articles/news24/TopStories/rss | Old CAPI feed empty; new URL 20 items verified |
| 157 | full | asia | India News Network | EMPTY | | No dates found | https://www.indianewsnetwork.com/rss.en.diplomacy.xml | https://news.google.com/rss/search?q=India+diplomacy+foreign+policy+news&hl=en&gl=US&ceid=US:en | Original feed has zero date fields in any item |
| 162 | full | asia | Thai PBS | EMPTY | | No dates found | https://news.google.com/rss/search?q=site:thaipbsworld.com+when:2d&hl=th&gl=TH&ceid=TH:th | https://news.google.com/rss/search?q=Thai+PBS+World+news&hl=en&gl=US&ceid=US:en | Site moved to world.thaipbs.or.th no RSS; sparse results consider REMOVE |
| 163 | full | asia | VnExpress | EMPTY | | No dates found | https://vnexpress.net/rss | https://vnexpress.net/rss/tin-moi-nhat.rss | Bare /rss is HTML index; correct endpoint is /rss/tin-moi-nhat.rss 55 items |
| 164 | full | asia | Tuoi Tre News | EMPTY | | No dates found | https://news.google.com/rss/search?q=site:tuoitrenews.vn+when:2d&hl=vi&gl=VN&ceid=VN:vi | https://tuoitrenews.vn/rss | Direct RSS works 50 items; Google News was stale |
| 231 | tech | regionalStartups | Disrupt Africa | EMPTY | | No dates found | https://news.google.com/rss/search?q=site:disrupt-africa.com+when:7d&hl=en-US&gl=US&ceid=US:en | REMOVE | Last post Jan 2024; site inactive; no Google News results |
| 237 | tech | github | GitHub Trending | EMPTY | | No dates found | https://mshibanami.github.io/GitHubTrendingRSS/daily/all.xml | KEEP (parser fix) | Feed works with current items; parser may not handle its date format |
| 268 | tech | thinktanks | MIT Tech Policy | EMPTY | | No dates found | https://news.google.com/rss/search?q=site:techpolicypress.org+when:14d&hl=en-US&gl=US&ceid=US:en | https://news.google.com/rss/search?q=%22Tech+Policy+Press%22&hl=en&gl=US&ceid=US:en | Domain DNS fails; search by name returns 100 items |
| 270 | tech | thinktanks | AI Now Institute | EMPTY | | No dates found | https://news.google.com/rss/search?q=site:ainowinstitute.org+when:14d&hl=en-US&gl=US&ceid=US:en | https://news.google.com/rss/search?q=%22AI+Now+Institute%22&hl=en&gl=US&ceid=US:en | SSL issue on direct; Google News 59 items (infrequent publisher) |
| 279 | tech | thinktanks | DigiChina | EMPTY | | No dates found | https://news.google.com/rss/search?q=site:digichina.stanford.edu+when:14d&hl=en-US&gl=US&ceid=US:en | https://news.google.com/rss/search?q=DigiChina+Stanford+China+technology&hl=en&gl=US&ceid=US:en | WordPress RSS empty; Google News 20 items to Jul 2025 |
| 306 | tech | podcasts | 20VC Episodes | EMPTY | | No dates found | https://news.google.com/rss/search?q="20+Minute+VC"+Harry+Stebbings+when:14d&hl=en-US&gl=US&ceid=US:en | https://rss.libsyn.com/shows/61840/destinations/240976.xml | Official podcast RSS via Apple; 1423 episodes current |
| 310 | tech | podcasts | Pivot Podcast | EMPTY | | No dates found | https://news.google.com/rss/search?q="Pivot+podcast"+(Kara+Swisher+OR+Scott+Galloway)+when:14d&hl=en-US&gl=US&ceid=US:en | https://feeds.megaphone.fm/pivot | Megaphone RSS; 750 episodes current |
| 315 | tech | podcasts | Startup Podcasts | EMPTY | | No dates found | https://news.google.com/rss/search?q=("Masters+of+Scale"+OR+"The+Pitch+podcast"+OR+"startup+podcast")+episode+when:14d&hl=en-US&gl=US&ceid=US:en | https://rss.art19.com/masters-of-scale | Masters of Scale RSS 670 eps; "The Pitch" feeds 404; drop it |
| 379 | happy | science | Live Science | EMPTY | | No dates found | https://www.livescience.com/feeds/all | https://www.livescience.com/feeds.xml | /feeds/all redirects to /feeds.xml; 20+ items current |
| 383 | happy | science | Greater Good (Berkeley) | EMPTY | | No dates found | https://greatergood.berkeley.edu/rss | https://greatergood.berkeley.edu/site/rss/articles | /rss is 404; correct path /site/rss/articles 50 items; uses dc:date |
| 398 | intel | inspiring | CSIS | EMPTY | | No dates found | https://www.csis.org/analysis?type=analysis | https://news.google.com/rss/search?q=site:csis.org&hl=en&gl=US&ceid=US:en | Not an RSS URL (HTML); all RSS paths 403; Google News 100 items |
| 403 | intel | inspiring | Brookings | EMPTY | | No dates found | https://www.brookings.edu/feed/ | https://news.google.com/rss/search?q=site:brookings.edu&hl=en&gl=US&ceid=US:en | WordPress feed bot-blocked; Google News 100 items |
| 404 | intel | inspiring | Carnegie | EMPTY | | No dates found | https://carnegieendowment.org/rss/ | https://news.google.com/rss/search?q=site:carnegieendowment.org&hl=en&gl=US&ceid=US:en | Next.js site returns HTML for RSS paths; Google News 100 items |
| 5 | full | politics | CNN World | STALE | 18/09/2023 | Stale | http://rss.cnn.com/rss/cnn_world.rss | https://news.google.com/rss/search?q=site:cnn.com+world+news+when:1d&hl=en-US&gl=US&ceid=US:en | rss.cnn.com SSL failures; use Google News proxy for CNN world |
| 43 | full | europe | TVN24 | STALE | 01/04/2025 | Stale | https://tvn24.pl/najwazniejsze.xml | https://tvn24.pl/swiat.xml | najwazniejsze.xml stale; swiat.xml (world) 30 items current |
| 73 | full | ai | VentureBeat AI | STALE | 22/01/2026 | Stale | https://venturebeat.com/category/ai/feed/ | KEEP | Borderline stale; 308 redirect issue; sparse 7-item feed by design |
| 84 | full | gov | Pentagon | STALE | 23/01/2026 | Stale | https://news.google.com/rss/search?q=site:defense.gov+OR+Pentagon&hl=en-US&gl=US&ceid=US:en | KEEP | Borderline; defense.gov low recent output; Google News proxy working |
| 94 | full | layoffs | Layoffs.fyi | STALE | 29/12/2020 | Stale | https://layoffs.fyi/feed/ | https://news.google.com/rss/search?q=tech+company+layoffs+announced&hl=en&gl=US&ceid=US:en | Feed abandoned Dec 2020; Google News 100 items |
| 396 | intel | inspiring | Oryx OSINT | STALE | 07/12/2024 | Stale | https://www.oryxspioenkop.com/feeds/posts/default?alt=rss | KEEP | Publishes infrequently by design (detailed equipment loss lists) |
| 405 | intel | inspiring | FAS | STALE | 14/02/2023 | Stale | https://fas.org/feed/ | https://news.google.com/rss/search?q=site:fas.org+nuclear+weapons+security&hl=en&gl=US&ceid=US:en | RSS broken (1 item from 2023); Google News proxy available |
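
The three "KEEP (parser fix)" rows that quote or name a date format (Bild, CrisisWatch, IAEA) suggest a fallback chain for date parsing. The following is a minimal Python sketch of that idea, not the project's actual parser. The function name `parse_feed_date`, the fixed CET/CEST offsets, the UTC default for zone-less dates, and the Bild sample string in the usage example are all assumptions made for illustration; only the CrisisWatch and IAEA sample strings come from the Notes column.

```python
import re
from datetime import datetime, timezone
from typing import Optional

# Assumed fixed offsets for the abbreviations named in the Bild row.
TZ_OFFSETS = {"CET": "+0100", "CEST": "+0200"}

# Candidate layouts, tried in order (hypothetical list, extend as needed).
FORMATS = (
    "%a, %d %b %Y %H:%M:%S %z",  # RFC 2822 with a numeric offset
    "%A, %B %d, %Y - %H:%M",     # Drupal style quoted in the CrisisWatch row
    "%a, %d %b %y",              # two-digit year quoted in the IAEA row
)


def parse_feed_date(raw: str) -> Optional[datetime]:
    """Return a timezone-aware datetime, or None if no layout matches."""
    text = raw.strip()

    # Swap a trailing CET/CEST abbreviation for its numeric offset so the
    # RFC 2822 layout above can parse it.
    match = re.search(r"\b(CET|CEST)$", text)
    if match:
        text = text[: match.start()] + TZ_OFFSETS[match.group(1)]

    for fmt in FORMATS:
        try:
            parsed = datetime.strptime(text, fmt)
        except ValueError:
            continue
        if parsed.tzinfo is None:
            # Assumption: dates without a zone are treated as UTC.
            parsed = parsed.replace(tzinfo=timezone.utc)
        return parsed
    return None


if __name__ == "__main__":
    samples = (
        "Wednesday, February 25, 2026 - 21:07",  # from the CrisisWatch row
        "Thu, 26 Feb 26",                        # from the IAEA row
        "Thu, 26 Feb 2026 21:07:00 CET",         # hypothetical Bild-style value
    )
    for sample in samples:
        print(sample, "->", parse_feed_date(sample))
```

Python's `%y` directive maps two-digit values 00 to 68 onto 2000 to 2068, which covers the expansion to 2026 that the IAEA note asks for. Treating zone-less dates as UTC is a simplification; a staleness check that only needs day-level accuracy would probably tolerate it, but that is a judgment for whoever owns the parser.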