This task is to document the current infrastructure and assets of WMDE Analytics. The task text will be updated as the discussion below progresses.
## Tables
- presto_analytics_hive/goransm/wdcm_clients_wb_entity_usage
## Servers
The following are found on Cloud VPS:
- wikidata-analytics-1
- wiktionary-cognate-1
## Code repos
- WikidataAnalytics (includes Wikidata Concepts Monitor)
- [[ https://github.com/wikimedia/analytics-wmde-WD-WikidataAnalytics | GitHub ]]
- WiktionaryCognateDashboard
- [[ https://github.com/wikimedia/analytics-wmde-WiktionaryCognateDashboard | GitHub ]]
## CRON jobs
- `WDCM_Sqoop_Clients runs` on `stat1004` weekly - It doesn't run spark (but Sqoop)
- `2021_WMDE_Mitmachen_Bereich_2021_Campaign` runs on `stat1007` daily - It doesn't run spark (but Hive)
- `WD_PageviewsPerType` runs on `stat1007` daily but has been failing since February 17th - It runs a spark job
- `WD_UsageCoverage` runs on `stat1008` daily - It runs a spark job
- `WD_languagesLandscape` runs on `stat1008` monthly (30th of the month) - It runs a spark job
- `Wiktionary_CognateDashboard` runs on stat1008 daily - It doesn't run spark
- `WDCM_EngineBiases` runs on `stat1008` weekly - It runs a spark job
- `Qurator_CuriousFacts` runs on `stat1008` monthly (10th of the month) - It runs a spark job
- `WMDE_BannerImpressions` runs on `stat1008` hourly - It doesn't runspark (but Hive)
- `NewEditors_comprehensive_report` runs on `stat1008` daily - It runs a spark job