Jan 9 2023

EChetty moved T323614: [M] Reduce image_suggestion HDFS files footprint from Structured Data (Tracking) to Sprint 05-06 on the Data Pipelines board.
Jan 9 2023, 4:18 PM · Data Pipelines, Structured-Data-Backlog (Current Work), Image-Suggestions
EChetty moved T309769: Expanding External Referrer Tracking from Ready to Deploy to In Review on the Data Pipelines (Sprint 05-06) board.
Jan 9 2023, 4:06 PM · Data Pipelines (Sprint 08), Metrics Platform Backlog, Foundational Technology Requests
EChetty moved T309769: Expanding External Referrer Tracking from In Review to Ready to Deploy on the Data Pipelines (Sprint 05-06) board.
Jan 9 2023, 4:06 PM · Data Pipelines (Sprint 08), Metrics Platform Backlog, Foundational Technology Requests
EChetty moved T301403: Investigate wikimedia and wikidata unique devices per-project-family overcount offset from Incident/Unexpected work to Done on the Data Pipelines (Sprint 05-06) board.
Jan 9 2023, 4:05 PM · Data Pipelines (Sprint 05-06), Data-Engineering-Planning, Product-Analytics
EChetty moved T324486: [Migration] migrate simple oozie jobs from To be prioritised to To be discussed /To be estimated on the Data Pipelines board.
Jan 9 2023, 3:33 PM · Data-Engineering, Data Pipelines
EChetty moved T324485: [Airflow] Migrate Druid loading Oozie jobs - Parent task from To be prioritised to To be discussed /To be estimated on the Data Pipelines board.
Jan 9 2023, 3:33 PM · Data Pipelines (Sprint 14)
EChetty moved T326339: Use uap-core browser-family for bot detection from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 9 2023, 3:33 PM · Data-Engineering, Data Pipelines
EChetty moved T326193: Airflow upgrade (refactor deb creation + version bump + switch to PostgreSQL) from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 9 2023, 3:32 PM · Data Pipelines
EChetty assigned T324660: Install Ceph Cluster for Data Engineering to BTullis.
Jan 9 2023, 11:34 AM · Data-Platform-SRE, Epic
EChetty moved T324660: Install Ceph Cluster for Data Engineering from Backlog to Epics on the Shared-Data-Infrastructure board.
Jan 9 2023, 11:33 AM · Data-Platform-SRE, Epic

Jan 6 2023

EChetty moved T320860: Fix mediawiki-history page computation for deleted pages having the same title from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 3:22 PM · Data-Engineering, Data Pipelines
EChetty moved T323905: [M] Automate Airflow DAG release from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 3:22 PM · Structured-Data-Backlog (Current Work), Data Pipelines
EChetty moved T325103: Prune raw HDFS FSImages stored on HDFS from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 3:22 PM · Data-Engineering, Data Pipelines
EChetty moved T325213: Increase mypy coverage in airflow-dags from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:59 PM · Data-Engineering, Data Pipelines
EChetty moved T325204: Add missing tests in airflow-dags from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:59 PM · Data Pipelines
EChetty moved T58628: Non-mobile UAs on mobile (2g/gprs, etc) IP-blocks from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:58 PM · Data-Engineering, Data-Engineering-Wikistats
EChetty moved T311229: Drop MediaViewer and MultimediaViewer* tables from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:58 PM · Data-Engineering, Data Pipelines
EChetty moved T323662: NEW FEATURE REQUEST: Dataset with active and non-active Wikis from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:58 PM · Data-Engineering, Data Pipelines
EChetty moved T324757: When moving oozie webrequest-load to airflow/spark avoid the error-check corner case from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:57 PM · Data-Engineering, Data Pipelines
EChetty moved T325181: Present "Notebooks in Airflow" solution to PA and discuss ownership of different steps from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:57 PM · Data-Engineering, Product-Analytics
EChetty moved T325185: [Airflow] Implement a NotebookOperator from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:57 PM · Data-Engineering, Data Pipelines
EChetty moved T325195: Set up a repository to generate packaged conda environments via CI for Jupyter notebooks from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:57 PM · Data-Engineering, Data Pipelines
EChetty moved T325611: Add TikTok's in-app browser to ua-parser library from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:56 PM · Data-Engineering, Data Pipelines, Product-Analytics
EChetty moved T325306: Provide aggregated user device data per-country from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:56 PM · Data-Engineering
EChetty moved T323562: Number of anonymous edits from Incoming (new tickets) to Event Platform Backlog on the Data-Engineering board.
Jan 6 2023, 12:56 PM · Data-Engineering, Data-Engineering-Wikistats
EChetty moved T324993: User stats page from Incoming (new tickets) to Event Platform Backlog on the Data-Engineering board.
Jan 6 2023, 12:56 PM · Data-Engineering, Data-Engineering-Wikistats
EChetty moved T324053: Remove Matplotlib as a Wmfdata-Python dependency from Incoming (new tickets) to WMF-Data on the Data-Engineering board.
Jan 6 2023, 12:56 PM · Data-Engineering, Product-Analytics, Wmfdata-Python
EChetty moved T324135: Wmfdata-Python triggers a Pandas warning during mariadb.run and hive.run from Incoming (new tickets) to WMF-Data on the Data-Engineering board.
Jan 6 2023, 12:55 PM · Data-Engineering, Product-Analytics, Wmfdata-Python
EChetty edited projects for T323562: Number of anonymous edits, added: Data-Engineering; removed Data-Engineering-Planning.
Jan 6 2023, 12:55 PM · Data-Engineering, Data-Engineering-Wikistats
EChetty edited projects for T324993: User stats page, added: Data-Engineering; removed Data-Engineering-Planning.
Jan 6 2023, 12:55 PM · Data-Engineering, Data-Engineering-Wikistats
EChetty edited projects for T324053: Remove Matplotlib as a Wmfdata-Python dependency, added: Data-Engineering; removed Data-Engineering-Planning.
Jan 6 2023, 12:54 PM · Data-Engineering, Product-Analytics, Wmfdata-Python
EChetty edited projects for T324135: Wmfdata-Python triggers a Pandas warning during mariadb.run and hive.run, added: Data-Engineering; removed Data-Engineering-Planning.
Jan 6 2023, 12:54 PM · Data-Engineering, Product-Analytics, Wmfdata-Python
EChetty moved T324681: Add country_meta_data from Incoming (new tickets) to DaaS Work on the Data-Engineering board.
Jan 6 2023, 12:53 PM · Movement-Insights, Data-Engineering, Equity-Landscape
EChetty moved T324968: Access input metrics from Incoming (new tickets) to DaaS Work on the Data-Engineering board.
Jan 6 2023, 12:53 PM · Equity-Landscape
EChetty moved T326330: Update sqoop for CheckUser table from Backlog to To be discussed /To be estimated on the Data Pipelines board.
Jan 6 2023, 12:52 PM · Data Pipelines (Sprint 07), Data-Engineering-Planning, Patch-For-Review
EChetty moved T321557: EventBus' stream config destination_event_service setting should move into producers.mediawikI_eventbus specific settings. from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 12:51 PM · Data-Engineering, Event-Platform, MW-1.40-notes (1.40.0-wmf.8; 2022-10-31)
EChetty moved T325611: Add TikTok's in-app browser to ua-parser library from Backlog to Pipelines on the Data-Engineering-Planning board.
Jan 6 2023, 12:51 PM · Data-Engineering, Data Pipelines, Product-Analytics
EChetty moved T324689: [EPIC] Streaming and event driven Python services from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 12:51 PM · Data-Engineering, Data Engineering and Event Platform Team, Event-Platform
EChetty edited projects for T324681: Add country_meta_data, added: Data-Engineering; removed Data-Engineering-Planning.
Jan 6 2023, 12:51 PM · Movement-Insights, Data-Engineering, Equity-Landscape
EChetty edited projects for T324968: Access input metrics, added: Data-Engineering; removed Data-Engineering-Planning.
Jan 6 2023, 12:50 PM · Equity-Landscape
EChetty moved T324994: Incident: 2022-12-09 api appserver worker starvation from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 12:50 PM · Data-Engineering, Event-Platform, SRE-OnFire, serviceops
EChetty moved T325068: Uneven CPU throttling of eventgate-analytics under load from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 12:49 PM · Data-Platform-SRE, Data-Engineering, SRE-Sprint-Week-Sustainability-March2023, Event-Platform, SRE-OnFire, Sustainability (Incident Followup), serviceops
EChetty moved T325266: Replace refinery-source Guava caches by Caffeine from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 12:48 PM · Event-Platform, Data-Engineering-Planning
EChetty moved T326302: Misconfigured proxies on data-engineering hosts from Backlog to Ops Week on the Data-Engineering-Planning board.
Jan 6 2023, 12:47 PM · Data-Platform-SRE, Data-Engineering
EChetty moved T324796: Document how to show your work in phabricator and/or elsewhere from Backlog to Radar on the Data-Engineering-Planning board.
Jan 6 2023, 12:46 PM · Data-Engineering-Planning
EChetty moved T324660: Install Ceph Cluster for Data Engineering from Backlog to Shared Data Infra on the Data-Engineering-Planning board.
Jan 6 2023, 12:46 PM · Data-Platform-SRE, Epic
EChetty moved T324578: [EPIC] Flink Applications on Kubernetes from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 12:45 PM · Data-Engineering, Data Engineering and Event Platform Team, Event-Platform
EChetty moved T325544: Update refinery-source PageviewDefinition to better handle `Special:` pages from Backlog to Pipelines on the Data-Engineering-Planning board.
Jan 6 2023, 12:45 PM · Data-Engineering, Data Pipelines
EChetty moved T325256: Document known data quality issues on Wikistats from Backlog to Pipelines on the Data-Engineering-Planning board.
Jan 6 2023, 12:45 PM · Data Products, Epic, Data Pipelines
EChetty moved T324108: [SPIKE] Use Flink for batch backfilling from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 12:45 PM · Data-Engineering, Event-Platform
EChetty moved T322602: "Invalid revision ID -1" error for VisualEditorFeatureUse events, mostly from officewiki from Backlog to Pipelines on the Data-Engineering-Planning board.
Jan 6 2023, 12:44 PM · MW-1.41-notes (1.41.0-wmf.4; 2023-04-10), Editing-team (Kanban Board), Data Pipelines, Data-Engineering-Planning, Wikimedia-production-error, WMF-General-or-Unknown
EChetty moved T323562: Number of anonymous edits from Backlog to Radar on the Data-Engineering-Planning board.
Jan 6 2023, 12:44 PM · Data-Engineering, Data-Engineering-Wikistats
EChetty moved T323692: Create puppet defined type for adding/updating/deleting secrets or other small files on HDFS from Backlog to Pipelines on the Data-Engineering-Planning board.
Jan 6 2023, 12:15 PM · Data-Engineering, Data Pipelines, Cassandra
EChetty moved T276088: Configuration Management for Kafka settings from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 12:15 PM · Data-Platform-SRE, Data-Engineering, serviceops-radar, Event-Platform, Analytics-Radar, SRE
EChetty moved T325306: Provide aggregated user device data per-country from Backlog to Pipelines on the Data-Engineering-Planning board.
Jan 6 2023, 12:15 PM · Data-Engineering
EChetty moved T325341: Use a fake timer in EventBus unit test for PageChangeEventSerializerTest::testCreatePageChangeVisibilityEvent from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 12:14 PM · Data-Engineering-Planning, ci-test-error, Event-Platform
EChetty moved T325359: Flink Restart Strategy for Enrichment Service from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 12:14 PM · Data-Engineering-Planning, Event-Platform
EChetty moved T326002: [Event Platform] eventgate-wikimedia occasionally fails to produce events due to stream config fetch errors from Backlog to Pipelines on the Data-Engineering-Planning board.
Jan 6 2023, 12:14 PM · Data-Engineering (Sprint 6), MW-1.41-notes (1.41.0-wmf.28; 2023-09-26), Event-Platform, Data Pipelines
EChetty moved T326229: Provide a mechanism to notify subscribers when page view data is available from Backlog to Radar on the Data-Engineering-Planning board.
Jan 6 2023, 12:14 PM · Data-Engineering, Pageviews-API
EChetty moved T326330: Update sqoop for CheckUser table from Backlog to Pipelines on the Data-Engineering-Planning board.
Jan 6 2023, 12:13 PM · Data Pipelines (Sprint 07), Data-Engineering-Planning, Patch-For-Review
EChetty moved T324757: When moving oozie webrequest-load to airflow/spark avoid the error-check corner case from Backlog to Pipelines on the Data-Engineering-Planning board.
Jan 6 2023, 11:24 AM · Data-Engineering, Data Pipelines
EChetty moved T324953: [NEEDS GROOMING] Integrate Flink Table API in eventutils-python from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 11:23 AM · Data Engineering and Event Platform Team, Data-Engineering, Event-Platform
EChetty moved T324980: Event Driven Enrichment Pipelines repositories should be generated from a template from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 11:23 AM · Event-Platform (Sprint 12), Data-Engineering-Planning
EChetty moved T325273: EventRowTypeInfo should support schema evolution of rows serialized in flink application state from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 11:23 AM · Data-Engineering-Planning, Discovery-Search (Current work), CirrusSearch, Event-Platform
EChetty moved T325303: Deploy mediawiki-page-content-change-enrichment to wikikube k8s from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 11:23 AM · Event-Platform (Sprint 14 B), Patch-For-Review, Epic, Data-Engineering-Planning
EChetty moved T325304: Deploy to YARN from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 11:23 AM · Event-Platform (Sprint 08), Data-Engineering-Planning
EChetty moved T325305: Deploy mediawiki-event-enrichment flink app to DSE k8s from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 11:23 AM · Event-Platform (Sprint 08), Data-Engineering-Planning
EChetty moved T325307: Deploy to production k8s from Backlog to Event Platform on the Data-Engineering-Planning board.
Jan 6 2023, 11:23 AM · Data-Engineering-Planning, Event-Platform
EChetty moved T325527: Check home/HDFS leftovers of ryanmax from Backlog to Ops Week on the Data-Engineering-Planning board.
Jan 6 2023, 11:20 AM · Data-Platform-SRE (2023.12.01 - 2023.12.31)
EChetty moved T326157: Check home/HDFS leftovers of akhatun from Backlog to Ops Week on the Data-Engineering-Planning board.
Jan 6 2023, 11:19 AM · Data-Engineering
EChetty edited projects for T324576: Flink on Kubernetes Helm charts, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Data-Engineering, Event-Platform (Sprint 07), Discovery-Search (Current work), Patch-For-Review, serviceops
EChetty edited projects for T325068: Uneven CPU throttling of eventgate-analytics under load, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Data-Platform-SRE, Data-Engineering, SRE-Sprint-Week-Sustainability-March2023, Event-Platform, SRE-OnFire, Sustainability (Incident Followup), serviceops
EChetty edited projects for T324578: [EPIC] Flink Applications on Kubernetes, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Data-Engineering, Data Engineering and Event Platform Team, Event-Platform
EChetty edited projects for T324796: Document how to show your work in phabricator and/or elsewhere, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Data-Engineering-Planning
EChetty edited projects for T324660: Install Ceph Cluster for Data Engineering, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Data-Platform-SRE, Epic
EChetty edited projects for T324689: [EPIC] Streaming and event driven Python services, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Data-Engineering, Data Engineering and Event Platform Team, Event-Platform
EChetty edited projects for T324681: Add country_meta_data, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Movement-Insights, Data-Engineering, Equity-Landscape
EChetty edited projects for T324746: Flink wrappers and helper libraries should be moved into a dedicated git repo with packaging and CI., added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Event-Platform (Sprint 07), Data-Engineering-Planning
EChetty edited projects for T324951: We should provide utilities for local development and unit testing of Python streaming services, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Event-Platform (Sprint 07), Data-Engineering-Planning
EChetty edited projects for T324757: When moving oozie webrequest-load to airflow/spark avoid the error-check corner case, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Data-Engineering, Data Pipelines
EChetty edited projects for T324994: Incident: 2022-12-09 api appserver worker starvation, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Data-Engineering, Event-Platform, SRE-OnFire, serviceops
EChetty edited projects for T324968: Access input metrics, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Equity-Landscape
EChetty edited projects for T324953: [NEEDS GROOMING] Integrate Flink Table API in eventutils-python, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Data Engineering and Event Platform Team, Data-Engineering, Event-Platform
EChetty edited projects for T324980: Event Driven Enrichment Pipelines repositories should be generated from a template, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Event-Platform (Sprint 12), Data-Engineering-Planning
EChetty edited projects for T324993: User stats page, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Data-Engineering, Data-Engineering-Wikistats
EChetty edited projects for T325273: EventRowTypeInfo should support schema evolution of rows serialized in flink application state, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Data-Engineering-Planning, Discovery-Search (Current work), CirrusSearch, Event-Platform
EChetty edited projects for T325266: Replace refinery-source Guava caches by Caffeine, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:01 AM · Event-Platform, Data-Engineering-Planning
EChetty edited projects for T325303: Deploy mediawiki-page-content-change-enrichment to wikikube k8s, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Event-Platform (Sprint 14 B), Patch-For-Review, Epic, Data-Engineering-Planning
EChetty edited projects for T325304: Deploy to YARN, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Event-Platform (Sprint 08), Data-Engineering-Planning
EChetty edited projects for T325306: Provide aggregated user device data per-country, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Data-Engineering
EChetty edited projects for T325305: Deploy mediawiki-event-enrichment flink app to DSE k8s, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Event-Platform (Sprint 08), Data-Engineering-Planning
EChetty edited projects for T325307: Deploy to production k8s, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Data-Engineering-Planning, Event-Platform
EChetty edited projects for T325341: Use a fake timer in EventBus unit test for PageChangeEventSerializerTest::testCreatePageChangeVisibilityEvent, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Data-Engineering-Planning, ci-test-error, Event-Platform
EChetty edited projects for T325359: Flink Restart Strategy for Enrichment Service, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Data-Engineering-Planning, Event-Platform
EChetty edited projects for T325527: Check home/HDFS leftovers of ryanmax, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Data-Platform-SRE (2023.12.01 - 2023.12.31)
EChetty edited projects for T325611: Add TikTok's in-app browser to ua-parser library, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Data-Engineering, Data Pipelines, Product-Analytics
EChetty edited projects for T326002: [Event Platform] eventgate-wikimedia occasionally fails to produce events due to stream config fetch errors, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Data-Engineering (Sprint 6), MW-1.41-notes (1.41.0-wmf.28; 2023-09-26), Event-Platform, Data Pipelines
EChetty edited projects for T326157: Check home/HDFS leftovers of akhatun, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Data-Engineering
EChetty edited projects for T326229: Provide a mechanism to notify subscribers when page view data is available, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Data-Engineering, Pageviews-API
EChetty edited projects for T326252: k8s deployment-charts mesh module should allow use of mesh without public_port Service, added: Data-Engineering-Planning; removed Data-Engineering.
Jan 6 2023, 11:00 AM · Event-Platform, Data-Engineering-Planning, serviceops