User Details
User Details
- User Since
- Sep 7 2023, 12:13 AM (33 w, 4 d)
- Availability
- Available
- LDAP User
- Ahoelzl
- MediaWiki User
- AHoelzl-WMF [ Global Accounts ]
Fri, Apr 12
Fri, Apr 12
Ahoelzl moved T357372: [Maintenance] Migrate pingback to Airflow from In progress to Done on the Data-Engineering (Q4 2024 April 1st - June 30th) board.
Thu, Apr 11
Thu, Apr 11
Ahoelzl added a comment to T362289: [Refine Refactoring] Adjust Refine schema management code for Iceberg.
closing, part of https://phabricator.wikimedia.org/T356762
Ahoelzl renamed T356762: [Refine refactoring] Extract refine schema management into a dedicated tool from Extract refine schema management into a dedicated tool to [Refine refactoring] Extract refine schema management into a dedicated tool.
Ahoelzl edited projects for T356762: [Refine refactoring] Extract refine schema management into a dedicated tool, added: Data-Engineering (Q4 2024 April 1st - June 30th); removed Data-Engineering.
Ahoelzl updated the task description for T356762: [Refine refactoring] Extract refine schema management into a dedicated tool.
Wed, Apr 10
Wed, Apr 10
Ahoelzl renamed T362291: [Datasets Config] Define and implement SLOs, monitoring and logging from [Datasets Config] Define SLOs, monitoring and logging to [Datasets Config] Define and implement SLOs, monitoring and logging.
Thu, Apr 4
Thu, Apr 4
Ahoelzl set the point value for T346046: [Search Update Pipeline] Source streams for private wikis to 13.
Ahoelzl set the point value for T361094: [NEEDS GROOMING] Orchestrate gobblin ingestion task with Airflow to 8.
Ahoelzl set the point value for T361770: Support metrics platform backend migration to new service runner to 3.
Ahoelzl set the point value for T361769: Migrate and re-deploy eventstreams using new service runner to 8.
Wed, Apr 3
Wed, Apr 3
Ahoelzl renamed T360924: Replace service runner with a simplified library to better support metrics and debugging from Re-implement service runner to better support metrics and debugging to Replace service runner with a simplified library to better support metrics and debugging.
Ahoelzl renamed T360924: Replace service runner with a simplified library to better support metrics and debugging from Improve service runner to better support metrics and debugging to Re-implement service runner to better support metrics and debugging.
Mon, Apr 1
Mon, Apr 1
Ahoelzl added a comment to T360924: Replace service runner with a simplified library to better support metrics and debugging.
Update: the new version should undergo a security review.
Ahoelzl added a comment to T361509: [Spike] Define technology roadmap around Airflow / k8s / ceph.
Regarding Cepth we have several efforts in flight:
- CEPH (data-sre)
- Swift (multimedia + ad hoc use cases) https://wikitech.wikimedia.org/wiki/Swift
- https://wikitech.wikimedia.org/wiki/MOSS (multimedia + misc use cases) https://phabricator.wikimedia.org/T279621
Ahoelzl set the point value for T361509: [Spike] Define technology roadmap around Airflow / k8s / ceph to 5.
Ahoelzl set the point value for T361503: [Spike] [Maintenance] Define late arrival event strategy and idem-potent backfilling concept. to 3.
Ahoelzl set the point value for T361501: [Refine Refactoring] Configure and deploy all Refine data sets for parallel production processing and testing to 13.
Ahoelzl renamed T361498: [Spike] [Refine Refactoring] List out all production Refine datasets that need to be migrated to the config store (Airflow and Iceberg) from [Spike] List out all production Refine datasets that need to be migrated to the config store (Airflow and Iceberg) to [Spike] [Refine Refactoring] List out all production Refine datasets that need to be migrated to the config store (Airflow and Iceberg).
Ahoelzl renamed T361499: [Maintenance] Resolve long launch times for canary events on Airflow (30mins in total) from Resolve long launch times for canary events on Airflow (30mins in total) to [Maintenance] Resolve long launch times for canary events on Airflow (30mins in total).
Mar 27 2024
Mar 27 2024
Mar 26 2024
Mar 26 2024
Ahoelzl renamed T360922: [Status Store] [SPIKE] Investigate and document approach for Iceberg Sensors from [Status Store] [SPIKE] Document Approach for Iceberg Sensors to [Status Store] [SPIKE] Investigate and document approach for Iceberg Sensors.
Mar 25 2024
Mar 25 2024
Mar 21 2024
Mar 21 2024
Ahoelzl moved T357372: [Maintenance] Migrate pingback to Airflow from Next Up to In progress on the Data-Engineering (Sprint 9) board.
Ahoelzl moved T356424: [Maintenance] Migrate cx ReportUpdater job from In progress to Done on the Data-Engineering (Sprint 9) board.
Ahoelzl moved T357430: Airflow mapped tasks UI & metrics from Next Up to In progress on the Data-Engineering (Sprint 9) board.
Mar 7 2024
Mar 7 2024
Ahoelzl moved T351093: [Data Quality] Define concept for Alerting in coordination with SRE from In progress to Done on the Data-Engineering (Sprint 9) board.
Ahoelzl added a comment to T351093: [Data Quality] Define concept for Alerting in coordination with SRE.
SRE / Brian King is driving that going forward.
Feb 22 2024
Feb 22 2024
Feb 21 2024
Feb 21 2024
Ahoelzl added a comment to T354694: [Maintenance] Safeguard VarnishKafka to HAProxy analytics transition.
For sync meeting updates from 2/20, see project document.
Traffic team is working on configuration management and will be ready for stream production in a few weeks.
Ahoelzl updated the task description for T354694: [Maintenance] Safeguard VarnishKafka to HAProxy analytics transition.
Feb 15 2024
Feb 15 2024
Feb 8 2024
Feb 8 2024
Ahoelzl moved T356424: [Maintenance] Migrate cx ReportUpdater job from Next Up to In progress on the Data-Engineering (Sprint 8) board.
Feb 7 2024
Feb 7 2024
Ahoelzl moved T356866: [Data Quality] Update data_quality schemas to be compatible with Iceberg tables from Sprint 8 to Sprint 9 on the Data-Engineering board.
Ahoelzl changed the point value for T356866: [Data Quality] Update data_quality schemas to be compatible with Iceberg tables from 1 to 3.
Ahoelzl edited projects for T356866: [Data Quality] Update data_quality schemas to be compatible with Iceberg tables, added: Data-Engineering (Sprint 8); removed Data-Engineering.
Ahoelzl renamed T356866: [Data Quality] Update data_quality schemas to be compatible with Iceberg tables from Update data_quality schemas to be compatible with Iceberg tables to [Data Quality] Update data_quality schemas to be compatible with Iceberg tables.
Feb 6 2024
Feb 6 2024
Ahoelzl updated subscribers of T333537: Deprecate WMDE Technical Wishes reportupdater jobs.
Feb 5 2024
Feb 5 2024
Ahoelzl updated subscribers of T354692: [Data Quality] Implement basic data quality metrics for MW history.
@JAllemandou mentioned existing MW checks that should be migrated.
Feb 2 2024
Feb 2 2024
Ahoelzl updated subscribers of T354692: [Data Quality] Implement basic data quality metrics for MW history.
Ahoelzl updated the task description for T354692: [Data Quality] Implement basic data quality metrics for MW history.
Feb 1 2024
Feb 1 2024
Ahoelzl added a comment to T356364: [Maintenance] Migrate Gitlab CI to blubber.
Ahoelzl moved T356364: [Maintenance] Migrate Gitlab CI to blubber from Next Up to In Review on the Data-Engineering (Sprint 8) board.
Ahoelzl set the point value for T356363: [Refine Refactoring] Refactor refinery code for compatibility with Airflow integration to 8.
Ahoelzl set the point value for T356362: [Refine Refactoring] [Spike] Define a concept and provide a PoC for dynamic DAG execution in Airflow to 5.
Ahoelzl set the point value for T356360: [Refine Refactoring] Orchestrate Airflow execution of navigationtiming from config store to 8.
Ahoelzl updated the task description for T356360: [Refine Refactoring] Orchestrate Airflow execution of navigationtiming from config store.
Ahoelzl edited projects for T356192: [Refine refactoring] Refactor and migrate navigationtiming to Airflow, added: Data-Engineering; removed Data-Engineering (Sprint 8).
Ahoelzl renamed T355542: [Dataset Config Store] - Define config API for navigationtiming and implement local development instance from [Dataset Config Store] - Proof of Concept to [Dataset Config Store] - Define config API for navigationtiming and implement local development instance.
Jan 31 2024
Jan 31 2024
Ahoelzl edited projects for T348774: [Maintenance] Add a deletion job for `hdfs_usage` data, added: Data-Engineering; removed Data-Engineering (Sprint 8).
Jan 30 2024
Jan 30 2024
Ahoelzl set the point value for T349763: [Data Quality] Develop Airflow post processing instrumentation to collect and log configurable data metrics to 13.
Ahoelzl set the point value for T352669: [Iceberg Migration] Migrate aqs hourly tables to Iceberg to 5.
Ahoelzl set the point value for T352670: [Iceberg Migration] Migrate browser_general tables to Iceberg to 5.
Ahoelzl changed the point value for T351093: [Data Quality] Define concept for Alerting in coordination with SRE from 8 to 5.
Ahoelzl set the point value for T351093: [Data Quality] Define concept for Alerting in coordination with SRE to 8.
Ahoelzl set the point value for T354694: [Maintenance] Safeguard VarnishKafka to HAProxy analytics transition to 8.
Ahoelzl added a comment to T351093: [Data Quality] Define concept for Alerting in coordination with SRE.
SRE presented a work document for Alert Review:
https://docs.google.com/document/d/1PQKabMx9qoAKQS6qlHJDs2z2B_Bum_KqLYRaZ1pzXGc/edit
Ahoelzl added a comment to T355606: Requesting analytics-privatedata-users access for amastilovic.
@Eevans @ABran-WMF can you help us move this over the finishing line? It's blocking Aleksandar from getting productive in the data engineering team. Thank you!