odimitrijevic (Olja Dimitrijevic)
User

User Details

User Since
Apr 28 2021, 12:42 AM (68 w, 1 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
ODimitrijevic (WMF)

Recent Activity

Mon, Aug 8

odimitrijevic added a project to T259804: Rename geoeditors_blacklist_country: Data Engineering Planning.
Mon, Aug 8, 3:17 PM · Data Engineering Planning, Analytics-Clusters, Voice & Tone

Thu, Jul 28

odimitrijevic assigned T311176: Add xcollazo to analytics-admins to RKemper.
Thu, Jul 28, 9:17 PM · SRE-Access-Requests, SRE, Data Engineering Planning
odimitrijevic assigned T313316: requesting Kerberos password for mikeraish (MRaishWMF) to RKemper.
Thu, Jul 28, 9:16 PM · Data-Engineering
odimitrijevic added a comment to T313316: requesting Kerberos password for mikeraish (MRaishWMF).

Request is approved.

Thu, Jul 28, 9:15 PM · Data-Engineering
odimitrijevic added a comment to T313616: Requesting access to the Desktop Improvements project statistics for SGrabarczuk.

Approved

Thu, Jul 28, 3:36 PM · SRE, SRE-Access-Requests
odimitrijevic added a comment to T313429: Requesting access to private-data for Mikeraish (MRaishWMF).

Approved

Thu, Jul 28, 3:36 PM · Patch-For-Review, SRE, SRE-Access-Requests
odimitrijevic added a comment to T313213: Grant Access to analytics-privatedata-users for Segun Oworu.

Approved!

Thu, Jul 28, 3:36 PM · SRE, SRE-Access-Requests

Wed, Jul 27

odimitrijevic edited projects for T313859: Document destination_event_service Event Platform stream configuration, added: Data Engineering Planning; removed Data-Engineering.
Wed, Jul 27, 6:23 PM · Data Engineering Planning

Tue, Jul 26

odimitrijevic created T313859: Document destination_event_service Event Platform stream configuration.
Tue, Jul 26, 8:59 PM · Data Engineering Planning

Jun 22 2022

odimitrijevic added a project to T308047: Document the Pageviews Dataset: Data-Engineering-Kanban.
Jun 22 2022, 3:02 PM · Data Engineering Planning, Data-Catalog
odimitrijevic moved T308047: Document the Pageviews Dataset from Next Up to In Progress on the Data-Catalog board.
Jun 22 2022, 3:02 PM · Data Engineering Planning, Data-Catalog
odimitrijevic assigned T308047: Document the Pageviews Dataset to JAllemandou.
Jun 22 2022, 3:02 PM · Data Engineering Planning, Data-Catalog

Jun 16 2022

odimitrijevic moved T266640: Decide whether to migrate from Presto to Trino from Incoming to Analyze on the Data-Engineering board.
Jun 16 2022, 11:21 PM · Data-Engineering, Patch-For-Review
odimitrijevic edited projects for T232795: We are not capturing IPs of original requests for proxied requests from operamini and googleweblight. x-forwarded-for is null and client-ip is the same as IP on Webrequest data, added: Data-Engineering-Icebox; removed Data-Engineering.
Jun 16 2022, 11:18 PM · Data-Engineering-Icebox, Traffic-Icebox, SRE
odimitrijevic created T310846: Improve Bot Detection Heuristics.
Jun 16 2022, 11:05 PM · Data-Engineering-Icebox
odimitrijevic moved T273086: Downloading from Archiva.wikimedia.org seems slower than Maven Central from Incoming to Ops on the Data-Engineering board.
Jun 16 2022, 4:03 PM · Data-Engineering, SRE
odimitrijevic added a project to T273086: Downloading from Archiva.wikimedia.org seems slower than Maven Central: Data-Engineering.
Jun 16 2022, 4:02 PM · Data-Engineering, SRE

Jun 10 2022

odimitrijevic closed T293643: Data Catalog Technical Evaluation as Resolved.
Jun 10 2022, 11:23 PM · Data-Catalog, Data-Engineering-Kanban, Epic, Data-Engineering
odimitrijevic added a comment to T293643: Data Catalog Technical Evaluation.

Completed: https://wikitech.wikimedia.org/wiki/Data_Catalog_Application_Evaluation/Rubric/data-catalog-evaluation_server_notes

Jun 10 2022, 11:23 PM · Data-Catalog, Data-Engineering-Kanban, Epic, Data-Engineering

Jun 8 2022

odimitrijevic claimed T310229: Data Catalog Documentation Style Guide.
Jun 8 2022, 9:02 PM · Data Engineering Planning, Data-Catalog
odimitrijevic moved T310229: Data Catalog Documentation Style Guide from MVP to Next Up on the Data-Catalog board.
Jun 8 2022, 9:02 PM · Data Engineering Planning, Data-Catalog
odimitrijevic created T310229: Data Catalog Documentation Style Guide.
Jun 8 2022, 9:02 PM · Data Engineering Planning, Data-Catalog
odimitrijevic created T310203: Data Catalog Demo.
Jun 8 2022, 5:12 PM · Data Engineering Planning, Data-Catalog

Jun 7 2022

odimitrijevic created T310079: Upgrade DataHub V0.8.38.
Jun 7 2022, 2:48 PM · Data-Engineering, Data-Engineering-Kanban, Data-Catalog

Jun 6 2022

odimitrijevic added a comment to T142073: Improve user management for AQS Cassandra.

@Eevans Is this request still relevant given the latest AQS plans?

Jun 6 2022, 5:47 PM · Cassandra, Data-Engineering, User-Elukey, Pageviews-API
odimitrijevic closed T303453: Procure MaxMind GeoIP2 Database License, a subtask of T302989: Migrate to MaxMind GeoIP2, as Resolved.
Jun 6 2022, 5:16 PM · Data-Engineering
odimitrijevic closed T303453: Procure MaxMind GeoIP2 Database License as Resolved.

The two licenses have been extended to 2023-05-13.

Jun 6 2022, 5:16 PM · Data-Engineering-Kanban, Data-Engineering
odimitrijevic added a project to T308311: TypeError: navigator.sendBeacon is not a function: Data-Engineering-Radar.
Jun 6 2022, 4:01 PM · Data-Engineering-Radar, MW-1.39-notes (1.39.0-wmf.12; 2022-05-16), Metrics-Platform
odimitrijevic triaged T309382: DataHub rights assignment is case-sensitive as High priority.
Jun 6 2022, 2:38 PM · Data Engineering Planning, Data-Catalog
odimitrijevic added a comment to T309738: Move Mediawiki QueryPages computation to Hadoop.

@Milimetric To evaluate the impact of doing this work, do we have information on how frequently these queries run, how long they take, and what resources they consume?

Jun 6 2022, 2:33 PM · Data-Persistence (Consultation), Data-Engineering
odimitrijevic moved T309922: Create `research` hive user from Incoming to Ops Week on the Data-Engineering board.
Jun 6 2022, 2:25 PM · Data-Engineering-Kanban, Data-Engineering
odimitrijevic triaged T309922: Create `research` hive user as High priority.
Jun 6 2022, 2:25 PM · Data-Engineering-Kanban, Data-Engineering

May 30 2022

odimitrijevic triaged T309007: Migrate eventlogging check_prometheus checks to alertmanager as Medium priority.
May 30 2022, 3:17 PM · Data-Engineering
odimitrijevic triaged T309000: Check home/HDFS leftovers of razzi as High priority.
May 30 2022, 3:16 PM · Data-Engineering-Kanban, Data-Engineering
odimitrijevic moved T309000: Check home/HDFS leftovers of razzi from Incoming to Ops Week on the Data-Engineering board.
May 30 2022, 3:16 PM · Data-Engineering-Kanban, Data-Engineering
odimitrijevic moved T308778: Fix turnilo after upgrade from Incoming to Ops on the Data-Engineering board.
May 30 2022, 3:15 PM · Patch-For-Review, Data-Engineering-Kanban, Data-Engineering

May 26 2022

odimitrijevic added a comment to T303453: Procure MaxMind GeoIP2 Database License.

The license has been extended to 7-13. The payment of the yearly subscription is still in progress.

May 26 2022, 5:44 AM · Data-Engineering-Kanban, Data-Engineering
odimitrijevic moved T303453: Procure MaxMind GeoIP2 Database License from Next Up to In Progress on the Data-Engineering-Kanban board.
May 26 2022, 5:44 AM · Data-Engineering-Kanban, Data-Engineering

May 19 2022

odimitrijevic removed a project from T282131: Determine which remaining legacy EventLogging schemas need to be migrated or decommissioned: Data-Engineering-Kanban.
May 19 2022, 5:02 PM · MW-1.38-notes (1.38.0-wmf.26; 2022-03-14), Data-Engineering, Fundraising-Backlog, Better Use Of Data, Product-Analytics, Product-Data-Infrastructure, Analytics-Kanban, MediaWiki-extensions-EventLogging, Event-Platform Value Stream
odimitrijevic moved T308441: Error when updating dashboard from Incoming to Visualize on the Data-Engineering board.

@Pablo Does removing the emojis resolve the issue?

May 19 2022, 4:31 PM · Product-Analytics, Data-Engineering-Kanban, Superset, Data-Engineering
odimitrijevic added a project to T308766: Fix airflow interlanguage job: Data Pipelines.
May 19 2022, 4:04 PM · Data Pipelines, Data-Engineering-Kanban, Data-Engineering
odimitrijevic added a project to T308767: Fix api_daily job: Data Pipelines.
May 19 2022, 4:04 PM · Patch-For-Review, Data Pipelines, Data-Engineering-Kanban, Data-Engineering

Apr 26 2022

odimitrijevic added a project to T293647: Data Catalog POC: Data-Catalog.
Apr 26 2022, 10:06 PM · Data-Catalog, Epic, Data-Engineering

Apr 7 2022

odimitrijevic placed T301895: Help with data that's not appearing on charts up for grabs.

@Iflorez Is this still a problem?

Apr 7 2022, 7:15 PM · Data-Engineering-Kanban, Superset, Data-Engineering, Product-Analytics
odimitrijevic closed T294258: Data Catalog Requirements as Resolved.
Apr 7 2022, 7:01 PM · Data-Catalog, Epic, Data-Engineering-Kanban, Data-Engineering
odimitrijevic added a comment to T294258: Data Catalog Requirements.

Requirements and evaluation have been posted on wikitech: https://wikitech.wikimedia.org/wiki/Data_Catalog_Application_Evaluation/Rubric

Apr 7 2022, 7:01 PM · Data-Catalog, Epic, Data-Engineering-Kanban, Data-Engineering
odimitrijevic closed T294259: Presto/Superset User Experience Improvement as Resolved.

Closing epic given that we are focusing on other priorities. There are outstanding tasks which will be prioritized independently.

Apr 7 2022, 6:59 PM · Superset, Epic, Data-Engineering-Kanban, Data-Engineering
odimitrijevic removed a parent task for T269832: Add a presto query logger: T294259: Presto/Superset User Experience Improvement.
Apr 7 2022, 6:57 PM · Patch-For-Review, Data-Engineering
odimitrijevic removed a parent task for T294772: Superset Timeout Logging: T294259: Presto/Superset User Experience Improvement.
Apr 7 2022, 6:57 PM · Superset, Data-Engineering
odimitrijevic removed subtasks for T294259: Presto/Superset User Experience Improvement: T297120: Try to improve the LDAP integration for Superset user account creation, T295589: Upgrade Presto to access UDF library improvements, T269832: Add a presto query logger, T294772: Superset Timeout Logging.
Apr 7 2022, 6:57 PM · Superset, Epic, Data-Engineering-Kanban, Data-Engineering
odimitrijevic removed a parent task for T297120: Try to improve the LDAP integration for Superset user account creation: T294259: Presto/Superset User Experience Improvement.
Apr 7 2022, 6:57 PM · Data-Engineering
odimitrijevic removed a parent task for T295589: Upgrade Presto to access UDF library improvements: T294259: Presto/Superset User Experience Improvement.
Apr 7 2022, 6:57 PM · Data-Engineering
odimitrijevic removed a subtask for T294259: Presto/Superset User Experience Improvement: T266640: Decide whether to migrate from Presto to Trino.
Apr 7 2022, 6:56 PM · Superset, Epic, Data-Engineering-Kanban, Data-Engineering
odimitrijevic removed a parent task for T266640: Decide whether to migrate from Presto to Trino: T294259: Presto/Superset User Experience Improvement.
Apr 7 2022, 6:56 PM · Data-Engineering, Patch-For-Review
odimitrijevic removed a project from T294772: Superset Timeout Logging: Data-Engineering-Kanban.
Apr 7 2022, 6:44 PM · Superset, Data-Engineering
odimitrijevic added a comment to T292389: Automate kerberos credential creation and management to ease the creation of testing infrastructure.

@elukey @Majavah Following up on this task, is the merging of the patch blocked? Should this be deprioritized for now or is there anything that others in the DE team can do to help complete the work?

Apr 7 2022, 6:19 PM · Patch-For-Review, Data-Engineering, Analytics-Kanban

Apr 4 2022

odimitrijevic added a comment to T305298: Requesting access to Analytic Cluster for Research Intern (paramita_das).

Approved!

Apr 4 2022, 2:38 PM · SRE, SRE-Access-Requests

Mar 25 2022

Dzahn awarded T302864: Maxmind: GeoIP Download Failed a Like token.
Mar 25 2022, 12:21 AM · Data-Engineering-Kanban, Data-Engineering, SRE, procurement, serviceops, Analytics, Traffic

Mar 24 2022

odimitrijevic added a project to T304651: Spike: Investigate creating robust alerts to notify that caching nodes are not sending traffic data: Traffic.
Mar 24 2022, 8:14 PM · Data-Engineering, SRE, Traffic, Data-Engineering-Kanban
odimitrijevic renamed T304651: Spike: Investigate creating robust alerts to notify that caching nodes are not sending traffic data from Spike: Investigate importing etcd config to help write robust data loss alerts to Spike: Investigate creating robust alerts to notify that caching nodes are not sending traffic data.
Mar 24 2022, 8:10 PM · Data-Engineering, SRE, Traffic, Data-Engineering-Kanban
odimitrijevic added a subtask for T300164: Pageview Data loss due to wrong version of package installed on some varnishkafka instances: T304617: Lock-in Varnish and VarnishKafka versions.
Mar 24 2022, 8:07 PM · Data-Engineering-Kanban, Data-Engineering
odimitrijevic added a parent task for T304617: Lock-in Varnish and VarnishKafka versions: T300164: Pageview Data loss due to wrong version of package installed on some varnishkafka instances.
Mar 24 2022, 8:07 PM · Data-Engineering-Radar, SRE, Traffic
odimitrijevic created T304617: Lock-in Varnish and VarnishKafka versions.
Mar 24 2022, 3:39 PM · Data-Engineering-Radar, SRE, Traffic
odimitrijevic moved T303049: New Service Request: DataHub from Incoming to Security & Governance on the Data-Engineering board.
Mar 24 2022, 3:21 PM · Patch-For-Review, serviceops, Data-Catalog, Data-Engineering, Service-deployment-requests, Services, SRE
odimitrijevic closed T303465: Purge GeoIP2 datasets as per the licensing agreement, a subtask of T302989: Migrate to MaxMind GeoIP2, as Declined.
Mar 24 2022, 3:20 PM · Data-Engineering
odimitrijevic closed T303465: Purge GeoIP2 datasets as per the licensing agreement as Declined.
Mar 24 2022, 3:20 PM · Data-Engineering
odimitrijevic moved T303381: Streamline CI for our fork of DataHub from Incoming to Security & Governance on the Data-Engineering board.
Mar 24 2022, 3:20 PM · Data-Engineering, Data-Catalog
odimitrijevic closed T303461: Download the Maxmind Geoip2 Databases, a subtask of T302989: Migrate to MaxMind GeoIP2, as Declined.
Mar 24 2022, 3:19 PM · Data-Engineering
odimitrijevic closed T303461: Download the Maxmind Geoip2 Databases as Declined.
Mar 24 2022, 3:19 PM · Data-Engineering
odimitrijevic moved T304572: Herald rule to add Product Analytics and Data Engineering tags to Wmfdata-Python tasks from Incoming to Ops on the Data-Engineering board.
Mar 24 2022, 3:18 PM · Data Engineering Planning, Data-Engineering-Kanban, Product-Analytics, wmfdata-python, Phabricator

Mar 23 2022

odimitrijevic updated the task description for T303464: Disable GeoIP Legacy Download / Identify all users of legacy (v1) GeoIP datasets and inform them of the need to switch to GeoIP2 dataset.
Mar 23 2022, 4:41 AM · Trust-and-Safety, serviceops, Traffic, SRE, Data-Engineering
odimitrijevic moved T302391: [Airflow] Refactor jobs to not use DAG factories from Incoming to Transform on the Data-Engineering board.
Mar 23 2022, 4:36 AM · Data-Engineering-Kanban, Data-Engineering, Data Pipelines
odimitrijevic raised the priority of T304224: Archiva's disk partiton space is getting filled up from Medium to High.
Mar 23 2022, 4:35 AM · Data-Engineering-Kanban, Data-Engineering
odimitrijevic triaged T304065: Check home/HDFS leftovers of clarakosi as High priority.
Mar 23 2022, 4:31 AM · Data-Engineering-Kanban, Data-Engineering
odimitrijevic moved T304065: Check home/HDFS leftovers of clarakosi from Incoming to Ops Week on the Data-Engineering board.
Mar 23 2022, 4:31 AM · Data-Engineering-Kanban, Data-Engineering
odimitrijevic edited projects for T304379: Non-deterministic unit test "streamInSample() - session sampling resets", added: Data-Engineering-Radar; removed Data-Engineering-Kanban.
Mar 23 2022, 4:30 AM · Metrics-Platform, Data-Engineering-Radar, MediaWiki-extensions-EventLogging
odimitrijevic added projects to T304379: Non-deterministic unit test "streamInSample() - session sampling resets": Event-Platform Value Stream, Data-Engineering-Kanban.
Mar 23 2022, 4:30 AM · Metrics-Platform, Data-Engineering-Radar, MediaWiki-extensions-EventLogging
odimitrijevic moved T302263: The network_internal druid load job fails if data is not present from Incoming to Ops Week on the Data-Engineering board.
Mar 23 2022, 4:29 AM · Data-Engineering, Patch-For-Review, Data-Engineering-Kanban
odimitrijevic edited projects for T302263: The network_internal druid load job fails if data is not present, added: Data-Engineering; removed Data-Engineering-Radar.

Now that the drmrs DC is operational, this should be decided upon. As part of the work, let's ensure that the data is collected as expected and that it is reported correctly. Are there any quality checks that should be implemented for due diligence as part of a different task?

Mar 23 2022, 4:28 AM · Data-Engineering, Patch-For-Review, Data-Engineering-Kanban

Mar 17 2022

odimitrijevic placed T298893: [Airflow] Troubleshoot MySQL connection issues up for grabs.
Mar 17 2022, 2:41 PM · Data-Engineering-Kanban, Data Pipelines, Data-Engineering

Mar 16 2022

odimitrijevic triaged T303977: Resume Webrequest Data Purge Job as High priority.
Mar 16 2022, 4:51 PM · Data-Engineering, Data-Engineering-Kanban
odimitrijevic created T303977: Resume Webrequest Data Purge Job.
Mar 16 2022, 4:49 PM · Data-Engineering, Data-Engineering-Kanban

Mar 14 2022

odimitrijevic added projects to T303464: Disable GeoIP Legacy Download / Identify all users of legacy (v1) GeoIP datasets and inform them of the need to switch to GeoIP2 dataset: SRE, Traffic, serviceops, Trust-and-Safety.
Mar 14 2022, 5:39 AM · Trust-and-Safety, serviceops, Traffic, SRE, Data-Engineering
odimitrijevic renamed T303464: Disable GeoIP Legacy Download / Identify all users of legacy (v1) GeoIP datasets and inform them of the need to switch to GeoIP2 dataset from Deprecate GeoIP Legacy Download to Disable GeoIP Legacy Download.
Mar 14 2022, 5:38 AM · Trust-and-Safety, serviceops, Traffic, SRE, Data-Engineering
odimitrijevic closed T302989: Migrate to MaxMind GeoIP2 as Declined.

Data engineering already uses GeoIP2 datasets.

Mar 14 2022, 5:32 AM · Data-Engineering
odimitrijevic moved T303464: Disable GeoIP Legacy Download / Identify all users of legacy (v1) GeoIP datasets and inform them of the need to switch to GeoIP2 dataset from Incoming to Datasets on the Data-Engineering board.
Mar 14 2022, 5:30 AM · Trust-and-Safety, serviceops, Traffic, SRE, Data-Engineering
odimitrijevic updated the task description for T303464: Disable GeoIP Legacy Download / Identify all users of legacy (v1) GeoIP datasets and inform them of the need to switch to GeoIP2 dataset.
Mar 14 2022, 5:29 AM · Trust-and-Safety, serviceops, Traffic, SRE, Data-Engineering
odimitrijevic moved T302392: Unifying HDFS Sensor and FSSPEC Sensor from Incoming to Transform on the Data-Engineering board.
Mar 14 2022, 5:23 AM · Data-Engineering, Data-Engineering-Kanban, Data Pipelines
odimitrijevic moved T303473: Variabilization of existing jobs from Incoming to Transform on the Data-Engineering board.
Mar 14 2022, 5:23 AM · Data-Engineering, Data Pipelines
odimitrijevic moved T303199: [Airflow] Troubleshoot traffic anomaly detection job from Incoming to Transform on the Data-Engineering board.
Mar 14 2022, 5:22 AM · Data Pipelines, Data-Engineering
odimitrijevic moved T303201: Airflow-triggered Spark-jobs produce hdfs-files belonging to the wrong hdfs-user-group from Incoming to Transform on the Data-Engineering board.
Mar 14 2022, 5:21 AM · Data-Engineering-Kanban, Data Pipelines, Data-Engineering

Mar 11 2022

odimitrijevic moved T303193: Projectviews by country Airflow job from Incoming to Datasets on the Data-Engineering board.
Mar 11 2022, 4:48 PM · Data Pipelines, Data-Engineering-Kanban
odimitrijevic moved T299732: Implement top endpoint of the pageviews API from Incoming to Serve on the Data-Engineering board.
Mar 11 2022, 4:47 PM · API Platform, Data-Engineering, User-Eevans, Platform Engineering Roadmap
odimitrijevic moved T299466: Q3:(Need By: TBD) rack/setup/install stat1009 from Incoming to Ops on the Data-Engineering board.
Mar 11 2022, 4:46 PM · Data-Engineering, SRE, ops-eqiad, DC-Ops
odimitrijevic closed T303463: Modify Refine jobs to use GeoIP2 databases as Declined.
Mar 11 2022, 4:45 PM · Data-Engineering
odimitrijevic closed T303463: Modify Refine jobs to use GeoIP2 databases, a subtask of T302989: Migrate to MaxMind GeoIP2, as Declined.
Mar 11 2022, 4:45 PM · Data-Engineering
odimitrijevic closed T303466: Identify Opportunities in using the new GeoIP2 databases, a subtask of T302989: Migrate to MaxMind GeoIP2, as Declined.
Mar 11 2022, 4:45 PM · Data-Engineering
odimitrijevic closed T303466: Identify Opportunities in using the new GeoIP2 databases as Declined.

The data engineering team is already on GeoIP2.

Mar 11 2022, 4:45 PM · Spike, Data-Engineering
odimitrijevic added projects to T302728: Analytics Platform Future State Planing: Data-Engineering-Kanban, Epic.
Mar 11 2022, 4:35 PM · Epic, Data-Engineering-Kanban, Data-Engineering
odimitrijevic closed T303462: Modify Matamo to use GeoIP2 databases, a subtask of T302989: Migrate to MaxMind GeoIP2, as Declined.
Mar 11 2022, 4:34 PM · Data-Engineering