Page MenuHomePhabricator

amastilovic (Aleksandar Mastilovic)
User

Projects

User does not belong to any projects.

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Thursday

  • Clear sailing ahead.

User Details

User Since
Jan 20 2024, 12:05 AM (34 w, 3 d)
Availability
Available
IRC Nick
amastilovic
LDAP User
Aleksandar Mastilovic
MediaWiki User
AMastilovic-WMF [ Global Accounts ]

Recent Activity

Yesterday

amastilovic added a comment to T372014: Problem deploying - missing airflow_client dependency.

This was resolved some time ago when SRE released updated Airflow Debian package.

Mon, Sep 16, 10:30 PM · Dumps 2.0 (Kanban Board), Data-Engineering (Q1 2024 July 1st - September 30th)

Thu, Sep 5

amastilovic edited projects for T367404: Update parent pom to disable fetching dependencies from Archiva and use Gitlab instead, added: Data-Engineering (Q1 2024 July 1st - September 30th); removed Data-Engineering.
Thu, Sep 5, 4:53 PM · Data-Platform-SRE (2024.09.06 - 2024.09.27), Data-Engineering (Q1 2024 July 1st - September 30th), Patch-For-Review, Java-Scala-Standardization, Discovery-Search
amastilovic added a project to T369901: Migrate wmf-jvm-parent-pom and supporting components to the Maven group on Gitlab: Data-Engineering (Q1 2024 July 1st - September 30th).
Thu, Sep 5, 4:21 PM · Data-Platform-SRE (2024.09.06 - 2024.09.27), Data-Engineering (Q1 2024 July 1st - September 30th), Patch-For-Review, Java-Scala-Standardization

Fri, Aug 23

amastilovic added a comment to T372647: Implement Airflow Dataset class for RestExternalTaskSensor.

Maybe just change the name of the file to data_dependencies.yaml and the module to DataDependency?
And then the user would use it like data_dependency("data-dependency-name").get_sensor_for(dag)?
With this, the semantic weirdness would be solved, no?

Fri, Aug 23, 7:09 PM · Dumps 2.0 (Kanban Board), Data-Engineering (Q1 2024 July 1st - September 30th)

Aug 7 2024

amastilovic updated subscribers of T365659: Implement automatic sync of refinery HQL files to HDFS.

I talked to @BTullis about obtaining a functional test environment that would mimic the real world this service would be operating in, and he kindly provided a list of things to do in order to build such an environment. The list is in the subtask ticket https://phabricator.wikimedia.org/T371994

Aug 7 2024, 6:22 PM · Data-Engineering (Q1 2024 July 1st - September 30th)
amastilovic added a subtask for T365659: Implement automatic sync of refinery HQL files to HDFS: T371994: Obtain SRE resources needed to test the HDFS synchronizer service.
Aug 7 2024, 5:36 PM · Data-Engineering (Q1 2024 July 1st - September 30th)
amastilovic added a parent task for T371994: Obtain SRE resources needed to test the HDFS synchronizer service: T365659: Implement automatic sync of refinery HQL files to HDFS.
Aug 7 2024, 5:36 PM · Data-Engineering (Q1 2024 July 1st - September 30th)
amastilovic created T371994: Obtain SRE resources needed to test the HDFS synchronizer service.
Aug 7 2024, 5:35 PM · Data-Engineering (Q1 2024 July 1st - September 30th)
amastilovic updated the task description for T360968: [Developer Experience] [SPIKE] Investigate process to automate deployment of folders and artifacts to HDFS.
Aug 7 2024, 5:23 PM · Data-Engineering (Q1 2024 July 1st - September 30th), Release-Engineering-Team, Spike
amastilovic updated the task description for T360922: [Status Store] [SPIKE] Investigate and document approach for Iceberg Sensors.
Aug 7 2024, 4:53 PM · Data-Engineering (Q1 2024 July 1st - September 30th), Dumps 2.0 (Kanban Board), Spike
amastilovic updated the task description for T369900: Develop Airflow ExternalTaskSensor to orchestrate DAG dependencies.
Aug 7 2024, 3:57 PM · Dumps 2.0 (Kanban Board), Data-Engineering (Q1 2024 July 1st - September 30th)
amastilovic updated the task description for T369900: Develop Airflow ExternalTaskSensor to orchestrate DAG dependencies.
Aug 7 2024, 3:57 PM · Dumps 2.0 (Kanban Board), Data-Engineering (Q1 2024 July 1st - September 30th)

Jul 25 2024

amastilovic moved T369851: NEW BUG REPORT Mediawiki_history contains duplicate rows for some revisions from Next Up to In progress on the Data-Engineering (Q1 2024 July 1st - September 30th) board.
Jul 25 2024, 4:27 PM · Data-Engineering (Q1 2024 July 1st - September 30th), Movement-Insights, Analytics-Data-Problem, Data-Platform

Jul 22 2024

amastilovic added a comment to T367403: Validate CI integration so that Ci can release Maven artifacts on user's demand.

Also, could we add a settings.xml file with the following contents to the Docker image? It's necessary for Maven release plugin to interact with GitLab:

Jul 22 2024, 4:42 PM · Release-Engineering-Team (Radar), Patch-For-Review, Data-Engineering (Q1 2024 July 1st - September 30th), Java-Scala-Standardization, Discovery-Search, Data-Platform-SRE
amastilovic updated the task description for T369900: Develop Airflow ExternalTaskSensor to orchestrate DAG dependencies.
Jul 22 2024, 2:39 PM · Dumps 2.0 (Kanban Board), Data-Engineering (Q1 2024 July 1st - September 30th)
amastilovic updated the task description for T369900: Develop Airflow ExternalTaskSensor to orchestrate DAG dependencies.
Jul 22 2024, 2:38 PM · Dumps 2.0 (Kanban Board), Data-Engineering (Q1 2024 July 1st - September 30th)

Jul 11 2024

amastilovic updated the task description for T360922: [Status Store] [SPIKE] Investigate and document approach for Iceberg Sensors.
Jul 11 2024, 4:50 PM · Data-Engineering (Q1 2024 July 1st - September 30th), Dumps 2.0 (Kanban Board), Spike

Jul 8 2024

amastilovic added a comment to T360968: [Developer Experience] [SPIKE] Investigate process to automate deployment of folders and artifacts to HDFS.

I've considered the option of pulling from the git origin into the destination HDFS, albeit not using a systemd timer. I've actually done something similar before in previous jobs/roles, by mounting HDFS onto a local file system, but I don't think this is a viable solution for a number of reasons:

Jul 8 2024, 11:34 PM · Data-Engineering (Q1 2024 July 1st - September 30th), Release-Engineering-Team, Spike

Jul 2 2024

amastilovic added a comment to T367391: Setup a test project to validate upload to the Gitlab package registry.

Seconded about the .test in the group ID - do we really need that? I believe the group ID should simply be org.wikimedia and then we can have the test part in the artifact ID.

Jul 2 2024, 4:55 PM · Data-Platform-SRE (2024.08.17 - 2024.09.06), Release-Engineering-Team (Radar), User-brennen, Java-Scala-Standardization

Jun 24 2024

amastilovic added a comment to T368033: Design a suitable DAG deployment method.

In some ways, the fundamental question is: do we want to move to a continuous deployment model, or do we want to retain manual deployment.

Jun 24 2024, 11:05 PM · Data-Platform-SRE (2024.09.06 - 2024.09.27), Data-Engineering
amastilovic added a comment to T368033: Design a suitable DAG deployment method.

My feeling is that, at our scale and given its proximity to the dse-k8s cluster, Ceph will be just fine for these DAG volumes.

Jun 24 2024, 11:04 PM · Data-Platform-SRE (2024.09.06 - 2024.09.27), Data-Engineering

Jun 12 2024

amastilovic set IRC Nick to amastilovic on amastilovic.
Jun 12 2024, 9:50 PM

Jun 10 2024

amastilovic assigned T367116: mw-page-content-change-enrich flink app is missing in k8s staging to gmodena.
Jun 10 2024, 9:24 PM · Data-Platform-SRE (2024.06.17 - 2024.07.07), Data-Engineering, Event-Platform
amastilovic added a comment to T367073: Requesting access to Kubernetes deployment for amastilovic.

Merged and applied - done

Jun 10 2024, 5:37 PM · Data-Engineering, SRE, SRE-Access-Requests
amastilovic created T367073: Requesting access to Kubernetes deployment for amastilovic.
Jun 10 2024, 3:39 PM · Data-Engineering, SRE, SRE-Access-Requests

Jun 6 2024

amastilovic updated the task description for T365659: Implement automatic sync of refinery HQL files to HDFS.
Jun 6 2024, 11:17 PM · Data-Engineering (Q1 2024 July 1st - September 30th)
amastilovic updated subscribers of T360968: [Developer Experience] [SPIKE] Investigate process to automate deployment of folders and artifacts to HDFS.

Q: Have we discussed these ideas with Release Engineering folks? They are currently working on a similar CD project, but it might be MediaWiki focused only.

Jun 6 2024, 11:11 PM · Data-Engineering (Q1 2024 July 1st - September 30th), Release-Engineering-Team, Spike

Jun 5 2024

JAllemandou awarded T360922: [Status Store] [SPIKE] Investigate and document approach for Iceberg Sensors a Love token.
Jun 5 2024, 3:17 PM · Data-Engineering (Q1 2024 July 1st - September 30th), Dumps 2.0 (Kanban Board), Spike

Jun 4 2024

amastilovic updated the task description for T365659: Implement automatic sync of refinery HQL files to HDFS.
Jun 4 2024, 11:19 PM · Data-Engineering (Q1 2024 July 1st - September 30th)
amastilovic closed T365382: Move reportupdater reports away from their local filesystem locations as Resolved.

This ticket has been resolved, the tasks from the ticket definition have been performed on an-launcher1002 (an-launcher1001 has been decommissioned).

Jun 4 2024, 9:28 PM · Data-Engineering (Q4 2024 April 1st - June 30th)

May 22 2024

amastilovic created T365659: Implement automatic sync of refinery HQL files to HDFS.
May 22 2024, 11:10 PM · Data-Engineering (Q1 2024 July 1st - September 30th)
amastilovic added a comment to T354552: [Maintenance] Migrate ReportUpdater browser queries to Airflow.

@lbowmaker this task can be closed.

May 22 2024, 11:05 PM · Patch-For-Review, Data-Engineering (Q4 2024 April 1st - June 30th)
amastilovic updated subscribers of T362699: Update converted reportupdater DAG queries to correct CSV options.

@lbowmaker this task can be closed.

May 22 2024, 11:05 PM · Data-Engineering (Q4 2024 April 1st - June 30th)
amastilovic added a comment to T357372: [Maintenance] Migrate pingback to Airflow.

@lbowmaker this task can be closed.

May 22 2024, 11:05 PM · Data-Engineering (Q4 2024 April 1st - June 30th)
amastilovic added a comment to T357938: [Maintenance] Migrate wmcs to Airflow.

@lbowmaker this task can be closed.

May 22 2024, 11:04 PM · Data-Engineering (Q4 2024 April 1st - June 30th)
amastilovic created T365658: Update Airflow Developer Guide on WikiTech.
May 22 2024, 11:03 PM · Data-Engineering

May 20 2024

amastilovic added a comment to T365201: PHP 8.3 missing (showing as other?) on https://pingback.wmflabs.org/#php-version.

We are attempting to resolve this issue in this ticket: T365382

May 20 2024, 9:18 PM · Data-Engineering, MediaWiki-General
amastilovic added a comment to T365382: Move reportupdater reports away from their local filesystem locations.

No update still - let's wait for a bit and see what happens, the sync refreshment period might be daily as opposed to hourly.

May 20 2024, 8:30 PM · Data-Engineering (Q4 2024 April 1st - June 30th)
amastilovic created T365382: Move reportupdater reports away from their local filesystem locations.
May 20 2024, 5:30 PM · Data-Engineering (Q4 2024 April 1st - June 30th)

May 16 2024

amastilovic added a comment to T364487: Airflow DAG (hdfs_usage_weekly) failed with no details in the application log.

This task can be closed as the issue has been fixed and changes to the DAG have been merged.

May 16 2024, 7:53 PM · Data-Engineering

May 13 2024

amastilovic added a comment to T364487: Airflow DAG (hdfs_usage_weekly) failed with no details in the application log.

What's the longer-term location for the log4j properties file name? Presumably we don't want to leave the file name as aqu-log4j.properties within any folder?

May 13 2024, 7:00 PM · Data-Engineering

May 9 2024

amastilovic added a comment to T364487: Airflow DAG (hdfs_usage_weekly) failed with no details in the application log.

MR to switch to using DagProperties: https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/689

May 9 2024, 7:03 PM · Data-Engineering

May 8 2024

amastilovic added a comment to T364487: Airflow DAG (hdfs_usage_weekly) failed with no details in the application log.

The issue was in the path to the configured log4j.properties file in Airflow UI, hdfs:///user/aqu/aqu-log4j.properties was not accessible by the Airflow user analytics.

May 8 2024, 9:37 PM · Data-Engineering
amastilovic claimed T364487: Airflow DAG (hdfs_usage_weekly) failed with no details in the application log.
May 8 2024, 9:33 PM · Data-Engineering

Apr 26 2024

amastilovic claimed T357938: [Maintenance] Migrate wmcs to Airflow.
Apr 26 2024, 8:58 PM · Data-Engineering (Q4 2024 April 1st - June 30th)

Apr 17 2024

amastilovic created T362832: Migrate refinery HQL files to CI/CD supported GitLab repository.
Apr 17 2024, 10:23 PM · Data-Engineering (Q1 2024 July 1st - September 30th)
amastilovic added a comment to T362699: Update converted reportupdater DAG queries to correct CSV options.

Hi @amastilovic, can you please associate one or more active project tags with this task (via the Add Action...Change Project Tags dropdown)? That will allow to see a task when looking at project workboards or searching for tasks in certain projects, and get notified about a task when watching a related project tag. Thanks!

Apr 17 2024, 10:17 PM · Data-Engineering (Q4 2024 April 1st - June 30th)
amastilovic added a project to T362699: Update converted reportupdater DAG queries to correct CSV options: Data-Engineering (Q4 2024 April 1st - June 30th).
Apr 17 2024, 10:16 PM · Data-Engineering (Q4 2024 April 1st - June 30th)

Apr 16 2024

amastilovic created T362699: Update converted reportupdater DAG queries to correct CSV options.
Apr 16 2024, 5:01 PM · Data-Engineering (Q4 2024 April 1st - June 30th)

Apr 9 2024

amastilovic created T362201: Fix and validate browser report DAG and queries.
Apr 9 2024, 10:43 PM · Data-Engineering (Q4 2024 April 1st - June 30th)

Feb 16 2024

amastilovic created T357803: [Maintenance] Migrate pingback ReportUpdater job.
Feb 16 2024, 6:22 PM · Data-Engineering

Jan 29 2024

amastilovic added a comment to T355606: Requesting analytics-privatedata-users access for amastilovic.

I need access to the following (from the wiki page you provided):

Jan 29 2024, 9:58 PM · Patch-For-Review, SRE, SRE-Access-Requests
amastilovic created T356106: Requesting Kerberos access for amastilovic.
Jan 29 2024, 8:10 PM

Jan 25 2024

amastilovic added a comment to T355607: Grant Access to ldap/wmf for Aleksandar Mastilovic.

@Arnoldokoth @Dzahn thank you!

Jan 25 2024, 8:55 PM · Patch-For-Review, SRE, LDAP-Access-Requests

Jan 22 2024

amastilovic created T355607: Grant Access to ldap/wmf for Aleksandar Mastilovic.
Jan 22 2024, 9:41 PM · Patch-For-Review, SRE, LDAP-Access-Requests
amastilovic created T355606: Requesting analytics-privatedata-users access for amastilovic.
Jan 22 2024, 9:34 PM · Patch-For-Review, SRE, SRE-Access-Requests