Page MenuHomePhabricator

lbowmaker (Luke Bowmaker)
User

Projects

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Monday

  • Clear sailing ahead.

User Details

User Since
Sep 21 2021, 7:38 PM (10 w, 3 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
LBowmaker (WMF) [ Global Accounts ]

Recent Activity

Tue, Nov 30

lbowmaker created T296776: [SPIKE] Design Desired Architecture for Similar Users Implementation.
Tue, Nov 30, 8:13 PM · Spike, Generated Data Platform
lbowmaker created T296758: Implement Cassandra Data Loader in Airflow.
Tue, Nov 30, 4:38 PM · Generated Data Platform (Product Roadmap)

Wed, Nov 24

lbowmaker added a comment to T295483: [SPIKE] Investigate and Decide on solution for Airflow > Cassandra for Monthly Image Recs Dataset.

Linked ticket: https://phabricator.wikimedia.org/T281517

Wed, Nov 24, 2:30 PM · Generated Data Platform, Spike

Tue, Nov 16

lbowmaker added a comment to T295483: [SPIKE] Investigate and Decide on solution for Airflow > Cassandra for Monthly Image Recs Dataset.

Subscribed DE - please add any comments/questions/suggestions.

Tue, Nov 16, 2:48 PM · Generated Data Platform, Spike
lbowmaker updated subscribers of T295483: [SPIKE] Investigate and Decide on solution for Airflow > Cassandra for Monthly Image Recs Dataset.
Tue, Nov 16, 2:48 PM · Generated Data Platform, Spike

Mon, Nov 15

lbowmaker updated the task description for T295337: Support Ad-hoc run of Image Recs Data Pipeline.
Mon, Nov 15, 3:19 PM · Generated Data Platform

Wed, Nov 10

lbowmaker added a comment to T293648: Content Translation Recommendations API.

Does this require data persistence of the suggestions or are they done on the fly?

Wed, Nov 10, 8:56 PM · Foundational Technology Requests
lbowmaker updated subscribers of T295483: [SPIKE] Investigate and Decide on solution for Airflow > Cassandra for Monthly Image Recs Dataset.
Wed, Nov 10, 4:06 PM · Generated Data Platform, Spike
lbowmaker added a project to T295485: [SPIKE] Investigate Approach for Shipping Airflow/Data Pipeline Metrics: Spike.
Wed, Nov 10, 4:06 PM · Spike, Generated Data Platform
lbowmaker created T295485: [SPIKE] Investigate Approach for Shipping Airflow/Data Pipeline Metrics.
Wed, Nov 10, 4:06 PM · Spike, Generated Data Platform
lbowmaker moved T295483: [SPIKE] Investigate and Decide on solution for Airflow > Cassandra for Monthly Image Recs Dataset from Product Roadmap to Investigate 🔍 on the Generated Data Platform board.
Wed, Nov 10, 3:58 PM · Generated Data Platform, Spike
lbowmaker created T295483: [SPIKE] Investigate and Decide on solution for Airflow > Cassandra for Monthly Image Recs Dataset.
Wed, Nov 10, 3:58 PM · Generated Data Platform, Spike
lbowmaker updated the task description for T281517: 📊[PLACEHOLDER] We should implement a data loader for Cassandra.
Wed, Nov 10, 2:42 PM · Generated Data Platform, Image-Suggestion-API, Image-Suggestions

Tue, Nov 9

lbowmaker moved T295405: Implement Image Recommendations Schema in Cassandra from Product Roadmap to Ready/Groomed 📚 on the Generated Data Platform board.
Tue, Nov 9, 8:57 PM · Generated Data Platform
lbowmaker created T295405: Implement Image Recommendations Schema in Cassandra.
Tue, Nov 9, 8:57 PM · Generated Data Platform
lbowmaker reassigned T295360: Data pipelines skeleton should be generated from a template from lbowmaker to gmodena.
Tue, Nov 9, 4:15 PM · Generated Data Platform
lbowmaker moved T293808: Design Image Recommendations Schema from Work in Progress ⚙️ to QA/Review ❓ on the Generated Data Platform board.
Tue, Nov 9, 4:03 PM · Generated Data Platform
lbowmaker moved T292747: Define and Create Logging Routines - Airflow UI from Ready/Groomed 📚 to Backlog on the Generated Data Platform board.
Tue, Nov 9, 3:06 PM · Airflow, Generated Data Platform
lbowmaker moved T280585: 📊Image Matching experiments should be deterministic from Backlog to Ready/Groomed 📚 on the Generated Data Platform board.
Tue, Nov 9, 2:53 PM · Generated Data Platform, Image-Suggestion-API, Image-Suggestions, Platform Team Workboards (Image Suggestion API)
lbowmaker added a project to T280585: 📊Image Matching experiments should be deterministic : Generated Data Platform.
Tue, Nov 9, 2:53 PM · Generated Data Platform, Image-Suggestion-API, Image-Suggestions, Platform Team Workboards (Image Suggestion API)
lbowmaker moved T292220: Define and Document Coding Standards from Backlog to Ready/Groomed 📚 on the Generated Data Platform board.
Tue, Nov 9, 2:37 PM · Documentation, Generated Data Platform
lbowmaker moved T295338: [SPIKE] Investigate More Frequent Runs of Image Recs Data Pipeline from Investigate 🔍 to Backlog on the Generated Data Platform board.
Tue, Nov 9, 2:37 PM · Generated Data Platform, Spike
lbowmaker moved T295337: Support Ad-hoc run of Image Recs Data Pipeline from Backlog to Ready/Groomed 📚 on the Generated Data Platform board.
Tue, Nov 9, 2:23 PM · Generated Data Platform
lbowmaker set Due Date to Tue, Nov 30, 5:00 AM on T295360: Data pipelines skeleton should be generated from a template.
Tue, Nov 9, 2:11 PM · Generated Data Platform
lbowmaker moved T292738: Define and Document Workflow Process from Sign-off ✔️ to Done 🎊 on the Generated Data Platform board.
Tue, Nov 9, 2:06 PM · Documentation, Generated Data Platform
lbowmaker moved T292738: Define and Document Workflow Process from QA/Review ❓ to Sign-off ✔️ on the Generated Data Platform board.
Tue, Nov 9, 2:05 PM · Documentation, Generated Data Platform
lbowmaker moved T292738: Define and Document Workflow Process from Work in Progress ⚙️ to QA/Review ❓ on the Generated Data Platform board.
Tue, Nov 9, 2:05 PM · Documentation, Generated Data Platform
lbowmaker added a comment to T292738: Define and Document Workflow Process.

First draft of process:

Tue, Nov 9, 2:05 PM · Documentation, Generated Data Platform
lbowmaker moved T292748: [SPIKE] Create Generic Components for Scheduling from Backlog to Ready/Groomed 📚 on the Generated Data Platform board.
Tue, Nov 9, 2:03 PM · Generated Data Platform
lbowmaker moved T295360: Data pipelines skeleton should be generated from a template from Backlog to Investigate 🔍 on the Generated Data Platform board.
Tue, Nov 9, 2:02 PM · Generated Data Platform
lbowmaker renamed T292748: [SPIKE] Create Generic Components for Scheduling from Create Generic Components for Scheduling (needs grooming to be more tasks) to [SPIKE] Create Generic Components for Scheduling.
Tue, Nov 9, 2:02 PM · Generated Data Platform
lbowmaker added a comment to T295360: Data pipelines skeleton should be generated from a template.

@gmodena - I think it would be useful to generate an airflow DAG skeleton too, at least a simple example of how to execute the data pipeline code.

Tue, Nov 9, 1:57 PM · Generated Data Platform
lbowmaker moved T294468: [SPIKE] Decide on best approach for API access to Cassandra from Backlog to Ready/Groomed 📚 on the Generated Data Platform board.
Tue, Nov 9, 1:41 PM · Spike, Generated Data Platform
lbowmaker reassigned T294468: [SPIKE] Decide on best approach for API access to Cassandra from lbowmaker to Eevans.
Tue, Nov 9, 1:41 PM · Spike, Generated Data Platform
lbowmaker moved T292743: Create Code Repo and Structure from Sign-off ✔️ to Done 🎊 on the Generated Data Platform board.
Tue, Nov 9, 1:34 PM · Generated Data Platform
lbowmaker moved T292743: Create Code Repo and Structure from QA/Review ❓ to Sign-off ✔️ on the Generated Data Platform board.
Tue, Nov 9, 1:34 PM · Generated Data Platform
lbowmaker added a comment to T292743: Create Code Repo and Structure.

Created this placeholder for our backlog in case there are any comments that require further work:

Tue, Nov 9, 1:34 PM · Generated Data Platform
lbowmaker moved T295364: [PLACEHOLDER] Pipelines Rep Structure Changes after RFC from Product Roadmap to Backlog on the Generated Data Platform board.
Tue, Nov 9, 1:33 PM · Generated Data Platform
lbowmaker created T295364: [PLACEHOLDER] Pipelines Rep Structure Changes after RFC.
Tue, Nov 9, 1:33 PM · Generated Data Platform
lbowmaker moved T292741: Define and Implement CI Checks from Sign-off ✔️ to Done 🎊 on the Generated Data Platform board.
Tue, Nov 9, 1:29 PM · Generated Data Platform
lbowmaker moved T293382: [SPIKE] Investigate Different CI Checks from Sign-off ✔️ to Done 🎊 on the Generated Data Platform board.
Tue, Nov 9, 1:29 PM · Spike, Generated Data Platform
lbowmaker moved T293382: [SPIKE] Investigate Different CI Checks from QA/Review ❓ to Sign-off ✔️ on the Generated Data Platform board.
Tue, Nov 9, 1:28 PM · Spike, Generated Data Platform
lbowmaker moved T292741: Define and Implement CI Checks from QA/Review ❓ to Sign-off ✔️ on the Generated Data Platform board.
Tue, Nov 9, 1:28 PM · Generated Data Platform
lbowmaker updated subscribers of T295338: [SPIKE] Investigate More Frequent Runs of Image Recs Data Pipeline.

Notes from chat with @gmodena and @JAllemandou:

Tue, Nov 9, 1:17 PM · Generated Data Platform, Spike
lbowmaker moved T295338: [SPIKE] Investigate More Frequent Runs of Image Recs Data Pipeline from Product Roadmap to Investigate 🔍 on the Generated Data Platform board.
Tue, Nov 9, 12:42 AM · Generated Data Platform, Spike
lbowmaker created T295338: [SPIKE] Investigate More Frequent Runs of Image Recs Data Pipeline.
Tue, Nov 9, 12:41 AM · Generated Data Platform, Spike
lbowmaker moved T295337: Support Ad-hoc run of Image Recs Data Pipeline from Product Roadmap to Backlog on the Generated Data Platform board.
Tue, Nov 9, 12:26 AM · Generated Data Platform
lbowmaker created T295337: Support Ad-hoc run of Image Recs Data Pipeline.
Tue, Nov 9, 12:21 AM · Generated Data Platform

Thu, Nov 4

lbowmaker added a comment to T293382: [SPIKE] Investigate Different CI Checks.

Data Pipeline (Lives in project name in repo) Checks:

Thu, Nov 4, 1:18 PM · Spike, Generated Data Platform

Nov 3 2021

lbowmaker moved T294468: [SPIKE] Decide on best approach for API access to Cassandra from Investigate 🔍 to Backlog on the Generated Data Platform board.
Nov 3 2021, 1:02 PM · Spike, Generated Data Platform
lbowmaker updated the task description for T294468: [SPIKE] Decide on best approach for API access to Cassandra.
Nov 3 2021, 1:01 PM · Spike, Generated Data Platform

Nov 1 2021

lbowmaker added a comment to T293808: Design Image Recommendations Schema.

@kostajh - what would happen if a page title changes between the image rec output and someone viewing the image rec then calling the API?

Nov 1 2021, 1:44 PM · Generated Data Platform

Oct 27 2021

lbowmaker moved T294480: Implement Image Recommendations DAG Performance Metrics from Product Roadmap to Backlog on the Generated Data Platform board.
Oct 27 2021, 7:12 PM · Generated Data Platform
lbowmaker created T294480: Implement Image Recommendations DAG Performance Metrics.
Oct 27 2021, 7:12 PM · Generated Data Platform
lbowmaker moved T294479: [SPIKE] Decide best approach for storage and visualization of Image Recs Metrics from Product Roadmap to Investigate 🔍 on the Generated Data Platform board.
Oct 27 2021, 7:06 PM · Generated Data Platform, Spike
lbowmaker created T294479: [SPIKE] Decide best approach for storage and visualization of Image Recs Metrics.
Oct 27 2021, 7:05 PM · Generated Data Platform, Spike
lbowmaker moved T294478: Implement Image Recommendations Algorithm Performance Metrics from Product Roadmap to Backlog on the Generated Data Platform board.
Oct 27 2021, 7:01 PM · Generated Data Platform
lbowmaker created T294478: Implement Image Recommendations Algorithm Performance Metrics.
Oct 27 2021, 7:00 PM · Generated Data Platform
lbowmaker moved T292747: Define and Create Logging Routines - Airflow UI from Backlog to Ready/Groomed 📚 on the Generated Data Platform board.
Oct 27 2021, 6:54 PM · Airflow, Generated Data Platform
lbowmaker renamed T292747: Define and Create Logging Routines - Airflow UI from Define and Create Logging Routines to Define and Create Logging Routines - Airflow UI.
Oct 27 2021, 6:54 PM · Airflow, Generated Data Platform
lbowmaker created T294468: [SPIKE] Decide on best approach for API access to Cassandra.
Oct 27 2021, 6:18 PM · Spike, Generated Data Platform
lbowmaker added a comment to T292743: Create Code Repo and Structure.

Adding some comments from the grooming session today:

Oct 27 2021, 2:48 PM · Generated Data Platform

Oct 26 2021

lbowmaker added a comment to T292743: Create Code Repo and Structure.

Let's say a dataset producer only cares about image recs and has no involvement in similar users.

Oct 26 2021, 7:20 PM · Generated Data Platform
lbowmaker updated subscribers of T292747: Define and Create Logging Routines - Airflow UI.

@gmodena - my thoughts for this task was to do something simple to also support a basic use case for a dataset producer/platform engineer. For example, 'I expected my Airflow DAG to generate a file, I can't see the file so I'll check for any errors in Airflow and the UI will show me the stack trace'.

Oct 26 2021, 7:13 PM · Airflow, Generated Data Platform

Oct 21 2021

lbowmaker updated subscribers of T293808: Design Image Recommendations Schema.

@Clarakosi @gmodena - Are you able to answer the first question above? Seems like there isn't a good reason to store page title? Not sure if there was any reasoning to it in the original requirements.

Oct 21 2021, 7:51 PM · Generated Data Platform

Oct 20 2021

lbowmaker set the image for Generated Data Platform (Product Roadmap) to F34701428: profile.
Oct 20 2021, 1:29 PM
lbowmaker set the image for Generated Data Platform to F34701424: profile.
Oct 20 2021, 1:28 PM
lbowmaker added a project to T293889: Event Driven Services (to be groomed and tasks added): Epic.
Oct 20 2021, 1:16 PM · Epic, Generated Data Platform (Product Roadmap)
lbowmaker created T293891: Dataset Access and Querying (to be groomed and tasks added).
Oct 20 2021, 1:15 PM · Epic, Generated Data Platform (Product Roadmap)
lbowmaker added a project to T293890: Dataset Discovery (to be groomed and tasks added): Epic.
Oct 20 2021, 1:15 PM · Epic, Generated Data Platform (Product Roadmap)
lbowmaker created T293890: Dataset Discovery (to be groomed and tasks added).
Oct 20 2021, 1:12 PM · Epic, Generated Data Platform (Product Roadmap)
lbowmaker created T293889: Event Driven Services (to be groomed and tasks added).
Oct 20 2021, 1:10 PM · Epic, Generated Data Platform (Product Roadmap)
lbowmaker created T293888: Human in the Loop Data Interaction (to be groomed and tasks added).
Oct 20 2021, 1:04 PM · Epic, Generated Data Platform (Product Roadmap)
lbowmaker updated the task description for T293887: Event Streams (to be groomed and tasks added).
Oct 20 2021, 1:00 PM · Epic, Generated Data Platform (Product Roadmap)
lbowmaker created T293887: Event Streams (to be groomed and tasks added).
Oct 20 2021, 12:58 PM · Epic, Generated Data Platform (Product Roadmap)
lbowmaker added a comment to T293386: <Product Research> WikiWho Migration.

Some additional comments on the database side after discussion with SRE and Platform team today:

Oct 20 2021, 1:42 AM · Education-Program-Dashboard, Who-Wrote-That, XTools, Community-Tech, Foundational Technology Requests

Oct 19 2021

lbowmaker moved T292218: Implement Automated Batch Execution for Non-Platform Engineering from Backlog to Now (Deliver) on the Generated Data Platform (Product Roadmap) board.
Oct 19 2021, 9:12 PM · Generated Data Platform (Product Roadmap), Epic
lbowmaker moved T293807: Data Persistence for Image Recommendations from Backlog to Now (Deliver) on the Generated Data Platform (Product Roadmap) board.
Oct 19 2021, 9:12 PM · Generated Data Platform (Product Roadmap), Epic
lbowmaker moved T292218: Implement Automated Batch Execution for Non-Platform Engineering from Product Roadmap to Product Roadmap on the Generated Data Platform board.
Oct 19 2021, 9:09 PM · Generated Data Platform (Product Roadmap), Epic
lbowmaker moved T293807: Data Persistence for Image Recommendations from Product Roadmap to Product Roadmap on the Generated Data Platform board.
Oct 19 2021, 9:09 PM · Generated Data Platform (Product Roadmap), Epic
lbowmaker created Generated Data Platform (Product Roadmap).
Oct 19 2021, 9:05 PM
lbowmaker moved T292745: Deploy POC Script to Automated Execution System from Product Roadmap to Backlog on the Generated Data Platform board.
Oct 19 2021, 6:55 PM · Generated Data Platform
lbowmaker moved T292748: [SPIKE] Create Generic Components for Scheduling from Product Roadmap to Backlog on the Generated Data Platform board.
Oct 19 2021, 6:55 PM · Generated Data Platform
lbowmaker moved T292744: Setup Script Scheduling Tool (needs grooming to more tasks) from Product Roadmap to Backlog on the Generated Data Platform board.
Oct 19 2021, 6:55 PM · Generated Data Platform
lbowmaker moved T292747: Define and Create Logging Routines - Airflow UI from Product Roadmap to Backlog on the Generated Data Platform board.
Oct 19 2021, 6:55 PM · Airflow, Generated Data Platform
lbowmaker moved T292220: Define and Document Coding Standards from Product Roadmap to Backlog on the Generated Data Platform board.
Oct 19 2021, 6:54 PM · Documentation, Generated Data Platform
lbowmaker moved T292739: Define and Document Deployment Process from Product Roadmap to Backlog on the Generated Data Platform board.
Oct 19 2021, 6:54 PM · Documentation, Generated Data Platform
lbowmaker moved T293808: Design Image Recommendations Schema from Backlog to Work in Progress ⚙️ on the Generated Data Platform board.
Oct 19 2021, 6:22 PM · Generated Data Platform
lbowmaker moved T293809: Define Data Management Process from Backlog to Work in Progress ⚙️ on the Generated Data Platform board.
Oct 19 2021, 6:22 PM · Generated Data Platform
lbowmaker reassigned T293809: Define Data Management Process from lbowmaker to Eevans.
Oct 19 2021, 6:21 PM · Generated Data Platform
lbowmaker reassigned T293808: Design Image Recommendations Schema from lbowmaker to Eevans.
Oct 19 2021, 6:16 PM · Generated Data Platform
lbowmaker updated the task description for T293808: Design Image Recommendations Schema.
Oct 19 2021, 6:06 PM · Generated Data Platform
lbowmaker updated the task description for T293807: Data Persistence for Image Recommendations.
Oct 19 2021, 5:59 PM · Generated Data Platform (Product Roadmap), Epic
lbowmaker updated the task description for T293808: Design Image Recommendations Schema.
Oct 19 2021, 5:35 PM · Generated Data Platform
lbowmaker updated the task description for T293809: Define Data Management Process.
Oct 19 2021, 5:34 PM · Generated Data Platform
lbowmaker updated the task description for T293807: Data Persistence for Image Recommendations.
Oct 19 2021, 5:33 PM · Generated Data Platform (Product Roadmap), Epic
lbowmaker updated the task description for T293807: Data Persistence for Image Recommendations.
Oct 19 2021, 4:24 PM · Generated Data Platform (Product Roadmap), Epic
lbowmaker removed a parent task for T293809: Define Data Management Process: T293808: Design Image Recommendations Schema.
Oct 19 2021, 4:24 PM · Generated Data Platform
lbowmaker removed a subtask for T293808: Design Image Recommendations Schema: T293809: Define Data Management Process.
Oct 19 2021, 4:24 PM · Generated Data Platform
lbowmaker added a subtask for T293807: Data Persistence for Image Recommendations: T293809: Define Data Management Process.
Oct 19 2021, 4:24 PM · Generated Data Platform (Product Roadmap), Epic