
lerickson (Lindsay Erickson)
User

User Details

User Since
Jan 22 2026, 1:46 PM (20 w, 4 d)
Availability
Available
LDAP User
Lerickson
MediaWiki User
LErickson-WMF

Recent Activity

Today

lerickson claimed T421200: wdqs-proxy: event platform integration for query logs.

Claiming what remains of this: registering the stream, integrating with the proxy, finalizing the schema release when we're ready to move it out of dev, and whatever else comes up.

Tue, Jun 16, 2:18 AM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work

Yesterday

lerickson added a comment to T428235: [NEEDS GROOMING] data-reload: create a QLever index using an ephemeral instance.

This will run from the indexing pod, right? Do we need to bundle rclone with the qlever docker image?

Mon, Jun 15, 7:52 PM · Data-Platform-SRE, Wikidata Platform Team, OKR-Work
lerickson moved T429025: data-reload: try to speed up indexing by altering how files are passed in from In Engineering to Needs Sign-off on the Wikidata Platform Team (Sprint 06 (2026/06/02)) board.
Mon, Jun 15, 6:56 PM · OKR-Work, Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson updated subscribers of T429025: data-reload: try to speed up indexing by altering how files are passed in.

Update: this doesn't seem to make a difference.

Mon, Jun 15, 6:55 PM · OKR-Work, Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson added a comment to T428235: [NEEDS GROOMING] data-reload: create a QLever index using an ephemeral instance.

I'm going to dump a couple of thoughts and questions here for grooming purposes (we can discuss more on Tuesday):

Mon, Jun 15, 5:52 PM · Data-Platform-SRE, Wikidata Platform Team, OKR-Work
lerickson added a subtask for T428235: [NEEDS GROOMING] data-reload: create a QLever index using an ephemeral instance: T427030: data-reload: get qlever index into S3, bypassing pipeline.
Mon, Jun 15, 12:59 PM · Data-Platform-SRE, Wikidata Platform Team, OKR-Work
lerickson added a parent task for T427030: data-reload: get qlever index into S3, bypassing pipeline: T428235: [NEEDS GROOMING] data-reload: create a QLever index using an ephemeral instance.
Mon, Jun 15, 12:59 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work

Fri, Jun 12

lerickson set the point value for T429026: data-reload: automate index-building given a set of NT files ready to be indexed to 3.
Fri, Jun 12, 6:40 PM · OKR-Work, Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson claimed T429026: data-reload: automate index-building given a set of NT files ready to be indexed.
Fri, Jun 12, 6:40 PM · OKR-Work, Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson added a project to T429026: data-reload: automate index-building given a set of NT files ready to be indexed: OKR-Work.
Fri, Jun 12, 6:40 PM · OKR-Work, Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson added a project to T429025: data-reload: try to speed up indexing by altering how files are passed in: OKR-Work.
Fri, Jun 12, 6:39 PM · OKR-Work, Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson moved T429026: data-reload: automate index-building given a set of NT files ready to be indexed from Backlog to In Engineering on the Wikidata Platform Team (Sprint 06 (2026/06/02)) board.
Fri, Jun 12, 6:38 PM · OKR-Work, Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson updated the task description for T429025: data-reload: try to speed up indexing by altering how files are passed in.
Fri, Jun 12, 6:33 PM · OKR-Work, Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson moved T429025: data-reload: try to speed up indexing by altering how files are passed in from Backlog to In Engineering on the Wikidata Platform Team (Sprint 06 (2026/06/02)) board.
Fri, Jun 12, 6:32 PM · OKR-Work, Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson created T429026: data-reload: automate index-building given a set of NT files ready to be indexed.
Fri, Jun 12, 1:37 PM · OKR-Work, Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson created T429025: data-reload: try to speed up indexing by altering how files are passed in.
Fri, Jun 12, 1:28 PM · OKR-Work, Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson moved T424218: wdqs-proxy: Improved error handling apart from HTTP response codes from In Engineering to Needs Sign-off on the Wikidata Platform Team (Sprint 06 (2026/06/02)) board.
Fri, Jun 12, 12:28 PM · Patch-For-Review, Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work

Thu, Jun 11

lerickson moved T427030: data-reload: get qlever index into S3, bypassing pipeline from In Engineering to Needs Sign-off on the Wikidata Platform Team (Sprint 06 (2026/06/02)) board.
Thu, Jun 11, 6:41 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson updated subscribers of T427030: data-reload: get qlever index into S3, bypassing pipeline.

An index and metadata file with files & timestamp now exists in the S3 bucket, and latest.json points to it.

Thu, Jun 11, 6:40 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
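
As a rough illustration of the layout described in the T427030 update above (an index plus a metadata file listing files and a timestamp, with latest.json pointing at it), here is a minimal Python sketch. The object keys, the JSON field names, and the plain dict standing in for the S3 bucket are all assumptions made for illustration; this is not the actual script or bucket layout.

```python
import json
from datetime import datetime, timezone


def build_index_metadata(index_files, built_at):
    """Return a metadata document listing the index files and a build timestamp."""
    return {
        "files": sorted(index_files),
        "timestamp": built_at.astimezone(timezone.utc).isoformat(),
    }


def publish_index(bucket, prefix, index_files, built_at):
    """Write the metadata under `prefix`, then repoint latest.json at it.

    `bucket` is a plain dict (key -> JSON string) standing in for an object
    store; the key names used here are hypothetical.
    """
    metadata_key = f"{prefix}/metadata.json"
    bucket[metadata_key] = json.dumps(build_index_metadata(index_files, built_at))
    # Update the pointer last, so a reader never follows latest.json
    # to metadata that has not been written yet.
    bucket["latest.json"] = json.dumps({"prefix": prefix, "metadata": metadata_key})
    return metadata_key


def resolve_latest(bucket):
    """Follow latest.json and return the metadata document it points to."""
    pointer = json.loads(bucket["latest.json"])
    return json.loads(bucket[pointer["metadata"]])
```

A consumer such as an init container could call something like `resolve_latest` to find which index to load, without having to list the bucket.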
lerickson updated the task description for T427030: data-reload: get qlever index into S3, bypassing pipeline.
Thu, Jun 11, 6:25 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson updated the task description for T427030: data-reload: get qlever index into S3, bypassing pipeline.
Thu, Jun 11, 6:22 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work

Mon, Jun 8

lerickson added a comment to T424218: wdqs-proxy: Improved error handling apart from HTTP response codes.

Update on these error codes. I believe that we don't need to worry about many of them.

Mon, Jun 8, 9:12 PM · Patch-For-Review, Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson moved T425036: data-reload: implement wikidata dump to Ceph S3 from In Engineering to Needs Sign-off on the Wikidata Platform Team (Sprint 06 (2026/06/02)) board.
Mon, Jun 8, 12:36 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), Patch-For-Review, OKR-Work
lerickson added a comment to T425036: data-reload: implement wikidata dump to Ceph S3.

The first dump, kicked off last week, is complete, with data ending up in S3. Marking as complete.

Mon, Jun 8, 12:36 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), Patch-For-Review, OKR-Work

Fri, Jun 5

lerickson added a comment to T424218: wdqs-proxy: Improved error handling apart from HTTP response codes.

I've been investigating to see which errors can be returned from qlever. I actually could not find documentation about this, but searching the code worked pretty well.

Fri, Jun 5, 6:56 PM · Patch-For-Review, Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson updated the task description for T424218: wdqs-proxy: Improved error handling apart from HTTP response codes.
Fri, Jun 5, 4:25 PM · Patch-For-Review, Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson moved T428115: wdqs-proxy: add server logs from In Engineering to Needs Sign-off on the Wikidata Platform Team (Sprint 06 (2026/06/02)) board.
Fri, Jun 5, 3:54 PM · Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson added a comment to T428115: wdqs-proxy: add server logs.

Added server logs when exceptions are created. As for the context MDC object I mentioned above, that's not relevant yet. I expect that we will use one later when we have some fields in mind to add to the logs, but for now, nothing left to do here.

Fri, Jun 5, 3:54 PM · Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson updated the task description for T428115: wdqs-proxy: add server logs.
Fri, Jun 5, 3:10 PM · Wikidata Platform Team (Sprint 06 (2026/06/02))

Thu, Jun 4

lerickson set the point value for T428115: wdqs-proxy: add server logs to 1.
Thu, Jun 4, 2:02 AM · Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson moved T428115: wdqs-proxy: add server logs from Backlog to In Engineering on the Wikidata Platform Team (Sprint 06 (2026/06/02)) board.
Thu, Jun 4, 2:01 AM · Wikidata Platform Team (Sprint 06 (2026/06/02))
lerickson created T428115: wdqs-proxy: add server logs.
Thu, Jun 4, 1:39 AM · Wikidata Platform Team (Sprint 06 (2026/06/02))

Wed, Jun 3

lerickson renamed T425036: data-reload: implement wikidata dump to Ceph S3 from data-reload: implement wikidata dump to CephFS to data-reload: implement wikidata dump to Ceph S3.
Wed, Jun 3, 2:54 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), Patch-For-Review, OKR-Work

Tue, Jun 2

lerickson added a comment to T427319: request: s3 access for wdqs::alternatives.

Confirming that I can connect to our S3 bucket from wdqs1030.

Tue, Jun 2, 8:24 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team

Fri, May 29

lerickson added a comment to T425036: data-reload: implement wikidata dump to Ceph S3.

There was a lot of unexpected complexity involved in this task because connecting to S3 in a DAG turned out to be highly nontrivial config-wise. Thanks to a lot of help from SRE in setting up secrets, debugging issues with accessing them, and resolving network access issues preventing data from going to S3, I have a working DAG now that can do a lexemes dump. Sent back the MR for review again, hoping to merge next week.

Fri, May 29, 8:29 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), Patch-For-Review, OKR-Work

Thu, May 28

lerickson updated the task description for T425036: data-reload: implement wikidata dump to Ceph S3.
Thu, May 28, 7:43 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), Patch-For-Review, OKR-Work
lerickson updated the task description for T427319: request: s3 access for wdqs::alternatives.
Thu, May 28, 2:31 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team

Wed, May 27

lerickson set the point value for T424218: wdqs-proxy: Improved error handling apart from HTTP response codes to 3.
Wed, May 27, 7:02 PM · Patch-For-Review, Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson changed the status of T424218: wdqs-proxy: Improved error handling apart from HTTP response codes, a subtask of T422522: WE2.5.8 WDQS v2 technical build, from Open to In Progress.
Wed, May 27, 7:01 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work, Epic
lerickson changed the status of T424218: wdqs-proxy: Improved error handling apart from HTTP response codes from Open to In Progress.
Wed, May 27, 7:01 PM · Patch-For-Review, Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson updated the task description for T427414: wdqs-backend-init: Init container needs to determine the most recent successful index to read from ceph S3.
Wed, May 27, 3:52 PM · OKR-Work, Wikidata Platform Team (Sprint 05 (2026/05/05))
lerickson created T427414: wdqs-backend-init: Init container needs to determine the most recent successful index to read from ceph S3.
Wed, May 27, 3:46 PM · OKR-Work, Wikidata Platform Team (Sprint 05 (2026/05/05))

Tue, May 26

lerickson added a comment to T427030: data-reload: get qlever index into S3, bypassing pipeline.

Update: I've indexed a shard of the import_wikidata_ttl DAG's output and put it in our S3 bucket, along with a tiny metadata file. This was a manual process; I'm working on a script. We also would like S3 access from the WDQS nodes to make this easy to do before the DAG setup is ready: T427319

Tue, May 26, 8:34 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson changed the status of T427030: data-reload: get qlever index into S3, bypassing pipeline from Open to In Progress.
Tue, May 26, 8:32 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson claimed T427030: data-reload: get qlever index into S3, bypassing pipeline.
Tue, May 26, 8:32 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson created T427319: request: s3 access for wdqs::alternatives.
Tue, May 26, 6:53 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team

Fri, May 22

lerickson created T427030: data-reload: get qlever index into S3, bypassing pipeline.
Fri, May 22, 3:12 AM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work

Thu, May 21

lerickson moved T424218: wdqs-proxy: Improved error handling apart from HTTP response codes from Backlog to Ready on the Wikidata Platform Team (Sprint 05 (2026/05/05)) board.
Thu, May 21, 3:28 PM · Patch-For-Review, Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson claimed T424218: wdqs-proxy: Improved error handling apart from HTTP response codes.
Thu, May 21, 3:28 PM · Patch-For-Review, Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson edited projects for T424218: wdqs-proxy: Improved error handling apart from HTTP response codes, added: Wikidata Platform Team (Sprint 05 (2026/05/05)); removed Wikidata Platform Team.
Thu, May 21, 3:27 PM · Patch-For-Review, Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work

Wed, May 20

lerickson added a comment to T426764: Airflow secret for S3 credentials for wikidata-platform.

Thank you! I am trying to use the connection right now in a DAG-in-progress running on my dev instance. When I do s3 = get_s3_client("wikidata_platform_s3_dpe")
I get this error:

Wed, May 20, 2:48 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team
lerickson moved T423104: wdqs-proxy: write protection (reject SPARQL update queries) from In Engineering to Needs Sign-off on the Wikidata Platform Team (Sprint 05 (2026/05/05)) board.
Wed, May 20, 2:01 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work
lerickson added a comment to T423104: wdqs-proxy: write protection (reject SPARQL update queries).

The proxy now rejects updates (well, it actually already did) and diagnoses them as being disallowed writes. Right now it just returns a plain old 400 Bad Request, but later on in T424218 we will add better info. This one is done, though.

Wed, May 20, 2:01 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work
lerickson updated the task description for T423104: wdqs-proxy: write protection (reject SPARQL update queries).
Wed, May 20, 1:59 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work

Tue, May 19

lerickson closed T425973: Requesting Ceph S3 credentials for Wikidata Plaftorm as Resolved.
Tue, May 19, 3:47 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Wikidata Platform Team
lerickson added a comment to T425973: Requesting Ceph S3 credentials for Wikidata Plaftorm.

I filed T426764 to track getting the creds into airflow, since that's really a separate question from what this phab was tracking. This one is all done, IMO.

Tue, May 19, 3:24 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Wikidata Platform Team
lerickson updated the task description for T426764: Airflow secret for S3 credentials for wikidata-platform.
Tue, May 19, 3:17 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team
lerickson created T426764: Airflow secret for S3 credentials for wikidata-platform.
Tue, May 19, 3:15 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Wikidata Platform Team

Mon, May 18

lerickson moved T425035: data-reload: Set up Wikidata Platform team to use Ceph from In Engineering to Needs Sign-off on the Wikidata Platform Team (Sprint 05 (2026/05/05)) board.
Mon, May 18, 6:46 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work
lerickson added a comment to T425035: data-reload: Set up Wikidata Platform team to use Ceph.

We now have a PVC in mediawiki-dumps-legacy. It is called wdqs-update-pipeline.
I have successfully interacted with this PVC in a test airflow instance (created a directory and performed a wikibase lexemes dump).

Mon, May 18, 6:45 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work
lerickson added a comment to T423104: wdqs-proxy: write protection (reject SPARQL update queries).

Good point @trueg that the HTTP method isn't the issue, so 405 isn't as close as I thought it was. I think I'm OK with using 400 given all this context.

Mon, May 18, 1:34 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work

May 13 2026

lerickson updated the task description for T425036: data-reload: implement wikidata dump to Ceph S3.
May 13 2026, 8:36 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), Patch-For-Review, OKR-Work
lerickson added a comment to T423104: wdqs-proxy: write protection (reject SPARQL update queries).

Thank you! I ended up doing something very similar. I like your idea of passing along the cause with an UpdateOperationException and then returning a better HTTP response. I was originally planning to update the response with 403 in mind, but this one (405) is better.

May 13 2026, 8:03 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work
lerickson added a comment to T425035: data-reload: Set up Wikidata Platform team to use Ceph.
  • We likely will not need our own PVC in our namespace. We can probably use S3 for all cross-task storage in our own DAG tasks.
  • We have an S3 bucket!
  • @BTullis will create a PVC for us in mediawiki-dumps-legacy to use for the dump.
May 13 2026, 2:54 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work

May 12 2026

lerickson added a comment to T425973: Requesting Ceph S3 credentials for Wikidata Plaftorm.

@RKemper Thank you so much!
In order to interact with this in a DAG, I understand that these credentials will need to be in an airflow secret, and that is something only SRE can do. Does that belong in a separate ticket? This is the "wikidata" airflow instance.

May 12 2026, 8:32 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Wikidata Platform Team
lerickson added a comment to T423104: wdqs-proxy: write protection (reject SPARQL update queries).

It turns out that the proxy was already rejecting updates, because it created and executed a Query object and in Jena, a Query is used to represent the read-only operations (see Query javadoc with the different QueryTypes). UpdateRequest is used to hold a request to insert/delete and other non-read-only actions.

May 12 2026, 6:26 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work
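
The read-only versus update distinction described in the comment above can be sketched in Python. This is a simplified keyword heuristic, not Jena's parser and not the proxy's code: the function names are hypothetical, and a real parser handles many cases this ignores. The 400 status follows the response code mentioned elsewhere in this feed for rejected updates.

```python
import re

# Leading keywords of the SPARQL query forms (read-only) and update operations.
QUERY_FORMS = {"SELECT", "CONSTRUCT", "ASK", "DESCRIBE"}
UPDATE_FORMS = {"INSERT", "DELETE", "LOAD", "CLEAR", "CREATE",
                "DROP", "COPY", "MOVE", "ADD", "WITH"}

# One PREFIX or BASE declaration from the prologue.
_PROLOGUE = re.compile(
    r"\s*(?:PREFIX\s+[^\s:]*:\s*<[^>]*>|BASE\s+<[^>]*>)", re.IGNORECASE
)
_KEYWORD = re.compile(r"\s*([A-Za-z]+)")


def classify(sparql):
    """Return 'query', 'update', or 'unknown' based on the first keyword."""
    # Drop whole-line comments; inline comments are not handled here.
    text = "\n".join(
        line for line in sparql.splitlines() if not line.lstrip().startswith("#")
    )
    pos = 0
    # Skip any number of PREFIX/BASE declarations.
    while (match := _PROLOGUE.match(text, pos)):
        pos = match.end()
    match = _KEYWORD.match(text, pos)
    keyword = match.group(1).upper() if match else ""
    if keyword in QUERY_FORMS:
        return "query"
    if keyword in UPDATE_FORMS:
        return "update"
    return "unknown"


def rejection_status(sparql):
    """Return 400 for anything that is not a read-only query, else None."""
    return None if classify(sparql) == "query" else 400
```

Returning None for read-only queries is just a convention in this sketch, meaning the request would be passed along rather than rejected.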
lerickson added a comment to T425973: Requesting Ceph S3 credentials for Wikidata Plaftorm.

Update: after talking with Ben, it seems we might use Ceph RGW/S3 for the munge/split step and for the side-effect tables too. I will come back with a different storage estimate when I have more info, but just wanted to say this here for posterity.

May 12 2026, 3:51 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Wikidata Platform Team
lerickson updated the task description for T425466: data-reload: SPIKE: investigate Spark/Kubernetes/Ceph performance.
May 12 2026, 1:51 PM · Wikidata Platform Team, OKR-Work
lerickson renamed T425137: data-reload: set up RDF tables in Ceph, refactor DDL from data-reload: set up RDF tables in CephFS, refactor DDL to data-reload: set up RDF tables in Ceph, refactor DDL.
May 12 2026, 12:52 PM · Wikidata Platform Team, OKR-Work

May 11 2026

lerickson added a comment to T425973: Requesting Ceph S3 credentials for Wikidata Plaftorm.

@gmodena I was not planning to use this bucket for the output of the data processing task. I was planning to store that using a new CephFS PVC. I was, however, planning to use it for the index files. So I guess it depends on what you mean by main/scholarly splits. If you mean the output of munge/split, no. If you mean the index files themselves, yes.

May 11 2026, 9:55 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Wikidata Platform Team
lerickson added a comment to T425035: data-reload: Set up Wikidata Platform team to use Ceph.

I filed T425973 for the S3 bucket bullet above.

May 11 2026, 6:38 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work
lerickson changed the status of T423104: wdqs-proxy: write protection (reject SPARQL update queries), a subtask of T422522: WE2.5.8 WDQS v2 technical build, from Open to In Progress.
May 11 2026, 6:36 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work, Epic
lerickson changed the status of T423104: wdqs-proxy: write protection (reject SPARQL update queries) from Open to In Progress.
May 11 2026, 6:36 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work
lerickson created T425973: Requesting Ceph S3 credentials for Wikidata Plaftorm.
May 11 2026, 4:01 PM · Data-Platform-SRE (2026-04-24 - 2026-05-15), Wikidata Platform Team
lerickson updated subscribers of T425035: data-reload: Set up Wikidata Platform team to use Ceph.

Update on what I've learned so far. The tasks are numbered as follows: 1) Dump, 2) Quality check, 3) Spark processing, 4) Index, 5) Orchestrate reload

May 11 2026, 1:30 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work

May 8 2026

lerickson added a comment to T407702: Multiple deleted items still available in Wikidata Query Service.

Hi again @Yirba, I believe it's all ready now (I don't see the items in WDQS anymore). Thanks again for the report.

May 8 2026, 8:34 PM · Wikidata-Omega (Radar/Epics/Stalled), Wikidata Platform Team (Sprint 03 (2026/03/03)), Essential-Work, Wikidata-Query-Service, Wikidata
lerickson added a comment to T407702: Multiple deleted items still available in Wikidata Query Service.

Thanks @Yirba for the report! I attempted to fix these entities in the same way, but I am unfortunately still seeing them returned in WDQS. I will investigate and get back to you.

May 8 2026, 2:34 AM · Wikidata-Omega (Radar/Epics/Stalled), Wikidata Platform Team (Sprint 03 (2026/03/03)), Essential-Work, Wikidata-Query-Service, Wikidata

May 6 2026

lerickson changed the status of T425036: data-reload: implement wikidata dump to Ceph S3 from Open to In Progress.
May 6 2026, 5:10 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), Patch-For-Review, OKR-Work
lerickson changed the status of T425036: data-reload: implement wikidata dump to Ceph S3, a subtask of T422179: Set up regular bulk data ingestion and indexing pipeline, from Open to In Progress.
May 6 2026, 5:10 PM · Wikidata Platform Team, OKR-Work
lerickson claimed T425036: data-reload: implement wikidata dump to Ceph S3.
May 6 2026, 5:10 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), Patch-For-Review, OKR-Work
lerickson closed T424255: Setup qlever on wdqs2009 as Declined.

Thanks @gmodena, I will close this then!

May 6 2026, 3:28 PM · OKR-Work, Wikidata Platform Team
lerickson updated the task description for T422056: wdqs: database node logs should be pushed to logstash.
May 6 2026, 2:57 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), SRE Observability, OKR-Work
lerickson set the point value for T421200: wdqs-proxy: event platform integration for query logs to 5.
May 6 2026, 2:54 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson added a comment to T424255: Setup qlever on wdqs2009.

@gmodena are we pulling 2009 into the k8s cluster, in which case we can close this?

May 6 2026, 2:51 PM · OKR-Work, Wikidata Platform Team
lerickson closed T423433: Deploy the new WDQS Data Processing tools as Declined.

Closing this - no longer relevant with the plan for an entirely new data pipeline. See T422179

May 6 2026, 2:48 PM · OKR-Work, Wikidata Platform Team
lerickson changed the status of T424896: request: index wikidata-platform gitlab repos from Open to In Progress.
May 6 2026, 2:39 PM · OKR-Work, Wikidata Platform Team (Sprint 05 (2026/05/05)), VPS-project-Codesearch
lerickson edited projects for T424896: request: index wikidata-platform gitlab repos, added: Wikidata Platform Team (Sprint 05 (2026/05/05)); removed Wikidata Platform Team.
May 6 2026, 2:39 PM · OKR-Work, Wikidata Platform Team (Sprint 05 (2026/05/05)), VPS-project-Codesearch
lerickson set the point value for T424338: Provide a helm chart and dse-k8s helmfiles for wdqs-proxy. to 2.
May 6 2026, 2:36 PM · Data-Platform-SRE (2026-06-05 - 2026-06-26), Patch-For-Review, Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson set the point value for T425141: data-reload: preprocess the dump to 5.
May 6 2026, 2:35 PM · Wikidata Platform Team, OKR-Work
lerickson renamed T425137: data-reload: set up RDF tables in Ceph, refactor DDL from data-reload: set up RDF tables in CephFS to data-reload: set up RDF tables in CephFS, refactor DDL.
May 6 2026, 2:33 PM · Wikidata Platform Team, OKR-Work
lerickson set the point value for T425119: data-reload: implement a quality check step on the wikidata dump to 2.
May 6 2026, 2:26 PM · Wikidata Platform Team, OKR-Work
lerickson set the point value for T425036: data-reload: implement wikidata dump to Ceph S3 to 3.
May 6 2026, 2:25 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), Patch-For-Review, OKR-Work
lerickson set the point value for T425467: data-reload: Tune Spark params for the processing stage to 2.
May 6 2026, 2:24 PM · Wikidata Platform Team, OKR-Work
lerickson set the point value for T423104: wdqs-proxy: write protection (reject SPARQL update queries) to 2.
May 6 2026, 2:21 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work
lerickson set the point value for T425035: data-reload: Set up Wikidata Platform team to use Ceph to 2.
May 6 2026, 2:20 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work
lerickson set the point value for T422151: wdqs-streaming-consumer: report update operation metrics to 3.
May 6 2026, 2:16 PM · Wikidata Platform Team (Sprint 05 (2026/05/05)), OKR-Work
lerickson set the point value for T425007: Helm chart for wdqs-qlever and wdqs-streaming-consumer to 5.
May 6 2026, 2:12 PM · Wikidata Platform Team (Sprint 06 (2026/06/02)), OKR-Work
lerickson placed T425466: data-reload: SPIKE: investigate Spark/Kubernetes/Ceph performance up for grabs.
May 6 2026, 2:09 PM · Wikidata Platform Team, OKR-Work
lerickson placed T425467: data-reload: Tune Spark params for the processing stage up for grabs.
May 6 2026, 2:09 PM · Wikidata Platform Team, OKR-Work
lerickson placed T425137: data-reload: set up RDF tables in Ceph, refactor DDL up for grabs.
May 6 2026, 2:08 PM · Wikidata Platform Team, OKR-Work