Page MenuHomePhabricator

dcausse (David Causse)
User

Projects

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Monday

  • Clear sailing ahead.

User Details

User Since
Jun 9 2015, 9:03 AM (463 w, 4 d)
Availability
Available
IRC Nick
dcausse
LDAP User
DCausse
MediaWiki User
DCausse (WMF) [ Global Accounts ]

Recent Activity

Yesterday

dcausse added projects to T363521: Completion suggester can promote a bad build: serviceops, CirrusSearch.

tagging @serviceops for help regarding the connectivity issue and this new delayed connect error: 113 error

Fri, Apr 26, 1:05 PM · serviceops-radar, CirrusSearch, Discovery-Search
dcausse lowered the priority of T363516: Many search suggestions missing when connecting to eqiad, but not when connecting to codfw from Unbreak Now! to Medium.

completion traffic is now served from codfw which has proper indices, lowering prio

Fri, Apr 26, 10:03 AM · CirrusSearch, Discovery-Search (Current work), Patch-For-Review
dcausse triaged T363516: Many search suggestions missing when connecting to eqiad, but not when connecting to codfw as Unbreak Now! priority.

This is still happening, raising to UBN

Fri, Apr 26, 9:16 AM · CirrusSearch, Discovery-Search (Current work), Patch-For-Review
dcausse added a comment to T363521: Completion suggester can promote a bad build.

The errors "delayed connect error: 113" seem to have started on apr 24 21:30 right after deploying https://gerrit.wikimedia.org/r/c/operations/puppet/+/1023937.
The errors affect both mw@wikikube and mwmaint1002 https://logstash.wikimedia.org/goto/5ac680b477389129ffb5ddf33fa09940
I think we should switch completion traffic to codfw while we work on a more resilient version of this maint script and also understand why we get these errors.

Fri, Apr 26, 9:05 AM · serviceops-radar, CirrusSearch, Discovery-Search

Wed, Apr 24

dcausse claimed T362508: WDQS updater misbehaving in codfw.
Wed, Apr 24, 7:33 AM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse added a comment to T359215: mediawiki_cirrussearch_request data is regularly late.

Quick update that some fix has been deployed two weeks ago (T359580#9699108) to stop pushing these late events.

Wed, Apr 24, 7:18 AM · Performance Issue, Data-Platform

Tue, Apr 23

dcausse claimed T362060: Generalize ScholarlyArticleSplitter.
Tue, Apr 23, 7:17 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata

Fri, Apr 19

dcausse updated the task description for T362977: WDQS updater missed some updates.
Fri, Apr 19, 4:38 PM · Data-Engineering, Data-Platform, Wikidata, Wikidata-Query-Service
dcausse awarded T336443: Investigate performance differences between wdqs2022 and older hosts a Love token.
Fri, Apr 19, 4:30 PM · Data-Platform-SRE (2024.04.15 - 2024.05.05)
dcausse added a comment to T120242: Eventually-Consistent MediaWiki state change events | MediaWiki events as source of truth.

I think there are two issues to be discussed here. Defining qualitative requirements and how to repair inconsistencies.
Regarding qualitative requirements, for search and WDQS we don't have a good sense of what would be good enough. the only visible criteria we have at the moment is when users complain about stale data but without concrete measurement of the instability it is hard to define a number I guess. Could we do the other way around by starting to measure how consistent the streams are compared to the source of truth? Could this be done for some important streams like revision-create/page-delete/page-undelete/page-state by applying similar techniques than the one used in T215001#7523796? It is probable that missed events are rare in normal conditions but I still see huge spikes in the logs with many events failing to reach event-gate (T362977), could there be ways to improve the situation at a reasonable cost?

Fri, Apr 19, 1:17 PM · Data-Engineering, Analytics, DBA, WMF-Architecture-Team, Platform Team Legacy (Later), Event-Platform, Services (later)
dcausse created T362977: WDQS updater missed some updates.
Fri, Apr 19, 12:31 PM · Data-Engineering, Data-Platform, Wikidata, Wikidata-Query-Service

Tue, Apr 16

dcausse updated the task description for T362508: WDQS updater misbehaving in codfw.
Tue, Apr 16, 4:34 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse updated the task description for T362508: WDQS updater misbehaving in codfw.
Tue, Apr 16, 12:49 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse updated the task description for T362508: WDQS updater misbehaving in codfw.
Tue, Apr 16, 12:30 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse created P60589 Checkpoint read timeout from object store.
Tue, Apr 16, 9:12 AM

Mon, Apr 15

dcausse created T362508: WDQS updater misbehaving in codfw.
Mon, Apr 15, 8:01 AM · Patch-For-Review, Discovery-Search (Current work), Wikidata

Tue, Apr 9

dcausse updated the task description for T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs.
Tue, Apr 9, 4:29 PM · Discovery-Search (Current work), Wikidata
dcausse updated the task description for T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs.
Tue, Apr 9, 4:28 PM · Discovery-Search (Current work), Wikidata

Mon, Apr 8

dcausse created T362074: WDQS wikibase:around sometimes ignore exact matches.
Mon, Apr 8, 1:11 PM · Wikidata, Wikidata-Query-Service
dcausse added a subtask for T337013: [Epic] Splitting the graph in WDQS: T362060: Generalize ScholarlyArticleSplitter.
Mon, Apr 8, 11:48 AM · Discovery-Search (Current work), Epic, Wikidata-Query-Service, Wikidata
dcausse added a parent task for T362060: Generalize ScholarlyArticleSplitter: T337013: [Epic] Splitting the graph in WDQS.
Mon, Apr 8, 11:48 AM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse created T362060: Generalize ScholarlyArticleSplitter.
Mon, Apr 8, 11:48 AM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse reopened T361305: decommission elastic20[37-54].codfw.wmnet as "Open".

Reopening since it seems some of these hosts are still mentioned somewhere. The elastic settings check is complaining with CRITICAL - ['elastic2047.codfw.wmnet:9500', 'elastic2052.codfw.wmnet:9500', 'elastic2073.codfw.wmnet:9500', 'elastic2086.codfw.wmnet:9500', 'elastic2092.codfw.wmnet:9500', 'elastic2100.codfw.wmnet:9500', 'elastic2106.codfw.wmnet:9500'] does not match ['elastic2073.codfw.wmnet:9500', 'elastic2086.codfw.wmnet:9500', 'elastic2092.codfw.wmnet:9500', 'elastic2100.codfw.wmnet:9500', 'elastic2106.codfw.wmnet:9500']

Mon, Apr 8, 8:14 AM · SRE, ops-codfw, decommission-hardware
dcausse reopened T361305: decommission elastic20[37-54].codfw.wmnet, a subtask of T358882: Decommission elastic2037-2054, as Open.
Mon, Apr 8, 8:13 AM · Data-Platform-SRE (2024.03.25 - 2024.04.14)

Fri, Apr 5

dcausse moved T349911: Explore the feasibility of using SPARQL federation for scholia queries from Blocked/Waiting to Needs Reporting on the Discovery-Search (Current work) board.

Two scholia queries were rewritten:

The pages also contains some documentation about to approach such rewrites.
I'm boldly moving this ticket to our Needs Reporting (prior to be closed) column as I believe further explorations about how to rewrite scholia queries to support the split could perhaps be better handled in https://github.com/WDscholia/scholia.

Fri, Apr 5, 3:39 PM · Discovery-Search (Current work), Wikidata, Wikidata-Query-Service
dcausse added a subtask for T337013: [Epic] Splitting the graph in WDQS: T361950: Ensure that WDQS query throttling does not interfere with federation.
Fri, Apr 5, 3:29 PM · Discovery-Search (Current work), Epic, Wikidata-Query-Service, Wikidata
dcausse added a parent task for T361950: Ensure that WDQS query throttling does not interfere with federation: T337013: [Epic] Splitting the graph in WDQS.
Fri, Apr 5, 3:29 PM · Discovery-Search (Current work), Wikidata
dcausse renamed T361950: Ensure that WDQS query throttling does not interfere with federation from Ensure that WDQS query throttling do not interfere with federation to Ensure that WDQS query throttling does not interfere with federation.
Fri, Apr 5, 3:29 PM · Discovery-Search (Current work), Wikidata
dcausse created T361950: Ensure that WDQS query throttling does not interfere with federation.
Fri, Apr 5, 3:29 PM · Discovery-Search (Current work), Wikidata
dcausse updated the task description for T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs.
Fri, Apr 5, 12:55 PM · Discovery-Search (Current work), Wikidata
dcausse added a subtask for T337013: [Epic] Splitting the graph in WDQS: T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs.
Fri, Apr 5, 12:53 PM · Discovery-Search (Current work), Epic, Wikidata-Query-Service, Wikidata
dcausse added a parent task for T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs: T337013: [Epic] Splitting the graph in WDQS.
Fri, Apr 5, 12:53 PM · Discovery-Search (Current work), Wikidata
dcausse created T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs.
Fri, Apr 5, 12:52 PM · Discovery-Search (Current work), Wikidata

Thu, Apr 4

dcausse added a comment to T361114: Alert Search Platform and/or DPE SRE when Wikidata is lagged.

Thanks! I'm not very familiar with alerts being set from grafana neither, I'll try to get more info on this, worst case we can always set up a new one directly in alertmanager just for the wdqs lag and sent to the search team using the same formula used by updateQueryServiceLag.php.

Thu, Apr 4, 12:29 PM · Data-Platform-SRE (2024.04.15 - 2024.05.05), Wikidata, Wikidata-Query-Service
dcausse placed T361114: Alert Search Platform and/or DPE SRE when Wikidata is lagged up for grabs.

@Lucas_Werkmeister_WMDE thanks! Do you know where we could update this to include our alert email for such alerts?

Thu, Apr 4, 9:34 AM · Data-Platform-SRE (2024.04.15 - 2024.05.05), Wikidata, Wikidata-Query-Service
dcausse moved T355451: Update URLs on MediaWiki:Elastica-desc from To Be Deployed to Needs Reporting on the Discovery-Search (Current work) board.
Thu, Apr 4, 9:30 AM · MW-1.42-notes (1.42.0-wmf.22; 2024-03-12), Discovery-Search (Current work), Elasticsearch
dcausse moved T360993: WDQS lag propagation to wikidata not working as intended from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Thu, Apr 4, 9:29 AM · Data-Platform-SRE (2024.04.15 - 2024.05.05), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Patch-For-Review, Wikidata, Discovery-Search (Current work)
dcausse moved T357966: Document limitations of blazegraph federation from Needs review to Needs Reporting on the Discovery-Search (Current work) board.
Thu, Apr 4, 9:29 AM · Discovery-Search (Current work), Wikidata
dcausse updated subscribers of T359580: CirrusSearch should not send outdated cirrussearch-request events.

According to @Urbanecm_WMF these queries are probably emitted while running https://github.com/wikimedia/mediawiki-extensions-GrowthExperiments/blob/master/maintenance/refreshLinkRecommendations.php.
Discussing possible fixes it would be ideal if cirrus could detect that it is being run via a maint script and possibly call something like disablePoolCountersAndLogging but perhaps without disabling statsd since the user script might require stats to be emitted.

Thu, Apr 4, 9:15 AM · Discovery-Search (Current work), CirrusSearch

Wed, Apr 3

dcausse moved T353683: Unable to find a file by filename while adding a Commons media file statement from Needs review to Needs Reporting on the Discovery-Search (Current work) board.

Should be working properly now

Wed, Apr 3, 5:35 PM · MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Structured-Data-Backlog, SDAW-MediaSearch, Discovery-Search (Current work), CirrusSearch, Wikidata

Fri, Mar 29

dcausse closed T361106: Restore wdqs1013 with a data transfer as Declined.

won't be required after all

Fri, Mar 29, 8:49 AM · Data-Platform-SRE (2024.03.25 - 2024.04.14), Wikidata, Discovery-Search (Current work), Wikidata-Query-Service
dcausse closed T361106: Restore wdqs1013 with a data transfer, a subtask of T360993: WDQS lag propagation to wikidata not working as intended, as Declined.
Fri, Mar 29, 8:47 AM · Data-Platform-SRE (2024.04.15 - 2024.05.05), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Patch-For-Review, Wikidata, Discovery-Search (Current work)

Thu, Mar 28

dcausse updated the task description for T361246: scap deploy should not repool a wdqs node that is depooled.
Thu, Mar 28, 6:25 PM · Release-Engineering-Team, Data-Platform-SRE, Scap, Wikidata, Wikidata-Query-Service
dcausse moved T361106: Restore wdqs1013 with a data transfer from Backlog to Blocked / Waiting on the Data-Platform-SRE (2024.03.25 - 2024.04.14) board.

I restarted the updater on wdqs1013 and it's catching up, I have a note to check the status tomorrow and will repool it if necessary.

Thu, Mar 28, 6:12 PM · Data-Platform-SRE (2024.03.25 - 2024.04.14), Wikidata, Discovery-Search (Current work), Wikidata-Query-Service
dcausse updated the task description for T361246: scap deploy should not repool a wdqs node that is depooled.
Thu, Mar 28, 4:39 PM · Release-Engineering-Team, Data-Platform-SRE, Scap, Wikidata, Wikidata-Query-Service
dcausse added a project to T361246: scap deploy should not repool a wdqs node that is depooled: Wikidata-Query-Service.
Thu, Mar 28, 3:20 PM · Release-Engineering-Team, Data-Platform-SRE, Scap, Wikidata, Wikidata-Query-Service
dcausse created T361246: scap deploy should not repool a wdqs node that is depooled.
Thu, Mar 28, 3:20 PM · Release-Engineering-Team, Data-Platform-SRE, Scap, Wikidata, Wikidata-Query-Service
dcausse added a comment to T360993: WDQS lag propagation to wikidata not working as intended.

I could re-enable puppet on wdqs1013 and restart the updater to catchup on updates. But apparently this machine was repooled yesterday (as part of the wdqs scap deploy I suppose) and thus started to serve stale data without triggering any maxlag. It's when re-enabling puppet that I realized that this node was still pooled so I depooled it immediately but this caused a maxlag for several minutes.
Scap repooling machines might be something we might look into to avoid this kind of issues in the future.

Thu, Mar 28, 2:34 PM · Data-Platform-SRE (2024.04.15 - 2024.05.05), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Patch-For-Review, Wikidata, Discovery-Search (Current work)
dcausse added a comment to T360993: WDQS lag propagation to wikidata not working as intended.

depooling the node we can see that the query rate actually going down to 0, request rate is generally very low on codfw so we might have to tune the threshold at around 0.2.

image.png (837×859 px, 223 KB)

Thu, Mar 28, 2:10 PM · Data-Platform-SRE (2024.04.15 - 2024.05.05), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Patch-For-Review, Wikidata, Discovery-Search (Current work)

Mar 26 2024

dcausse removed a project from T336352: Update maxlag calculation maintenance script to reflect new prometheus queries: Patch-For-Review.
Mar 26 2024, 8:08 PM · Wikidata Dev Team (Sprint-∞), wmde-wikidata-tech, Wikidata.org, Wikidata-Query-Service, Wikidata
dcausse added a comment to T360993: WDQS lag propagation to wikidata not working as intended.

The approach taken is:

  • from nginx control a new header named 'x-monitoring-query' set to true if a list of criteria is met (currently using user-agent strings but could be extended to using source IPs as well I suppose)
  • from blazegraph, do not log query with the header x-monitoring-query set
  • adapt Wikidata.org to allow tuning the minimal query rate expected to be served from a pooled served (was hardcoded to 1.0)
  • change the systemd timer that runs updateQueryServiceLag.php to set --pooled-server-min-query-rate to 0.5 (will need to double check that this value is sane and works well for codfw and eqiad servers)
Mar 26 2024, 6:35 PM · Data-Platform-SRE (2024.04.15 - 2024.05.05), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Patch-For-Review, Wikidata, Discovery-Search (Current work)
dcausse claimed T360993: WDQS lag propagation to wikidata not working as intended.
Mar 26 2024, 6:32 PM · Data-Platform-SRE (2024.04.15 - 2024.05.05), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Patch-For-Review, Wikidata, Discovery-Search (Current work)
dcausse added a comment to T360993: WDQS lag propagation to wikidata not working as intended.

Here are the UAs seen in hour of a depooled server:

+------------------------------------------------------------------+-----+
|UA                                                                |count|
+------------------------------------------------------------------+-----+
|check_http/v2.3.3 (monitoring-plugins 2.3.3)                      |87   |
|Twisted PageGetter                                                |2146 |
|prometheus-public-sparql-ep-check                                 |1913 |
|wmf-prometheus/prometheus-blazegraph-exporter (root@wikimedia.org)|120  |
+------------------------------------------------------------------+-----+
Mar 26 2024, 3:05 PM · Data-Platform-SRE (2024.04.15 - 2024.05.05), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Patch-For-Review, Wikidata, Discovery-Search (Current work)
dcausse triaged T360993: WDQS lag propagation to wikidata not working as intended as High priority.
Mar 26 2024, 11:07 AM · Data-Platform-SRE (2024.04.15 - 2024.05.05), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Patch-For-Review, Wikidata, Discovery-Search (Current work)
dcausse added a comment to T360993: WDQS lag propagation to wikidata not working as intended.

Mitigation on wdqs1013:

  • blazegraph stopped
  • updater stopped with the /srv/wdqs/data_loaded flag removed
  • puppet disabled
Mar 26 2024, 11:06 AM · Data-Platform-SRE (2024.04.15 - 2024.05.05), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Patch-For-Review, Wikidata, Discovery-Search (Current work)
dcausse created T360993: WDQS lag propagation to wikidata not working as intended.
Mar 26 2024, 10:51 AM · Data-Platform-SRE (2024.04.15 - 2024.05.05), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Patch-For-Review, Wikidata, Discovery-Search (Current work)

Mar 25 2024

dcausse moved T355451: Update URLs on MediaWiki:Elastica-desc from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Mar 25 2024, 4:17 PM · MW-1.42-notes (1.42.0-wmf.22; 2024-03-12), Discovery-Search (Current work), Elasticsearch

Mar 21 2024

dcausse moved T357966: Document limitations of blazegraph federation from In Progress to Needs review on the Discovery-Search (Current work) board.

draft page: https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/WDQS_graph_split/Federation_Limits

Mar 21 2024, 7:05 PM · Discovery-Search (Current work), Wikidata

Mar 8 2024

dcausse moved T355451: Update URLs on MediaWiki:Elastica-desc from Incoming to Needs review on the Discovery-Search (Current work) board.
Mar 8 2024, 10:52 AM · MW-1.42-notes (1.42.0-wmf.22; 2024-03-12), Discovery-Search (Current work), Elasticsearch

Mar 7 2024

dcausse added a comment to T359215: mediawiki_cirrussearch_request data is regularly late.

Discussed the issue today with @JAllemandou and the reason is that CirrusSearch in some circonstances might send these outdated events, we will fix the root cause (T359580) and in the meantime these alerts for this dataset can be ignored.

Mar 7 2024, 6:39 PM · Performance Issue, Data-Platform
dcausse created T359580: CirrusSearch should not send outdated cirrussearch-request events.
Mar 7 2024, 6:37 PM · Discovery-Search (Current work), CirrusSearch

Mar 5 2024

dcausse claimed T357966: Document limitations of blazegraph federation.
Mar 5 2024, 5:38 PM · Discovery-Search (Current work), Wikidata
dcausse moved T353683: Unable to find a file by filename while adding a Commons media file statement from In Progress to Needs review on the Discovery-Search (Current work) board.

changed the layout of the query a bit by moving the logistic function introduced in T271799 to the top-level so that it wraps the new nearmatch clause

Mar 5 2024, 5:25 PM · MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Structured-Data-Backlog, SDAW-MediaSearch, Discovery-Search (Current work), CirrusSearch, Wikidata
dcausse moved T355451: Update URLs on MediaWiki:Elastica-desc from To Be Deployed to Needs Reporting on the Discovery-Search (Current work) board.
Mar 5 2024, 3:25 PM · MW-1.42-notes (1.42.0-wmf.22; 2024-03-12), Discovery-Search (Current work), Elasticsearch

Mar 4 2024

dcausse claimed T357980: Compile a set of queries rewritten with federation across the two graph splits.

Compiled 10 real world examples at https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/WDQS_graph_split/Federated_Queries_Examples

Mar 4 2024, 7:44 PM · Discovery-Search (Current work), Wikidata
dcausse added a comment to T355040: Compare the results of sparql queries between the fullgraph and the subgraphs.

final report available at https://wikitech.wikimedia.org/wiki/Wikidata_Query_Service/WDQS_Graph_Split_Impact_Analysis

Mar 4 2024, 7:41 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
lmata awarded T359033: EPIC: Convert CirrusSearch metrics to statslib a Like token.
Mar 4 2024, 7:39 PM · Observability-Metrics, Epic, Discovery-Search (Current work), CirrusSearch
dcausse added a comment to T356773: [tracking] Community feedback for the WDQS Split the Graph project.

@Physikerwelt thanks for your feedback.

Mar 4 2024, 7:25 PM · Discovery-Search (Current work), Wikidata
dcausse added a comment to T356773: [tracking] Community feedback for the WDQS Split the Graph project.

I tried to get the federation working, but got time outs too. The problem is that the current setup makes splits at a statement level. That is, given statements with some property (e.g. P2860 and P1433), some results are in one QS instance and some are in the other. That means a lot of federation-union combinations to get all results. I posted an example query that is affected (the first I tried) in this issue report: https://github.com/WDscholia/scholia/issues/2423

Mar 4 2024, 7:02 PM · Discovery-Search (Current work), Wikidata
dcausse moved T353683: Unable to find a file by filename while adding a Commons media file statement from To Be Deployed to In Progress on the Discovery-Search (Current work) board.

The new builder moved the result to #4 which is better but still not enough and it's beaten by 3 other images because other criteria:

  • weighted_tags:image.linked.from.wikipedia.lead_image/Q458
  • statement_keywords:p180=q458
Mar 4 2024, 5:00 PM · MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Structured-Data-Backlog, SDAW-MediaSearch, Discovery-Search (Current work), CirrusSearch, Wikidata
dcausse moved T359033: EPIC: Convert CirrusSearch metrics to statslib from Incoming to Epics on the Discovery-Search (Current work) board.
Mar 4 2024, 4:53 PM · Observability-Metrics, Epic, Discovery-Search (Current work), CirrusSearch
dcausse renamed T359033: EPIC: Convert CirrusSearch metrics to statslib from Convert CirrusSearch metrics to statslib to EPIC: Convert CirrusSearch metrics to statslib.
Mar 4 2024, 4:52 PM · Observability-Metrics, Epic, Discovery-Search (Current work), CirrusSearch
dcausse moved T355040: Compare the results of sparql queries between the fullgraph and the subgraphs from To Be Deployed to Needs Reporting on the Discovery-Search (Current work) board.
Mar 4 2024, 4:15 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse moved T355040: Compare the results of sparql queries between the fullgraph and the subgraphs from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Mar 4 2024, 4:15 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse moved T355451: Update URLs on MediaWiki:Elastica-desc from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Mar 4 2024, 4:15 PM · MW-1.42-notes (1.42.0-wmf.22; 2024-03-12), Discovery-Search (Current work), Elasticsearch
dcausse moved T353683: Unable to find a file by filename while adding a Commons media file statement from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Mar 4 2024, 4:14 PM · MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Structured-Data-Backlog, SDAW-MediaSearch, Discovery-Search (Current work), CirrusSearch, Wikidata
dcausse added a subtask for T343020: Converting MediaWiki Metrics to StatsLib: T359033: EPIC: Convert CirrusSearch metrics to statslib.
Mar 4 2024, 10:05 AM · SRE Observability (FY2023/2024-Q4), Observability-Metrics
dcausse added a parent task for T359033: EPIC: Convert CirrusSearch metrics to statslib: T343020: Converting MediaWiki Metrics to StatsLib.
Mar 4 2024, 10:05 AM · Observability-Metrics, Epic, Discovery-Search (Current work), CirrusSearch
dcausse created T359033: EPIC: Convert CirrusSearch metrics to statslib.
Mar 4 2024, 10:05 AM · Observability-Metrics, Epic, Discovery-Search (Current work), CirrusSearch

Mar 1 2024

dcausse added a comment to T316421: Upgrade etherpad.wikimedia.org to v1.9.7.

Since the upgrade I believe that we are affected by https://github.com/ether/etherpad-lite/issues/5401. Wondering if a stale settings.json file got kept with padOptions.userName & userColor set to false instead of null.

Mar 1 2024, 10:59 AM · User-notice-archive, collaboration-services, SRE, Wikimedia-Etherpad

Feb 29 2024

dcausse updated the task description for T358472: Search dag image_suggestions_weekly failed with: Empty dataframe provided.
Feb 29 2024, 9:27 AM · Patch-For-Review, Discovery-Search (Current work), Structured-Data-Backlog, Image-Suggestions

Feb 26 2024

dcausse added a comment to T357980: Compile a set of queries rewritten with federation across the two graph splits.

WIP at https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/WDQS_graph_split/Federated_Queries_Examples

Feb 26 2024, 3:25 PM · Discovery-Search (Current work), Wikidata
dcausse updated the task description for T358472: Search dag image_suggestions_weekly failed with: Empty dataframe provided.
Feb 26 2024, 1:31 PM · Patch-For-Review, Discovery-Search (Current work), Structured-Data-Backlog, Image-Suggestions
dcausse created T358472: Search dag image_suggestions_weekly failed with: Empty dataframe provided.
Feb 26 2024, 9:48 AM · Patch-For-Review, Discovery-Search (Current work), Structured-Data-Backlog, Image-Suggestions

Feb 20 2024

dcausse claimed T355451: Update URLs on MediaWiki:Elastica-desc.
Feb 20 2024, 3:50 PM · MW-1.42-notes (1.42.0-wmf.22; 2024-03-12), Discovery-Search (Current work), Elasticsearch
dcausse added a subtask for T337013: [Epic] Splitting the graph in WDQS: T357980: Compile a set of queries rewritten with federation across the two graph splits.
Feb 20 2024, 2:00 PM · Discovery-Search (Current work), Epic, Wikidata-Query-Service, Wikidata
dcausse added a parent task for T357980: Compile a set of queries rewritten with federation across the two graph splits: T337013: [Epic] Splitting the graph in WDQS.
Feb 20 2024, 2:00 PM · Discovery-Search (Current work), Wikidata
dcausse renamed T357980: Compile a set of queries rewritten with federation across the two graph splits from Compile a set of queries rewritten with federation accross the two graph splits to Compile a set of queries rewritten with federation across the two graph splits.
Feb 20 2024, 2:00 PM · Discovery-Search (Current work), Wikidata
dcausse created T357980: Compile a set of queries rewritten with federation across the two graph splits.
Feb 20 2024, 1:58 PM · Discovery-Search (Current work), Wikidata
dcausse added a subtask for T337013: [Epic] Splitting the graph in WDQS: T357966: Document limitations of blazegraph federation.
Feb 20 2024, 11:03 AM · Discovery-Search (Current work), Epic, Wikidata-Query-Service, Wikidata
dcausse added a parent task for T357966: Document limitations of blazegraph federation: T337013: [Epic] Splitting the graph in WDQS.
Feb 20 2024, 11:03 AM · Discovery-Search (Current work), Wikidata
dcausse created T357966: Document limitations of blazegraph federation.
Feb 20 2024, 10:59 AM · Discovery-Search (Current work), Wikidata

Feb 9 2024

dcausse edited P56589 ForceSearchIndex.
Feb 9 2024, 5:06 PM
dcausse edited P56589 ForceSearchIndex.
Feb 9 2024, 3:53 PM
dcausse updated the title for P56589 ForceSearchIndex from untitled to ForceSearchIndex.
Feb 9 2024, 3:46 PM
dcausse created P56589 ForceSearchIndex.
Feb 9 2024, 3:46 PM

Feb 8 2024

dcausse moved T355040: Compare the results of sparql queries between the fullgraph and the subgraphs from In Progress to Needs review on the Discovery-Search (Current work) board.

Draft report up at https://wikitech.wikimedia.org/wiki/User:DCausse/WDQS_Graph_Split_Impact_Analysis

Feb 8 2024, 8:38 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse added a comment to T353453: [Analytics] Impact of Scholia on WDQS.

Quick note on this:

There are two ways that need to be factored in to deriving if a query is from Scholia. Some queries do start with #tool: scholia as @dcausse suggested, but I checked for user agents and also found that the string "Scholia" is also used as a user agent. Big thing is that some of the queries have the comment and some have the user agent, but in no cases do we have both.

Feb 8 2024, 4:08 PM · Wikidata Analytics (Kanban), Wikidata
dr0ptp4kt awarded T349512: [Analytics] Collect multiple sets of SPARQL queries a Party Time token.
Feb 8 2024, 11:48 AM · Wikidata Analytics (Kanban), Discovery-Search (Current work), Wikidata, Wikidata-Query-Service

Feb 2 2024

dcausse updated the task description for T356030: Search dag image_suggestions_weekly failed waiting for analytics_platform_eng.image_suggestions_search_index_delta/snapshot=2024-01-15.
Feb 2 2024, 5:33 PM · Discovery-Search (Current work), Data-Engineering (Sprint 8), Image-Suggestions