
dcausse (David Causse)
User

User Details

User Since
Jun 9 2015, 9:03 AM (470 w, 14 h)
Availability
Available
IRC Nick
dcausse
LDAP User
DCausse
MediaWiki User
DCausse (WMF)

Recent Activity

Today

dcausse updated the task description for T365692: PHP Notice: Undefined index: lexeme_language / lexical_category.
Tue, Jun 11, 8:36 AM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Discovery-Search (Current work), wmde-wikidata-tech, Wikidata, Wikidata Lexicographical data, Wikimedia-production-error
dcausse moved T365692: PHP Notice: Undefined index: lexeme_language / lexical_category from In Progress to Needs Reporting on the Discovery-Search (Current work) board.

Triggered a reindex of all the lexemes using https://gitlab.wikimedia.org/repos/search-platform/cirrus-rerender, might take about 3 hours to complete.

Tue, Jun 11, 8:36 AM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Discovery-Search (Current work), wmde-wikidata-tech, Wikidata, Wikidata Lexicographical data, Wikimedia-production-error

Yesterday

dcausse added a comment to T366904: Improve mysql search for files.

@dcausse @Gehel As far as I can see, updateTitle is not implemented by CirrusSearch, right, and is thus a no-op per the parent SearchEngine class? If so, I can safely modify this.

Mon, Jun 10, 9:14 PM · Patch-For-Review, User-TheDJ, Discovery-Search, MediaWiki-Search
dcausse awarded T358373: [Dumps 2] Reconcillation job to detect and fetch missing/corrupted revisions a Love token.
Mon, Jun 10, 6:22 PM · Dumps 2.0 (Kanban Board)

Thu, Jun 6

dcausse added a comment to P64016 Testing wdqs.data-reload with HDFS.

@RKemper for testing I created a smaller folder at hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/. It has only two chunks, so I hope it helps iterate a bit faster on this. The command should become:

cookbook sre.wdqs.data-reload \
 --task-id T349069 \
 --reason "Test wdqs reload based on HDFS" \
 --reload-data wikidata_full \
 --from-hdfs hdfs:///wmf/discovery/wdqs-reload-cookbook-test-T349069/ \
 --stat-host stat1009.eqiad.wmnet \
 wdqs_host
Thu, Jun 6, 12:20 PM

Tue, Jun 4

dcausse edited P64016 Testing wdqs.data-reload with HDFS.
Tue, Jun 4, 3:31 PM
dcausse created P64016 Testing wdqs.data-reload with HDFS.
Tue, Jun 4, 3:24 PM

Mon, Jun 3

dcausse placed T331127: phantom redirects lingering in incategory searches after page moves up for grabs.
Mon, Jun 3, 4:18 PM · MW-1.40-notes (1.40.0-wmf.25; 2023-02-27), Discovery-Search (Current work), CirrusSearch
dcausse added a comment to T362518: Deprecate buster-backports.

@dcausse docker-registry.wikimedia.org/wikimedia/wikidata-query-flink-rdf-streaming-updater seems to be deprecated in favor of docker-registry.wikimedia.org/repos/search-platform/flink-rdf-streaming-updater, can you confirm?

Yes (all the images under docker-registry.wikimedia.org/wikimedia/wikidata-query-flink-rdf-streaming-updater should no longer be used and can be safely removed if needed)

Mon, Jun 3, 3:56 PM · Patch-For-Review, Infrastructure-Foundations, Release-Engineering-Team, serviceops
dcausse moved T331127: phantom redirects lingering in incategory searches after page moves from Needs Reporting to Incoming on the Discovery-Search (Current work) board.

Sorry to see this happening again, it is probable that we missed some edge cases when deploying T317045.

Mon, Jun 3, 8:08 AM · MW-1.40-notes (1.40.0-wmf.25; 2023-02-27), Discovery-Search (Current work), CirrusSearch

Fri, May 31

dcausse added a comment to T366253: Create a generic stream to populate CirrusSearch weighted_tags.

From a SUP perspective this would replace all sources of weighted tags (config option: stream name):

  • article-topic-stream: mediawiki.page_outlink_topic_prediction_change.v1
  • draft-topic-stream: mediawiki.revision_score_drafttopic
  • recommendation-create-stream: mediawiki.revision-recommendation-create
Fri, May 31, 7:38 AM · CirrusSearch, Discovery-Search

Thu, May 30

dcausse added a comment to T364856: Outreach to producers of "other dumps" to raise awareness about Dumps 2.0 and options for deprecation or migration.

Hi, we might have a use-case related to "other dumps" that could benefit from the Dumps 2.0 infrastructure. I filed T366248 with some details about it.

Thu, May 30, 9:32 AM · Data Products, Dumps 2.0, Dumps-Generation, Epic
dcausse created T366253: Create a generic stream to populate CirrusSearch weighted_tags.
Thu, May 30, 9:28 AM · CirrusSearch, Discovery-Search
dcausse created T366248: Source the CirrusSearch index dumps from hadoop instead of a MW maintenance script.
Thu, May 30, 9:07 AM · Data Products, CirrusSearch, Dumps 2.0, Discovery-Search

Wed, May 29

dcausse added a comment to T365692: PHP Notice: Undefined index: lexeme_language / lexical_category.

The system should now index lexemes properly.
We still have to reindex all the lexemes to fix the ones created/edited before the fix was applied.

Wed, May 29, 10:20 AM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Discovery-Search (Current work), wmde-wikidata-tech, Wikidata, Wikidata Lexicographical data, Wikimedia-production-error
dcausse updated the task description for T365692: PHP Notice: Undefined index: lexeme_language / lexical_category.
Wed, May 29, 10:18 AM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Discovery-Search (Current work), wmde-wikidata-tech, Wikidata, Wikidata Lexicographical data, Wikimedia-production-error
dcausse added a comment to T366043: Some dumps are not available since mid may 2024.

@BTullis thanks! Categories are reloaded via a cronjob on all WDQS machines; the job is about to run in 30 minutes.

Wed, May 29, 7:36 AM · Data-Platform-SRE (2024.05.27 - 2024.06.16), Discovery-Search, Data-Engineering, Dumps-Generation
dcausse added a comment to T366043: Some dumps are not available since mid may 2024.

@BTullis thanks! Categories are reloaded via a cronjob on all WDQS machines; the job is about to run in 30 minutes.

Wed, May 29, 7:15 AM · Data-Platform-SRE (2024.05.27 - 2024.06.16), Discovery-Search, Data-Engineering, Dumps-Generation

Tue, May 28

dcausse added a comment to P63465 extra fields in cirrus indices.

Output with:

cirrus = (spark.table("discovery.cirrus_index").where('cirrus_replica="codfw" AND snapshot="20240428"'))
Tue, May 28, 5:15 PM
dcausse created P63465 extra fields in cirrus indices.
Tue, May 28, 5:12 PM
dcausse committed rEWLC5e903c77c46b: Workaround missing lemma fields.
Tue, May 28, 4:48 PM
dcausse added a comment to T365692: PHP Notice: Undefined index: lexeme_language / lexical_category.

The search fields specific to Lexemes are currently ignored, which causes this NOTICE and also prevents lexemes from being searchable (esp. the new ones).
The schemas should be adapted to support these fields and the lexemes will have to be re-indexed.

Tue, May 28, 9:53 AM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Discovery-Search (Current work), wmde-wikidata-tech, Wikidata, Wikidata Lexicographical data, Wikimedia-production-error
dcausse merged task T365684: Particular lexeme (L1326823) not indexed so search with the Wikidata API returns nothing into T365692: PHP Notice: Undefined index: lexeme_language / lexical_category.
Tue, May 28, 9:51 AM · Discovery-Search (Current work), Wikidata
dcausse merged T365684: Particular lexeme (L1326823) not indexed so search with the Wikidata API returns nothing into T365692: PHP Notice: Undefined index: lexeme_language / lexical_category.
Tue, May 28, 9:50 AM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Discovery-Search (Current work), wmde-wikidata-tech, Wikidata, Wikidata Lexicographical data, Wikimedia-production-error
dcausse claimed T365692: PHP Notice: Undefined index: lexeme_language / lexical_category.
Tue, May 28, 8:47 AM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Discovery-Search (Current work), wmde-wikidata-tech, Wikidata, Wikidata Lexicographical data, Wikimedia-production-error
dcausse added a comment to T361483: Selectively disable changeprop functionality that is no longer used.

@achou Apart from expert search users explicitly searching for topics (which I suspect are rare), the growth team is the only team using this data in a user-facing product. It is hard to tell what the impact would be for them, but I suspect that if only a few (<100) are lost, this would hardly impact anything. If you suspect that more might be lost, perhaps having duplicates is better, if this is an option for you.

Tue, May 28, 8:03 AM · Patch-For-Review, Machine-Learning-Team, Lift-Wing, ORES, RESTBase Sunsetting, Content-Transform-Team, serviceops, ChangeProp, API Platform (RESTBase Deprecation Roadmap)
dcausse created T366043: Some dumps are not available since mid may 2024.
Tue, May 28, 7:44 AM · Data-Platform-SRE (2024.05.27 - 2024.06.16), Discovery-Search, Data-Engineering, Dumps-Generation

Thu, May 23

dcausse moved T349069: Design and implement a WDQS data-reload mechanism that sources its data from HDFS instead of the snapshot servers from In Progress to Needs review on the Discovery-Search (Current work) board.
Thu, May 23, 2:52 PM · Patch-For-Review, Discovery-Search (Current work), Data-Platform-SRE, Wikidata-Query-Service, Wikidata
dcausse moved T365190: Cannot provide empty array to wikis as $wgCirrusSearchWriteClusters from Needs review to Needs Reporting on the Discovery-Search (Current work) board.
Thu, May 23, 2:51 PM · MW-1.43-notes (1.43.0-wmf.6; 2024-05-21), Discovery-Search (Current work), CirrusSearch
dcausse moved T364837: Q125918173 missing from elastic@codfw from Needs review to Needs Reporting on the Discovery-Search (Current work) board.
Thu, May 23, 2:51 PM · Discovery-Search (Current work), CirrusSearch
dcausse moved T362060: Generalize ScholarlyArticleSplitter from To Be Deployed to Needs Reporting on the Discovery-Search (Current work) board.
Thu, May 23, 2:51 PM · Discovery-Search (Current work), Wikidata

Thu, May 16

dcausse moved T364837: Q125918173 missing from elastic@codfw from In Progress to Needs review on the Discovery-Search (Current work) board.
Thu, May 16, 6:40 PM · Discovery-Search (Current work), CirrusSearch
dcausse updated the task description for T364837: Q125918173 missing from elastic@codfw.
Thu, May 16, 6:40 PM · Discovery-Search (Current work), CirrusSearch

Wed, May 15

dcausse triaged T364837: Q125918173 missing from elastic@codfw as High priority.
Wed, May 15, 7:42 AM · Discovery-Search (Current work), CirrusSearch

Tue, May 14

dcausse updated the task description for T364837: Q125918173 missing from elastic@codfw.
Tue, May 14, 1:21 PM · Discovery-Search (Current work), CirrusSearch
dcausse updated the task description for T364837: Q125918173 missing from elastic@codfw.
Tue, May 14, 12:40 PM · Discovery-Search (Current work), CirrusSearch
dcausse moved T364837: Q125918173 missing from elastic@codfw from Incoming to In Progress on the Discovery-Search (Current work) board.
Tue, May 14, 10:48 AM · Discovery-Search (Current work), CirrusSearch
dcausse edited projects for T364837: Q125918173 missing from elastic@codfw, added: Discovery-Search (Current work); removed Discovery-Search.
Tue, May 14, 10:47 AM · Discovery-Search (Current work), CirrusSearch
dcausse created T364837: Q125918173 missing from elastic@codfw.
Tue, May 14, 10:44 AM · Discovery-Search (Current work), CirrusSearch
dcausse created P62377 Q125918173 missing from elastic@codfw.
Tue, May 14, 10:07 AM

Mon, May 13

dcausse awarded T362920: Benchmark Blazegraph import with increased buffer capacity (and other factors) a Love token.
Mon, May 13, 8:07 AM · Wikidata, Wikidata-Query-Service

May 7 2024

dcausse moved T362508: WDQS updater misbehaving in codfw from Needs review to Needs Reporting on the Discovery-Search (Current work) board.
May 7 2024, 6:27 AM · Discovery-Search (Current work), Wikidata
dcausse updated the task description for T362508: WDQS updater misbehaving in codfw.
May 7 2024, 6:27 AM · Discovery-Search (Current work), Wikidata

May 6 2024

dcausse updated the task description for T350597: Audit and prioritize metrics for conversion to statslib that are used for graphite-based alerting.
May 6 2024, 3:29 PM · SRE Observability (FY2023/2024-Q4), Discovery-Search (Current work), Data-Platform-SRE, MW-1.42-notes (1.42.0-wmf.20; 2024-02-27), User-fgiunchedi, Observability-Metrics
dcausse moved T360993: WDQS lag propagation to wikidata not working as intended from To Be Deployed to Needs Reporting on the Discovery-Search (Current work) board.
May 6 2024, 3:22 PM · Data-Platform-SRE (2024.05.06 - 2024.05.26), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Wikidata, Discovery-Search (Current work)
dcausse added a comment to T349069: Design and implement a WDQS data-reload mechanism that sources its data from HDFS instead of the snapshot servers.

Possible options I see so far:

  1. Run hdfs-rsync directly from the blazegraph hosts
    • requires installing its dependencies
    • requires opening holes between blazegraph and the hadoop cluster
  2. Schedule hdfs-rsync on a stat machine copying the ttl dumps from hdfs to /srv/analytics-search/wikibase_processed_dumps/wikidata/$SNAPSHOT
    • cons: consumes some space on a stat machine
  3. Run hdfs-rsync on-demand to copy the ttl dump from hdfs to /srv/analytics-search/wikibase_processed_dumps/temp and clean up this folder once done
    • cons: slows the process down a bit
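
As an illustration of option 3 only, here is a minimal Python sketch of the "copy to a temporary folder, use it, then clean up" pattern. The names `staged_copy` and `copy_fn` are hypothetical and not part of any cookbook; `copy_fn` stands in for whatever actually performs the copy (for instance a wrapper around hdfs-rsync, whose exact invocation is not assumed here).

```python
import shutil
import tempfile
from contextlib import contextmanager
from pathlib import Path


@contextmanager
def staged_copy(copy_fn, parent):
    """Copy data into a fresh folder under `parent`, then always delete it.

    `copy_fn` is a hypothetical callable taking the destination Path; in a
    real setup it would shell out to the tool doing the HDFS copy.
    """
    staging = Path(tempfile.mkdtemp(dir=parent))
    try:
        copy_fn(staging)
        yield staging
    finally:
        # Runs on success and on failure, so no stale dump is left behind.
        shutil.rmtree(staging, ignore_errors=True)
```

The cleanup lives in the `finally` clause, so the staging folder is removed even if the copy or the later transfer step fails, which addresses the disk-space concern of option 2 at the cost of copying on every run.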
May 6 2024, 1:11 PM · Patch-For-Review, Discovery-Search (Current work), Data-Platform-SRE, Wikidata-Query-Service, Wikidata
dcausse added a comment to T349069: Design and implement a WDQS data-reload mechanism that sources its data from HDFS instead of the snapshot servers.

Another approach could be to use the /mnt/hdfs mountpoint? I have been told that it might not be stable enough but perhaps it's OK for doing a copy?

May 6 2024, 9:11 AM · Patch-For-Review, Discovery-Search (Current work), Data-Platform-SRE, Wikidata-Query-Service, Wikidata

May 3 2024

dcausse added a comment to T355298: Investigate the impact of the WDQS graph split on constraints checks.

Looking at the constraints I believe that 4 may use sparql:

  • FormatChecker.php
  • TypeChecker.php
  • UniqueValueChecker.php
  • ValueTypeChecker.php
May 3 2024, 3:23 PM · Discovery-Search (Current work), Wikidata Dev Team, Wikibase-Quality-Constraints, Wikidata
dcausse created T364077: Adapt the wdqs data-transfer cookbook to operate with federated subgraphs.
May 3 2024, 8:23 AM · Discovery-Search (Current work), Wikidata

May 2 2024

dcausse added a comment to T349069: Design and implement a WDQS data-reload mechanism that sources its data from HDFS instead of the snapshot servers.

@BTullis @bking I plan to use a cookbook to transfer some data out of hdfs to blazegraph machines. A naive approach I thought about was to use a temp folder somewhere in /srv of a stat100x machine that would be populated using hdfs dfs or hdfs-rsync and then re-use the transferpy python module.
The current dumps are about 200G. Do you think that this option is viable? Can we use a folder in /srv as a temp folder for such transfers? This data is only useful for the transfer and should be deleted by the cookbook when it ends.

May 2 2024, 6:03 PM · Patch-For-Review, Discovery-Search (Current work), Data-Platform-SRE, Wikidata-Query-Service, Wikidata

Apr 30 2024

dcausse updated the task description for T362508: WDQS updater misbehaving in codfw.
Apr 30 2024, 3:56 PM · Discovery-Search (Current work), Wikidata
dcausse moved T362060: Generalize ScholarlyArticleSplitter from In Progress to Needs review on the Discovery-Search (Current work) board.
Apr 30 2024, 8:38 AM · Discovery-Search (Current work), Wikidata
dcausse updated the task description for T349069: Design and implement a WDQS data-reload mechanism that sources its data from HDFS instead of the snapshot servers.
Apr 30 2024, 8:19 AM · Patch-For-Review, Discovery-Search (Current work), Data-Platform-SRE, Wikidata-Query-Service, Wikidata
dcausse claimed T349069: Design and implement a WDQS data-reload mechanism that sources its data from HDFS instead of the snapshot servers.
Apr 30 2024, 8:14 AM · Patch-For-Review, Discovery-Search (Current work), Data-Platform-SRE, Wikidata-Query-Service, Wikidata
dcausse added a project to T349069: Design and implement a WDQS data-reload mechanism that sources its data from HDFS instead of the snapshot servers: Discovery-Search (Current work).
Apr 30 2024, 8:14 AM · Patch-For-Review, Discovery-Search (Current work), Data-Platform-SRE, Wikidata-Query-Service, Wikidata

Apr 29 2024

dcausse added a comment to T363521: Completion suggester can promote a bad build.

https://gerrit.wikimedia.org/r/c/mediawiki/extensions/CirrusSearch/+/1024698 switches from using a scroll to a search_after approach which should be more robust by handling retries and errors properly.
The question is whether we should do more by adding more checks. Unfortunately, not all wikis build a new index and promote it: to optimize cluster operations, most wikis recycle the same index, where we don't have a chance to do such sanity checks prior to promoting.
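
To illustrate why a search_after style of pagination is easier to retry than a scroll, here is a small Python sketch. It is not the CirrusSearch patch: `fetch_page` is a hypothetical callable standing in for the search request, and `make_flaky_backend` is an in-memory stand-in used only for the demonstration.

```python
import time
from typing import Callable, Iterator, List, Optional, Tuple

# Hypothetical page type: (sort_key, document) pairs ordered by sort_key.
Page = List[Tuple[int, dict]]


def iterate_search_after(
    fetch_page: Callable[[Optional[int], int], Page],
    page_size: int = 2,
    max_retries: int = 3,
    backoff_seconds: float = 0.0,
) -> Iterator[dict]:
    """Yield every document, resuming each request from the last sort key."""
    after: Optional[int] = None
    while True:
        for attempt in range(max_retries + 1):
            try:
                page = fetch_page(after, page_size)
                break
            except ConnectionError:
                if attempt == max_retries:
                    raise  # fail loudly instead of returning a partial result
                time.sleep(backoff_seconds)
        if not page:
            return
        for _sort_key, doc in page:
            yield doc
        # No server-side cursor: the next request only needs this value.
        after = page[-1][0]


def make_flaky_backend(docs, fail_on_calls):
    """In-memory stand-in for a search backend (illustration only)."""
    state = {"calls": 0}

    def fetch_page(after, size):
        state["calls"] += 1
        if state["calls"] in fail_on_calls:
            raise ConnectionError("delayed connect error: 113")
        remaining = [(k, d) for k, d in docs if after is None or k > after]
        return remaining[:size]

    return fetch_page
```

Because each request carries its own starting point, a failed page can be re-requested with the same `after` value, and exhausting the retries raises an error rather than ending the iteration early.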

Apr 29 2024, 2:24 PM · Discovery-Search (Current work), serviceops-radar, CirrusSearch
dcausse closed T335974: ‘Remember selection’ option / Vector-2022 have search results that do not start with user input as Resolved.

Closing, the issue is tracked at T363516.

Apr 29 2024, 8:33 AM · MediaWiki-Search, Discovery-Search, Regression, Desktop Improvements (Vector 2022), MediaWiki-User-Interface (autocomplete search), Advanced-Search

Apr 26 2024

dcausse added projects to T363521: Completion suggester can promote a bad build: serviceops, CirrusSearch.

tagging @serviceops for help regarding the connectivity issue and this new "delayed connect error: 113" error

Apr 26 2024, 1:05 PM · Discovery-Search (Current work), serviceops-radar, CirrusSearch
dcausse lowered the priority of T363516: Many search suggestions missing when connecting to eqiad, but not when connecting to codfw from Unbreak Now! to Medium.

completion traffic is now served from codfw which has proper indices, lowering prio

Apr 26 2024, 10:03 AM · CirrusSearch, Discovery-Search (Current work), Patch-For-Review
dcausse triaged T363516: Many search suggestions missing when connecting to eqiad, but not when connecting to codfw as Unbreak Now! priority.

This is still happening, raising to UBN

Apr 26 2024, 9:16 AM · CirrusSearch, Discovery-Search (Current work), Patch-For-Review
dcausse added a comment to T363521: Completion suggester can promote a bad build.

The errors "delayed connect error: 113" seem to have started on Apr 24 at 21:30, right after deploying https://gerrit.wikimedia.org/r/c/operations/puppet/+/1023937.
The errors affect both mw@wikikube and mwmaint1002 https://logstash.wikimedia.org/goto/5ac680b477389129ffb5ddf33fa09940
I think we should switch completion traffic to codfw while we work on a more resilient version of this maint script and also understand why we get these errors.

Apr 26 2024, 9:05 AM · Discovery-Search (Current work), serviceops-radar, CirrusSearch

Apr 24 2024

dcausse claimed T362508: WDQS updater misbehaving in codfw.
Apr 24 2024, 7:33 AM · Discovery-Search (Current work), Wikidata
dcausse added a comment to T359215: mediawiki_cirrussearch_request data is regularly late.

Quick update: a fix was deployed two weeks ago (T359580#9699108) to stop pushing these late events.

Apr 24 2024, 7:18 AM · Performance Issue, Data-Platform

Apr 23 2024

dcausse claimed T362060: Generalize ScholarlyArticleSplitter.
Apr 23 2024, 7:17 PM · Discovery-Search (Current work), Wikidata

Apr 19 2024

dcausse updated the task description for T362977: WDQS updater missed some updates.
Apr 19 2024, 4:38 PM · Data-Engineering, Data-Platform, Wikidata, Wikidata-Query-Service
dcausse awarded T336443: Investigate performance differences between wdqs2022 and older hosts a Love token.
Apr 19 2024, 4:30 PM · Data-Platform-SRE (2024.04.15 - 2024.05.05)
dcausse added a comment to T120242: Eventually Consistent MediaWiki State Change Events.

I think there are two issues to be discussed here: defining qualitative requirements and how to repair inconsistencies.
Regarding qualitative requirements, for search and WDQS we don't have a good sense of what would be good enough. The only visible criterion we have at the moment is when users complain about stale data, but without concrete measurement of the instability it is hard to define a number, I guess. Could we do it the other way around, by starting to measure how consistent the streams are compared to the source of truth? Could this be done for some important streams like revision-create/page-delete/page-undelete/page-state by applying techniques similar to the one used in T215001#7523796? It is probable that missed events are rare in normal conditions, but I still see huge spikes in the logs with many events failing to reach event-gate (T362977). Could there be ways to improve the situation at a reasonable cost?
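
A rough sketch of what "measuring how consistent the streams are compared to the source of truth" could look like, in Python. This is not the technique from T215001; the function name and the idea of comparing plain sets of revision IDs over a time window are assumptions made for illustration.

```python
def stream_consistency(source_ids, stream_ids):
    """Compare IDs seen in a stream with IDs from the source of truth.

    Both arguments are hypothetical iterables of revision IDs covering
    the same time window.
    """
    source = set(source_ids)
    seen = set(stream_ids)
    missed = source - seen    # in the source of truth, never emitted
    spurious = seen - source  # emitted, but unknown to the source of truth
    ratio = 1.0 if not source else (len(source) - len(missed)) / len(source)
    return {"missed": sorted(missed), "spurious": sorted(spurious), "ratio": ratio}


report = stream_consistency([1, 2, 3, 4], [1, 2, 4, 9])
print(report)  # {'missed': [3], 'spurious': [9], 'ratio': 0.75}
```

Tracking such a ratio per stream over time would give a number to reason about, instead of relying on user complaints about stale data.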

Apr 19 2024, 1:17 PM · Data-Engineering, Analytics, DBA, WMF-Architecture-Team, Platform Team Legacy (Later), Event-Platform, Services (later)
dcausse created T362977: WDQS updater missed some updates.
Apr 19 2024, 12:31 PM · Data-Engineering, Data-Platform, Wikidata, Wikidata-Query-Service

Apr 16 2024

dcausse updated the task description for T362508: WDQS updater misbehaving in codfw.
Apr 16 2024, 4:34 PM · Discovery-Search (Current work), Wikidata
dcausse updated the task description for T362508: WDQS updater misbehaving in codfw.
Apr 16 2024, 12:49 PM · Discovery-Search (Current work), Wikidata
dcausse updated the task description for T362508: WDQS updater misbehaving in codfw.
Apr 16 2024, 12:30 PM · Discovery-Search (Current work), Wikidata
dcausse created P60589 Checkpoint read timeout from object store.
Apr 16 2024, 9:12 AM

Apr 15 2024

dcausse created T362508: WDQS updater misbehaving in codfw.
Apr 15 2024, 8:01 AM · Discovery-Search (Current work), Wikidata

Apr 9 2024

dcausse updated the task description for T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs.
Apr 9 2024, 4:29 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse updated the task description for T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs.
Apr 9 2024, 4:28 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata

Apr 8 2024

dcausse created T362074: WDQS wikibase:around sometimes ignore exact matches.
Apr 8 2024, 1:11 PM · Wikidata, Wikidata-Query-Service
dcausse added a subtask for T337013: [Epic] Splitting the graph in WDQS: T362060: Generalize ScholarlyArticleSplitter.
Apr 8 2024, 11:48 AM · Discovery-Search (Current work), Epic, Wikidata-Query-Service, Wikidata
dcausse added a parent task for T362060: Generalize ScholarlyArticleSplitter: T337013: [Epic] Splitting the graph in WDQS.
Apr 8 2024, 11:48 AM · Discovery-Search (Current work), Wikidata
dcausse created T362060: Generalize ScholarlyArticleSplitter.
Apr 8 2024, 11:48 AM · Discovery-Search (Current work), Wikidata
dcausse reopened T361305: decommission elastic20[37-54].codfw.wmnet as "Open".

Reopening since it seems some of these hosts are still mentioned somewhere. The elastic settings check is complaining with CRITICAL - ['elastic2047.codfw.wmnet:9500', 'elastic2052.codfw.wmnet:9500', 'elastic2073.codfw.wmnet:9500', 'elastic2086.codfw.wmnet:9500', 'elastic2092.codfw.wmnet:9500', 'elastic2100.codfw.wmnet:9500', 'elastic2106.codfw.wmnet:9500'] does not match ['elastic2073.codfw.wmnet:9500', 'elastic2086.codfw.wmnet:9500', 'elastic2092.codfw.wmnet:9500', 'elastic2100.codfw.wmnet:9500', 'elastic2106.codfw.wmnet:9500']
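
For readability, the difference between the two lists in the CRITICAL message can be computed with a short Python sketch. The function name is hypothetical, and no assumption is made about which list is the configured one and which is the expected one.

```python
FIRST = [
    "elastic2047.codfw.wmnet:9500", "elastic2052.codfw.wmnet:9500",
    "elastic2073.codfw.wmnet:9500", "elastic2086.codfw.wmnet:9500",
    "elastic2092.codfw.wmnet:9500", "elastic2100.codfw.wmnet:9500",
    "elastic2106.codfw.wmnet:9500",
]
SECOND = [
    "elastic2073.codfw.wmnet:9500", "elastic2086.codfw.wmnet:9500",
    "elastic2092.codfw.wmnet:9500", "elastic2100.codfw.wmnet:9500",
    "elastic2106.codfw.wmnet:9500",
]


def diff_hosts(first, second):
    """Return (entries only in first, entries only in second), sorted."""
    first_set, second_set = set(first), set(second)
    return sorted(first_set - second_set), sorted(second_set - first_set)


only_first, only_second = diff_hosts(FIRST, SECOND)
print(only_first)   # ['elastic2047.codfw.wmnet:9500', 'elastic2052.codfw.wmnet:9500']
print(only_second)  # []
```

The two entries present only in the first list, elastic2047 and elastic2052, both fall in the elastic20[37-54] range covered by this decommission task.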

Apr 8 2024, 8:14 AM · SRE, ops-codfw, decommission-hardware
dcausse reopened T361305: decommission elastic20[37-54].codfw.wmnet, a subtask of T358882: Decommission elastic2037-2054, as Open.
Apr 8 2024, 8:13 AM · Data-Platform-SRE (2024.03.25 - 2024.04.14)

Apr 5 2024

dcausse moved T349911: Explore the feasibility of using SPARQL federation for scholia queries from Blocked/Waiting to Needs Reporting on the Discovery-Search (Current work) board.

Two scholia queries were rewritten:

The pages also contain some documentation about how to approach such rewrites.
I'm boldly moving this ticket to our Needs Reporting column (prior to being closed), as I believe further exploration of how to rewrite scholia queries to support the split could perhaps be better handled in https://github.com/WDscholia/scholia.

Apr 5 2024, 3:39 PM · Discovery-Search (Current work), Wikidata, Wikidata-Query-Service
dcausse added a subtask for T337013: [Epic] Splitting the graph in WDQS: T361950: Ensure that WDQS query throttling does not interfere with federation.
Apr 5 2024, 3:29 PM · Discovery-Search (Current work), Epic, Wikidata-Query-Service, Wikidata
dcausse added a parent task for T361950: Ensure that WDQS query throttling does not interfere with federation: T337013: [Epic] Splitting the graph in WDQS.
Apr 5 2024, 3:29 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse renamed T361950: Ensure that WDQS query throttling does not interfere with federation from Ensure that WDQS query throttling do not interfere with federation to Ensure that WDQS query throttling does not interfere with federation.
Apr 5 2024, 3:29 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse created T361950: Ensure that WDQS query throttling does not interfere with federation.
Apr 5 2024, 3:29 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse updated the task description for T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs.
Apr 5 2024, 12:55 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse added a subtask for T337013: [Epic] Splitting the graph in WDQS: T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs.
Apr 5 2024, 12:53 PM · Discovery-Search (Current work), Epic, Wikidata-Query-Service, Wikidata
dcausse added a parent task for T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs: T337013: [Epic] Splitting the graph in WDQS.
Apr 5 2024, 12:53 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata
dcausse created T361935: Adapt the WDQS Streaming Updater to update multiple WDQS subgraphs.
Apr 5 2024, 12:52 PM · Patch-For-Review, Discovery-Search (Current work), Wikidata

Apr 4 2024

dcausse added a comment to T361114: Alert Search Platform and/or DPE SRE when Wikidata is lagged.

Thanks! I'm not very familiar with alerts being set from grafana either. I'll try to get more info on this; worst case, we can always set up a new one directly in alertmanager just for the wdqs lag, sent to the search team, using the same formula used by updateQueryServiceLag.php.

Apr 4 2024, 12:29 PM · Patch-For-Review, Data-Platform-SRE (2024.05.27 - 2024.06.16), Wikidata, Wikidata-Query-Service
dcausse placed T361114: Alert Search Platform and/or DPE SRE when Wikidata is lagged up for grabs.

@Lucas_Werkmeister_WMDE thanks! Do you know where we could update this to include our alert email for such alerts?

Apr 4 2024, 9:34 AM · Patch-For-Review, Data-Platform-SRE (2024.05.27 - 2024.06.16), Wikidata, Wikidata-Query-Service
dcausse moved T355451: Update URLs on MediaWiki:Elastica-desc from To Be Deployed to Needs Reporting on the Discovery-Search (Current work) board.
Apr 4 2024, 9:30 AM · MW-1.42-notes (1.42.0-wmf.22; 2024-03-12), Discovery-Search (Current work), Elasticsearch
dcausse moved T360993: WDQS lag propagation to wikidata not working as intended from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Apr 4 2024, 9:29 AM · Data-Platform-SRE (2024.05.06 - 2024.05.26), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Wikidata, Discovery-Search (Current work)
dcausse moved T357966: Document limitations of blazegraph federation from Needs review to Needs Reporting on the Discovery-Search (Current work) board.
Apr 4 2024, 9:29 AM · Discovery-Search (Current work), Wikidata
dcausse updated subscribers of T359580: CirrusSearch should not send outdated cirrussearch-request events.

According to @Urbanecm_WMF these queries are probably emitted while running https://github.com/wikimedia/mediawiki-extensions-GrowthExperiments/blob/master/maintenance/refreshLinkRecommendations.php.
Regarding possible fixes, it would be ideal if cirrus could detect that it is being run via a maint script and possibly call something like disablePoolCountersAndLogging, but perhaps without disabling statsd, since the user script might require stats to be emitted.

Apr 4 2024, 9:15 AM · Discovery-Search (Current work), CirrusSearch

Apr 3 2024

dcausse moved T353683: Unable to find a file by filename while adding a Commons media file statement from Needs review to Needs Reporting on the Discovery-Search (Current work) board.

Should be working properly now

Apr 3 2024, 5:35 PM · MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Structured-Data-Backlog, SDAW-MediaSearch, Discovery-Search (Current work), CirrusSearch, Wikidata

Mar 29 2024

dcausse closed T361106: Restore wdqs1013 with a data transfer as Declined.

won't be required after all

Mar 29 2024, 8:49 AM · Data-Platform-SRE (2024.03.25 - 2024.04.14), Wikidata, Discovery-Search (Current work), Wikidata-Query-Service
dcausse closed T361106: Restore wdqs1013 with a data transfer, a subtask of T360993: WDQS lag propagation to wikidata not working as intended, as Declined.
Mar 29 2024, 8:47 AM · Data-Platform-SRE (2024.05.06 - 2024.05.26), MW-1.42-notes (1.42.0-wmf.25; 2024-04-02), Wikidata, Discovery-Search (Current work)

Mar 28 2024

dcausse updated the task description for T361246: scap deploy should not repool a wdqs node that is depooled.
Mar 28 2024, 6:25 PM · Release-Engineering-Team, Data-Platform-SRE, Scap, Wikidata, Wikidata-Query-Service