Clean up failed reindexing indexes
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	TJones
	Apr 7 2021, 10:53 PM

Description

User story: As a search engineer I would like there to be only one copy of a given index in production so I know which one is the live index, and to know that reindexing was succcessful.

From mwmaint1002 you can run this janky command line code to find duplicate indexes:

for cluster in search.svc.eqiad.wmnet search.svc.codfw.wmnet cloudelastic.wikimedia.org; do for port in 9243 9443 9643; do echo "$cluster:$port"; curl -s https://$cluster:$port/_cat/indices | perl -pe 's/^\S+\s+\S+\s+(\S+)\s+.*/$1/; s/_(\d+|first)//;' | sort | uniq -c | sort -n | grep -vP "^\s+1\s"; done; done

There are 46 indexes with duplicates across 8 of the nine cluster/port combos, including glent on eqiad and codfw. Most have 2, but many have 3, especially on cloudelastic:9243. It was unclear in the Wednesday Meeting today whether there should be multiple glent indexes or not, so those may be okay. One duplicate at a time during reindexing is probably valid, too.

When the current round of reindexing is done, we can clean up duplicates.

Acceptance Criteria:

There are no unexpected (glent?) duplicate indexes in the eqiad, codfw, or cloudelastic clusters.

Bonus result:

A better way to find duplicate indexes than the command line abomination above.

Details

	Subject	Repo	Branch	Lines +/-
	Reconcile configured indices with live state	mediawiki/extensions/CirrusSearch	master	+514 -6

Customize query in gerrit

Related Objects

Mentioned In: T279009: Cleanup duplicate indices in cloudelastic
Mentioned Here: T279009: Cleanup duplicate indices in cloudelastic

Event Timeline

TJones created this task.Apr 7 2021, 10:53 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptApr 7 2021, 10:53 PM

• dcausse updated the task description. (Show Details)Apr 8 2021, 7:36 AM

• MPhamWMF moved this task from needs triage to Current work on the Discovery-Search board.Apr 19 2021, 3:14 PM

• MPhamWMF edited projects, added Discovery-Search (Current work); removed Discovery-Search.

Note that T279009 also has some indices to cleanup, it might make sense to address both at the same time.

Change 682189 had a related patch set uploaded (by Ebernhardson; author: Ebernhardson):

[mediawiki/extensions/CirrusSearch@master] Reconcile configured indices with live state

https://gerrit.wikimedia.org/r/682189

gerritbot added a project: Patch-For-Review.Apr 23 2021, 6:55 PM

Change 682189 merged by jenkins-bot:

[mediawiki/extensions/CirrusSearch@master] Reconcile configured indices with live state

https://gerrit.wikimedia.org/r/682189

Maintenance_bot removed a project: Patch-For-Review.May 3 2021, 9:10 AM

Gehel moved this task from Incoming to Needs Reporting on the Discovery-Search (Current work) board.May 3 2021, 3:27 PM

Gehel closed this task as Resolved.May 5 2021, 12:26 PM

Gehel claimed this task.

Clean up failed reindexing indexesClosed, ResolvedPublicActions

Description

Details

Related Objects

Event Timeline

Clean up failed reindexing indexes
Closed, ResolvedPublic
Actions