
Document CirrusSearch schema, especially with respect to the dumps we now provide
Open, Low, Public

Description

We have recently started publishing dumps of our search indexes on dumps.wikimedia.org. It would be nice to document the structure of the documents in these dumps so that researchers can figure out whether the dumps are useful to them and make sure they are interpreting the data correctly.
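Until such documentation exists, one rough way to explore the structure is to tally the top-level fields seen in a sample of a dump. The sketch below is only illustrative. It assumes (this task does not state it) that a dump file is gzip-compressed, newline-delimited JSON in which Elasticsearch bulk-style action lines such as {"index": {...}} alternate with document lines. The function names and the `limit` parameter are hypothetical.

```python
import gzip
import json
from collections import Counter, defaultdict


def summarize_fields(lines):
    """Count which JSON value types appear under each top-level field name."""
    summary = defaultdict(Counter)
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        # Assumption: a line whose only key is "index" is a bulk-style
        # action line and carries no document fields, so it is skipped.
        if set(record) == {"index"}:
            continue
        for field, value in record.items():
            summary[field][type(value).__name__] += 1
    return {field: dict(types) for field, types in summary.items()}


def summarize_dump(path, limit=1000):
    """Summarize at most the first `limit` lines of a gzip-compressed file."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        return summarize_fields(line for _, line in zip(range(limit), handle))
```

Calling summarize_dump on a locally downloaded file returns a mapping from field name to the Python types observed for it, which can serve as a starting point for a written schema. Nested fields are not expanded in this sketch.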

Event Timeline

EBernhardson raised the priority of this task from to Needs Triage.
EBernhardson updated the task description.
EBernhardson subscribed.
Deskana moved this task from Needs triage to Search on the Discovery-ARCHIVED board.
Deskana subscribed.
MPhamWMF subscribed.

Closing out low/est priority tasks over 6 months old with no activity within the last 6 months in order to clean out the backlog of tickets we will not be addressing in the near term. Please feel free to reopen if you think a ticket is important, but bear in mind that given current priorities and resourcing, it is unlikely that the Search team will pick up these tasks in the foreseeable future. We hope that the requested changes have either been addressed by or made irrelevant by work the team has done or is doing -- e.g. upgrading Elasticsearch to a newer version will solve various ES-related problems -- or will be subsumed by future work in a more generalized way.

Reverting misguided closure.