CirrusSearch should expose a function to reset its weighted_tags for a particular tag category
Closed, Resolved | Public | 3 Estimated Story Points

Description

As an extension developer I want to reset the weighted_tags field managed by CirrusSearch so that I can update the index without relying on regular page updates.

AC:

  • rename ores_articletopics to weighted_tags in the mapping configuration
  • the weighted_tags field should be used with the multilist noop handler during updates
  • a public function CirrusSearch::resetWeightedTag( $pageId, $tagCategory ) is available to reset the tag synchronously (using an ElasticaWrite job)
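To illustrate what resetting a single tag category could mean at the document level, here is a minimal Python sketch. It is not CirrusSearch or Elasticsearch code. It assumes that each weighted_tags entry has the form category/value, that an update replaces every entry belonging to a category named in the update, and that a sentinel value clears the category. The function name merge_weighted_tags, the sentinel spelling, and the sample tags are all hypothetical and only serve the illustration.

```python
# Assumed sentinel meaning "remove every entry in this category".
# The spelling is an assumption made for this sketch.
DELETE_MARKER = "__DELETE_GROUPING__"


def merge_weighted_tags(current, update):
    """Return the new tag list after applying `update` to `current`.

    Entries look like "category/value". Every category mentioned in
    `update` is replaced wholesale; other categories are left untouched.
    """
    def category(tag):
        # Everything before the first "/" is treated as the category.
        return tag.partition("/")[0]

    def value(tag):
        # Everything after the first "/" (empty if there is no "/").
        return tag.partition("/")[2]

    touched = {category(tag) for tag in update}
    kept = [tag for tag in current if category(tag) not in touched]
    added = [tag for tag in update if value(tag) != DELETE_MARKER]
    return kept + added


# Hypothetical sample data: two categories on one page.
current = ["topic/sports|800", "links/exists|1"]

# Resetting the "links" category leaves the "topic" entries in place.
after_reset = merge_weighted_tags(current, ["links/" + DELETE_MARKER])

# Updating "topic" replaces only the "topic" entries.
after_update = merge_weighted_tags(current, ["topic/history|500"])
```

Under these assumptions, a reset of one category cannot clobber tags owned by another extension, which is the property the acceptance criteria appear to be after.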

Event Timeline

dcausse moved this task from Incoming to In Progress on the Discovery-Search (Current work) board.
dcausse set the point value for this task to 3.

Change 661379 had a related patch set uploaded (by DCausse; owner: DCausse):
[mediawiki/extensions/CirrusSearch@master] Generalize ORES article topics with WeightedTags

https://gerrit.wikimedia.org/r/661379

Change 661380 had a related patch set uploaded (by DCausse; owner: DCausse):
[mediawiki/extensions/CirrusSearch@master] Make weighted_tags updatable from CirrusSearch using the multilist handler

https://gerrit.wikimedia.org/r/661380

Change 661383 had a related patch set uploaded (by DCausse; owner: DCausse):
[operations/mediawiki-config@master] [cirrus] rename ores_articletopics -> weighted_tags

https://gerrit.wikimedia.org/r/661383

Change 661384 had a related patch set uploaded (by DCausse; owner: DCausse):
[operations/mediawiki-config@master] [cirrus] drop deprecated ores_articletopics config

https://gerrit.wikimedia.org/r/661384

Change 661383 merged by jenkins-bot:
[operations/mediawiki-config@master] [cirrus] rename ores_articletopics -> weighted_tags

https://gerrit.wikimedia.org/r/661383

Change 661379 merged by jenkins-bot:
[mediawiki/extensions/CirrusSearch@master] Generalize ORES article topics with WeightedTags

https://gerrit.wikimedia.org/r/661379

Change 661380 merged by jenkins-bot:
[mediawiki/extensions/CirrusSearch@master] Make weighted_tags updatable from CirrusSearch using the multilist handler

https://gerrit.wikimedia.org/r/661380

When I try to run this in vagrant I get an error about Elasticsearch receiving a string but expecting a list. Do I need to do some kind of manual index update?

(Aside: the log is fairly useless; it just says "Error in one or more bulk request actions: update: /wiki_content_first/page/56 caused failed to execute script". I have to inspect the exception object in a debugger to see the response object with the detailed error message. Is some configuration change needed to get better errors?)

That was fixed in https://gerrit.wikimedia.org/r/c/mediawiki/extensions/CirrusSearch/+/664596, hopefully.