Fatal "cannot perform this operation with arrays" from CirrusSearch/ElasticaWrite (using JobQueueDB)
Closed, ResolvedPublicPRODUCTION ERROR

Description

Some of my JSON blobs are larger than the max allowed size of a MySQL blob (job_params is a blob column):

2016-01-20 18:09:16 cirrusSearchElasticaWrite Q14475 clientSideTimeout= method=sendData arguments=array(3) cluster=default createdAt=1452822217 errorCount=1 retryCount=0 (id=139253,timestamp=20160115014337) t=593 good

Notice: Unable to unserialize: [a:7:{s:17:"clientSideTimeout";N;s:6:"method";s:8:"sendData";s:9:"arguments";a:3:{i:0;s:7:"content";i:1;a:10:{i:0;O:15:"Elastica\Script":5:{s:24:"]. Unexpected end of buffer during unserialization. in /var/www/wiki/w/includes/jobqueue/JobQueueDB.php on line 803

Fatal error: Invalid operand type was used: cannot perform this operation with arrays in /var/www/wiki/w/extensions/CirrusSearch/includes/Job/ElasticaWrite.php on line 45

I have a lot of Wikidata stuff that I am putting in, and it would be nice if the code could cope with this better (or if job_params were made bigger).

This might not affect production, which doesn't use JobQueueDB.
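
For illustration, here is a minimal standalone PHP sketch (not the actual MediaWiki code) of the failure mode: the serialized params exceed the 64 KiB BLOB limit, the stored blob gets cut off, and unserialize() then returns false instead of the expected params array, so any later array operation on the result fatals.

// Hypothetical example; sizes and param shapes are made up for illustration.
$params = [
    'method'    => 'sendData',
    'arguments' => [ 'content', array_fill( 0, 10, str_repeat( 'x', 10000 ) ) ],
];

$blob      = serialize( $params );       // roughly 100 KB of serialized data
$truncated = substr( $blob, 0, 65535 );  // what a plain BLOB column can hold

$restored = @unserialize( $truncated );  // fails on the cut-off string
var_dump( $restored );                   // bool(false), not the original array

// Code that then assumes $restored is an array (e.g. merging it with
// defaults) produces a fatal like the one reported above.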

Event Timeline

aude raised the priority of this task from to Medium.
aude updated the task description. (Show Details)
aude added projects: CirrusSearch, Wikidata.
aude subscribed.
Restricted Application added a subscriber: Aklapper.

The job is batching 10 updates (e.g. 10 Cirrus documents for Wikibase items), and together they are too large for a MySQL blob :/

Maybe the batch could be smaller / configurable, and/or job_params could be a mediumblob.
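
As a rough sketch of the smaller/configurable batch idea (hypothetical helper, not existing CirrusSearch code), the documents could be chunked before building job params so each job stays under the blob limit:

// Hypothetical sketch; buildWriteJobParams() does not exist in CirrusSearch,
// and the default batch size here is made up for illustration.
function buildWriteJobParams( array $documents, int $batchSize = 5 ): array {
    $jobs = [];
    foreach ( array_chunk( $documents, $batchSize ) as $chunk ) {
        // One sendData-style job per chunk instead of one job for all
        // documents; smaller chunks mean smaller serialized job_params.
        $jobs[] = [ 'method' => 'sendData', 'arguments' => [ 'content', $chunk ] ];
    }
    return $jobs;
}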

When will this bug get fixed? We are using Redis instead of MySQL to store jobs now.
However, we can't afford a lot of memory, which causes jobs stored in Redis to be truncated when the memory cap is reached (2,000 to 3,000 jobs).

It is not immediately obvious how to install Redis for this scenario. For my setup, on CentOS 7, I had to do the following:

sudo yum install redis php70u-pecl-redis
sudo systemctl start redis.service
sudo systemctl enable redis.service

Then, per the Redis setup docs (https://www.mediawiki.org/wiki/Redis#Job_queue), I added the following to my LocalSettings.php:

$wgJobTypeConf['default'] = [
    'class' => 'JobQueueRedis',
    'redisServer' => '127.0.0.1:6379',
    'redisConfig' => [],
    'claimTTL' => 3600,
    'daemonized' => true
];

NOTE: The daemonized parameter is required; see: https://www.mediawiki.org/w/index.php?title=Topic:Ss9ues5n7gtctppm&topic_showPostId=tbv2820lgm06ipbo#flow-post-tbv2820lgm06ipbo

At this point I expected my setup to be functional again; however, it was not. I now received errors when searching. Maybe I didn't give it long enough, but I waited a few minutes and restarted Apache with no luck. Finally, I just re-indexed per the CirrusSearch README: https://phabricator.wikimedia.org/diffusion/ECIR/browse/master/README

I have verified that both pre-existing and new content is now searchable as expected, without any errors.

mobrovac subscribed.

Removing WMF-JobQueue as we don't use JobQueueDB in production.

Why does the job itself contain all of the transformed text, rather than just a revision/page ID from which the transformed text could be derived? I get that some metadata is not stored elsewhere and would have to go in the job.
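
Purely for illustration, the two shapes of job params being contrasted might look like this (made-up values, not the real param layout):

// Job that carries only identifiers and derives the document at run time:
$lightweightJobParams = [ 'method' => 'sendData', 'pageId' => 123, 'revId' => 456 ];

// Job as it exists today, carrying the fully transformed documents,
// which is what can push the serialized params past the blob limit:
$currentJobParams = [
    'method'    => 'sendData',
    'arguments' => [ 'content', [ /* full Elastica documents */ ] ],
];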

Krinkle moved this task from Untriaged to Jan2020/1.35-wmf.14 on the Wikimedia-production-error board.

Error

Request ID: fe2ef475a4166ddf6d2b9afa

message
PHP Notice: Unable to unserialize: [a:9:{s:6:"method";s:8:"sendData";s:9:"arguments";a:2:{i:0;s:7:"content";i:1;a:1:{i:0;a:3:{s:4:"data";a:23:{s:7:"version"……;i:10]. Unexpected end of buffer during unserialization.
trace
#0 /srv/mediawiki/php-1.33.0-wmf.22/includes/jobqueue/JobQueueDB.php(858): MWExceptionHandler::handleError(integer, string, string, integer, array, array)
#1 /srv/mediawiki/php-1.33.0-wmf.22/includes/jobqueue/JobQueueDB.php(312): JobQueueDB::extractBlob(string)
#2 /srv/mediawiki/php-1.33.0-wmf.22/includes/jobqueue/JobQueue.php(377): JobQueueDB->doPop()
#3 /srv/mediawiki/php-1.33.0-wmf.22/includes/jobqueue/JobQueueGroup.php(260): JobQueue->pop()
#4 /srv/mediawiki/php-1.33.0-wmf.22/includes/jobqueue/JobRunner.php(167): JobQueueGroup->pop(integer, integer, array)
#5 /srv/mediawiki/php-1.33.0-wmf.22/maintenance/runJobs.php(90): JobRunner->run(array)
#6 /srv/mediawiki/php-1.33.0-wmf.22/maintenance/doMaintenance.php(94): RunJobs->execute()
#7 /srv/mediawiki/php-1.33.0-wmf.22/maintenance/runJobs.php(126): include(string)

Impact

Whichever jobs these represented cannot be run; this means some jobs (e.g. e-mails, notifications, or derived data updates like category membership, page links, etc.) are not being processed on wikitech.wikimedia.org.

Notes

This appears to be a new regression in 1.33.0-wmf.22; there were no reports of this from before the branch went out.

Krinkle renamed this task from Notice: Unable to unserialize job_params in some CirrusSearch jobs (when using JobQueueDB) to Fatal "cannot perform this operation with arrays" from CirrusSearch/ElasticaWrite (using JobQueueDB). Mar 30 2019, 3:23 AM
Krinkle added subscribers: EBernhardson, debt, GTirloni.

Still seen. Causing some search jobs to fail for wikitech.wikimedia.org.

error
[c032e62f71eb06fbe34c1b7a] /srv/mediawiki/multiversion/MWScript.php   PHP Fatal Error from line 79 of /srv/mediawiki/php-1.33.0-wmf.22/extensions/CirrusSearch/includes/Job/ElasticaWrite.php: Invalid operand type was used: cannot perform this operation with arrays
trace
#0 [internal function]: MWExceptionHandler::handleFatalError()
#1 {main}

The two available fixes are a complete rewrite of the cirrussearch indexing retry pipeline, or changing the job queue to use a non-size limited field type [..]

Those are two ways to actually build the behaviour that the code currently pretends exists. That's cool, but the immediate fix is to not produce a fatal error.

A fatal error signifies that the code is broken, raises our error levels, and may abort a deployment or trigger Ops pages.

In this case, however, it is known that this ability doesn't exist. Until this ability exists, the code either needs to be disabled (e.g. not deployed on Wikitech), or the code needs to handle this error and respond in some way. E.g. avoid queuing updates of this type or this size (possibly configurable), or run them differently, or try it as today and then catch/suppress the failure, maybe logging a warning in its stead.
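
A rough sketch of that last option (catch the failure and log a warning instead of fataling); hypothetical code, not the actual ElasticaWrite implementation:

// Hypothetical guard; the real job code is structured differently.
function runElasticaWriteJob( $params ): bool {
    if ( !is_array( $params ) ) {
        // The blob was truncated and unserialize() returned false; the update
        // is lost either way, but we degrade to a warning instead of a fatal.
        trigger_error( 'ElasticaWrite job has unusable params (truncated blob?)', E_USER_WARNING );
        return false;
    }
    // ... perform the write as today ...
    return true;
}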

Until this ability exists, the code either needs to be disabled (e.g. not deployed on Wikitech), or the code needs to handle this error and respond in some way. E.g. avoid queuing updates of this type or this size (possibly configurable), or run them differently,

What you just described is option 1: rewrite the indexing retry pipeline. If you think turning off CirrusSearch on wikitech is the best alternative, I can float that to the wikitech list, but perhaps unsurprisingly I don't prefer that option.

E.g. avoid queuing updates of this type or this size (possibly configurable), or run them differently, or to try it as today and then catch/suppress the failure - maybe logging a warning in its stead.

IMO the JobQueue should raise an error if it's not able to save the message correctly. Since the queue owns the way the message is serialized, it's hard for an extension to determine what the actual size of the stored message will be.
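
For example, a size check at push time might look roughly like this (hypothetical code, not the real JobQueueDB; the 64 KiB limit matches a plain MySQL BLOB column):

// Hypothetical enqueue-time check; the constant and function names are illustrative.
const MAX_JOB_BLOB_SIZE = 65535; // MySQL BLOB limit that job_params was hitting

function serializeJobParams( array $params ): string {
    $blob = serialize( $params );
    if ( strlen( $blob ) > MAX_JOB_BLOB_SIZE ) {
        // Fail loudly at enqueue time instead of silently truncating and
        // producing an unreadable job later.
        throw new RuntimeException(
            'Serialized job params are ' . strlen( $blob ) . ' bytes, over the storage limit'
        );
    }
    return $blob;
}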

Change 500481 had a related patch set uploaded (by EBernhardson; owner: EBernhardson):
[mediawiki/core@master] Change job table params from blob to mediumblob

https://gerrit.wikimedia.org/r/500481

Change 500481 merged by jenkins-bot:
[mediawiki/core@master] Change job table params from blob to mediumblob

https://gerrit.wikimedia.org/r/500481

mmodell changed the subtype of this task from "Task" to "Production Error". Aug 28 2019, 11:11 PM