These are the current datasources that ORES sends over the wire from uwsgi to celery:
{'datasource.revision.user.info.doc': {'editcount': 1247, 'registration': None, 'name': 'E.B.', 'userid': 67, 'gender': 'unknown', 'groups': ['*', 'user', 'autoconfirmed']}, 'datasource.revision.user.id': 67, 'datasource.revision.parent.text': 'lots of text', 'datasource.revision.user.info.registration': None, 'datasource.revision.id': 4569, 'datasource.revision.user.info.groups': {'*', 'user', 'autoconfirmed'}, 'datasource.extractor.dependents': {<datasource.revision.user.id>, <datasource.revision.parent.text>, <datasource.revision.user.info.registration>, <datasource.revision.user.info.groups>, <datasource.revision.comment>, <datasource.revision.text>, <datasource.revision.timestamp>, <datasource.revision.page.namespace.id>}, 'datasource.revision.doc': {'slots': {'main': {'*': 'lots of text', 'contentformat': 'text/x-wiki', 'contentmodel': 'wikitext'}}, 'revid': 4569, 'size': 6192, 'userid': 67, 'page': {'pageid': 105, 'ns': 0, 'title': 'Književnost'}, 'comment': '', 'parentid': 4562, 'user': 'E.B.', 'timestamp': '2004-08-22T06:12:32Z'}, 'datasource.revision.text': 'lots of text', 'datasource.revision.parent.doc': {'slots': {'main': {'*': 'lots of text', 'contentformat': 'text/x-wiki', 'contentmodel': 'wikitext'}}, 'revid': 4562, 'size': 6196, 'userid': 67, 'page': {'pageid': 105, 'ns': 0, 'title': 'Književnost'}, 'comment': 'sintaksa, gramatika...', 'parentid': 4560, 'user': 'E.B.', 'timestamp': '2004-08-22T05:56:51Z'}, 'datasource.revision.timestamp': Timestamp('2004-08-22T06:12:32Z'), 'datasource.revision.page.namespace.id': 0, 'datasource.revision.comment': ''}
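I replaced the actual revision text with "lots of text". As you can see, both the revision text and the parent revision text are duplicated: each appears once inside its doc (under slots, main, '*') and again as a separate datasource (datasource.revision.text and datasource.revision.parent.text). Fixing this will free up redis capacity and save some cute polar bears from dying.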
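
One possible direction, as a rough sketch only: drop the derived text datasources before sending whenever the doc they come from is already in the payload, and let the celery side re-extract them. Everything here is an assumption for illustration: the `DERIVED_FROM` mapping, the `prune_derived` and `payload_size` helpers, and the use of plain string keys (the real cache is keyed differently, as the dump above shows). `json` is used only to get a rough size estimate, not because it is the serializer in use.

```python
import json

# Hypothetical mapping: derived datasource -> the doc it can be rebuilt from.
DERIVED_FROM = {
    "datasource.revision.text": "datasource.revision.doc",
    "datasource.revision.parent.text": "datasource.revision.parent.doc",
}


def prune_derived(cache, derived_from=DERIVED_FROM):
    """Return a copy of cache without entries whose source doc is also present."""
    return {
        key: value
        for key, value in cache.items()
        # Keys with no known source map to None, which is never a cache key,
        # so they are always kept.
        if derived_from.get(key) not in cache
    }


def payload_size(cache):
    """Rough size estimate in characters (JSON is used only for measuring)."""
    return len(json.dumps(cache))


# Toy payload standing in for the dump above.
text = "x" * 10000
parent_text = "y" * 10000
cache = {
    "datasource.revision.id": 4569,
    "datasource.revision.doc": {"revid": 4569, "slots": {"main": {"*": text}}},
    "datasource.revision.text": text,
    "datasource.revision.parent.doc": {"revid": 4562, "slots": {"main": {"*": parent_text}}},
    "datasource.revision.parent.text": parent_text,
}

pruned = prune_derived(cache)
print(payload_size(cache), payload_size(pruned))
```

In this toy payload the pruned version is roughly half the size, since each text is kept only inside its doc. The receiving side would need to rebuild the dropped datasources from the docs, which this sketch does not cover.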