Humaniki Bug
Open, High | Public | BUG REPORT

Description

Hi! I have noted the following bugs:

Steps to Reproduce:

Actual Results:
API reachable but : ["Query returned 0 results"]

Expected Results:
Should show results similar to those that appear for the date 2022-02-21 (651 k biographies, 19.125% of women)

  1. Similar but not exactly the same

Steps to Reproduce:

Actual Results:
Total: 401 434
female: 75 877
female Percent: 18.901%
male: 325 155
male Percent: 80.998%

Expected Results:
Should show a higher number of biographies and a higher rate of women biographies, such as the results for 2022-02-21 (651 178 biographies, 19.125% of women)

Related Objects

Duplicates Merged Here
T305085: Humaniki Bug

Event Timeline

I see this as well. Could this be related to the:

Unreachable : "https://humaniki.wmcloud.org/api/v1/gender/gap/latest/gte_one_sitelink/properties?label_lang=en"

notice on the home page, and the main links on the home page "Gender by Country", "Gender by Wikimedia Project" showing a blank page when selected?

Envlh raised the priority of this task from High to Unbreak Now! (Apr 6 2022, 6:33 PM)
Envlh subscribed.

We're working on this (as volunteers, so resolution will not be immediate).

An unbreak now task should be one where everyone that can is dropping everything to fix it. Does this task really meet that criteria?

I assume by your comment that it's getting 0 WMF resources so the answer is probably not.

My understanding is that Phabricator can be used by projects not directly maintained by WMF. From the Humaniki project's point of view, this task has the highest priority because the tool is broken. At the moment, “everyone that can” are volunteers. I don't know if this disqualifies this task from having the priority Unbreak Now!, but feel free to lower the priority if that is the case.

Phab can be used by anyone within the movement. That makes sense.

notconfusing claimed this task.
notconfusing subscribed.

TL;DR: FIXED! This was indeed an out-of-disk-space issue.

The tricky part was that humaniki's main disk was not full; the network file system, where we store some temporary files, was. I have implemented a fix to delete those temporary files after 90 days.
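
For readers who want a picture of what such a cleanup can look like, here is a minimal sketch. It is not the code from the commits linked in this task. The function name `delete_old_files`, the use of file modification time as the age criterion, and the directory argument are all assumptions made for illustration.

```python
import time
from pathlib import Path

MAX_AGE_DAYS = 90  # retention period mentioned in the comment above


def delete_old_files(directory, max_age_days=MAX_AGE_DAYS, now=None):
    """Delete regular files under `directory` last modified more than
    `max_age_days` ago. Returns the list of deleted paths.

    Hypothetical helper: the real fix may select files differently.
    """
    now = time.time() if now is None else now
    cutoff = now - max_age_days * 86400  # seconds in a day
    deleted = []
    # Materialize the listing first so deletion does not disturb iteration.
    for path in list(Path(directory).rglob("*")):
        if path.is_file() and path.stat().st_mtime < cutoff:
            path.unlink()
            deleted.append(path)
    return deleted
```

A function like this would typically be called once per run (or from a scheduled job) with the path of the temporary directory on the network file system.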

In fact the main humaniki database still has 500GB remaining, which at the current rate should last at least another year.

I cleaned up the strange half-finished (½) fill that occurred when the disk ran out of space on 2022-02-28. I confirmed that the filling process can run again on 2022-02-28 and 2022-03-07. I believe, as humaniki runs once per day, we should slowly come back up to date within the week.

Need post-push code review/audit if there is a volunteer. Commits:
https://github.com/notconfusing/humaniki-schema/commit/7a023a0e3c5df8b9dbbb5a34731806449e54cd21
https://github.com/notconfusing/humaniki-schema/commit/ac490c5d52a07b7bd12e936597d9527e07c18e60

Remaining to-dos (logged in Phabricator):

  1. #TODO Catch the WDTK error java.io.IOException: No space left on device and handle it in the humaniki-schema orchestration layer by sending an email alert.
  2. #TODO Start thinning out the database to save space in the future. I believe a good retention policy would be, for instance, to keep only the 1st fill of the month for fills older than 1 year, and maybe 1 fill per quarter for fills older than 2 years. That way we may be able to make the remaining 500GB last us quite some time.
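
The retention idea in the second to-do can be sketched as follows. This is only an illustration of the proposed policy, not code from humaniki-schema. The function name `fills_to_keep`, the representation of fills as plain dates, and the 365/730-day cutoffs are assumptions.

```python
from datetime import date, timedelta


def fills_to_keep(fill_dates, today):
    """Return the sorted fill dates to retain under the proposed policy:
    - newer than 1 year: keep every fill
    - between 1 and 2 years old: keep the first fill of each month
    - older than 2 years: keep the first fill of each quarter
    """
    one_year_ago = today - timedelta(days=365)
    two_years_ago = today - timedelta(days=730)
    keep = set()
    seen_months = set()    # (year, month) already represented
    seen_quarters = set()  # (year, quarter index 0..3) already represented
    for d in sorted(set(fill_dates)):  # oldest first, so "first" = earliest
        if d > one_year_ago:
            keep.add(d)
        elif d > two_years_ago:
            key = (d.year, d.month)
            if key not in seen_months:
                seen_months.add(key)
                keep.add(d)
        else:
            key = (d.year, (d.month - 1) // 3)
            if key not in seen_quarters:
                seen_quarters.add(key)
                keep.add(d)
    return sorted(keep)
```

Any fill date not in the returned list would be a candidate for deletion. Computing the list to keep rather than the list to delete makes it easier to review the result before anything is removed.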

Following the message from notconfusing, I tried using Humaniki and still got the message API reachable but : ["Query returned 0 results"], so the problem does not look solved yet. I am reopening the task.

I thought I had fixed it, but some old code was still running after I made the push. I reset the state, so I hope it is now fixed. I will leave the ticket open until we feel more confident in stability.

Envlh lowered the priority of this task from Unbreak Now! to High (Apr 12 2022, 8:41 PM)

Lowering the priority as the main issue is now dealt with.