Need Wikisource specific edit counter for Proofread and validation work. There are no edit counter for Wikisource specific job. If we want to find out the edit count of specific user in a specific time span, how many pages (Page:) have done as "Without text", "Problematic" "Proofread" or "Validate". I think this could be implemented at x-tool very easily.
Description
| Status | Subtype | Assigned | Task | ||
|---|---|---|---|---|---|
| In Progress | Alien333 | T173012 Need Wikisource specific counter for "Without text", "Problematic" "Proofread" or "Validate" work. | |||
| Open | None | T172408 Implement way of querying Proofread Page Status of a Page: (or revision) from the databases directly | |||
| Open | Feature | None | T290578 Use the change_tag table to store the proofreading quality level | ||
| Open | None | T289140 ProofreadPage: Enable change-tag status system on Wikisources |
Event Timeline
I have added a images with description about where we need the wikisource specific edit count.
@Samwilson Did you want to comment on this? From the looks of it, such a feature seems useful, but I wonder if it's right for the Edit Counter. Is this need specific to English Wikisource? Perhaps it makes sense to implement it as a separate tool, maybe even apart from XTools (which is meant to be cross-wiki)?
This would be a good statistic for any wiki with the ProofreadPage extension, so it could conceivably be part of XTools I think.
However, the data about page quality is currently not cheaply-queriable, as this data is stored in the page text. T172408 will fix this. Then we'd be able to get the count of a user's revisions and what quality level they were changed to (i.e. would also exclude those revisions in which the user didn't change the quality).
So, my thought is that this could work, after the above changes are made to ProofreadPage, and would be nice to have in XTools, but is blocked for now.
@Tpt what do you think?
@Nivas10798: Removing task assignee as this open task has been assigned for more than two years - See the email sent to task assignee on October 11th.
Please assign this task to yourself again if you still realistically [plan to] work on this task - it would be welcome! :)
If this task has been resolved in the meantime, or should not be worked on by anybody ("declined"), please update its task status via "Add Action… 🡒 Change Status".
Also see https://www.mediawiki.org/wiki/Bug_management/Assignee_cleanup for tips how to best manage your individual work in Phabricator. Thanks!
On the XTools side: while querying page status doesn't seem possible right now, we could at least use the same tag lists we'll soon for the last 5k edits (added with T342402), to just get the proportions of edits that set to a certain status using proofreadpage-quality{0,1,2,3,4}/(sum of all proofreadpage-quality[0-4]).
As the revision tags have only been there since July 2021, we should add a footnote to that effect.
I suppose this would be a pie chart at the end of general_statistics. Ought to get hidden when there are no proofreadpage-quality[0-4] edits.
Did what said (proportions with tags) in PR #551. That's about as far as we'll get in the current state of things.
Hang on—all PRP pages (including old ones) have a page_props entry with pp_propname = "proofread_page_quality_level". Getting that isn't too costly, too. So in the end, we actually can count page status. I'm surprised I haven't seen it mentioned yet (perhaps it has/had other issues). Some things we would/could want in this vein:
- PagesCreated - get a pagestatus column, get a pie chart of the status of pages created by user. I'd say we should def get that. Code: just a join.
- EditCounter - a pie chart somewhere (?) of the status of pages edited by the user. We probably want that (and maybe we want to get the number of edits by quality status, as opposed to number of pages, but IDK). Code: something like
SELECT pp_value, COUNT(pp_value) AS `count` FROM page_props JOIN ( SELECT DISTINCT(rev_page) FROM revision_userindex WHERE rev_actor = :actorId ) a ON rev_page = pp_page WHERE pp_propname = "proofread_page_quality_level" GROUP BY pp_value
- TopEdits - adding a column, getting a pie chart, here definitely weighed by number of edits. I'm not sure to what extent we want that here, and to what extent we'd be duplicating what we'd put in EC (or, the other way, would EC be duplicating what TE'd do?). The distinction would be that in EC, not counting edits, we'd go substantially faster; whereas TE would get the complete but probably slower edit counts. Code: just a join.
In the end: added column and pie chart in PagesCreated, just kept the change tag pie chart for EditCounter, added column (but not pie chart) to TopEdits (all still in same PR, see there for details).
