Enable non-file page creation logs on Wikimedia Commons
Open, Stalled, Needs Triage, Public

Description

Please enable visible page creation logs on Wikimedia Commons. Based on current statistics, and assuming the logs were implemented retroactively, excluding the file namespace (namespace 6) from such logs would reduce the size of the resulting table by about 75% (from ~100M to ~25M). Such logs are not necessary in the file namespace because we already have upload logs for files. The previous task, which covered most projects, was T12331. We have consensus for this at https://commons.wikimedia.org/wiki/Commons:Village_pump#Page_creation_logs.

Event Timeline

taavi changed the task status from Open to Stalled. Aug 12 2021, 3:16 PM
taavi moved this task from Backlog to Blocked on development on the Wikimedia-Site-requests board.
taavi subscribed.

Currently the setting for this ($wgPageCreationLog) can't be adjusted per namespace; unfortunately, it is per-wiki only.

Also, I forgot to mention this yesterday: this would definitely need DBA approval.

For context, the logging table was the cause of a large commonswiki outage, T277416 (public task T277417), so people are quite careful about it. That is why I suggested enabling it only for non-bot/non-image-ns traffic as a potential compromise :-(.

I believe that for Commons, bot-created pages are fairly low in frequency. This is more a concern on Wikidata.

Skipping it on bot-created pages actually seems problematic, imho, from a user perspective. Once created, those pages are like any other page, so the page log would be much less useful if a page were missing its creation event just because it happened to be first edited by a bot.

However, I agree that file pages specifically seem like a case where this could be avoided. That applies to MediaWiki more generally, though, and not to Wikimedia Commons in particular. As such, perhaps it would make sense to codify natively in MediaWiki that for NS_FILE we don't emit this event and instead assume it to be an upload (manually created pages there are very rare and usually an accident, and for catching/reviewing those we already have recent changes).

[...]
As such, perhaps it would make sense to codify that in MediaWiki natively that for NS_FILE we don't emit this event and instead assume it to be an upload (manually created pages there are very rare and usually an accident, and for catching/reviewing that we have recent changes already).

Would it be possible to only log filespace creations not associated with uploads? From a user's perspective, this wouldn't be too unexpected, as Special:Newpages already does that in the file namespace.
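To make the rule proposed here concrete, the following is a minimal Python sketch of the decision only. It is not MediaWiki code: the function name `should_log_page_creation` and its parameters are hypothetical, and the only value taken from this task is the file namespace number (6).

```python
# Hypothetical model of the proposed rule; not MediaWiki's implementation.
NS_FILE = 6  # file namespace number, as given in the task description


def should_log_page_creation(
    namespace: int,
    created_by_upload: bool,
    creation_log_enabled: bool = True,
) -> bool:
    """Return True if a page creation should get a creation-log entry.

    Assumed rule: when the (per-wiki) creation log is enabled, log every
    page creation except file-namespace pages that were created as part
    of an upload, since those already appear in the upload log.
    """
    if not creation_log_enabled:
        return False
    if namespace == NS_FILE and created_by_upload:
        return False
    return True
```

Under this sketch, a file page created by hand (no upload) would still be logged, while bot-created pages in any namespace are treated like any other creation.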

That would work for me. The consensus has now been archived to c:Commons:Village pump/Archive/2021/07#Page creation logs.