Data need: Explore range of article revision comparisons
Closed, ResolvedPublic2 Estimated Story Points
Actions

Description

For the revision slider it could be useful to know what is actually done when comparing revisions. Thus I/we could optimize the design for the most common tasks instead of possibly going into a direction that does not cover frequent usecases.

Data needs:

How many steps back in time do users go when comparing versions? (measured in current state -n revisions)
How big is the difference between compared versions (measured in revisions)
Is it common to jump between versions back and forth in a short time or do users find the revisions quickly without iterating and adjusting often?

Data Format
All data makes the most sense if given in[[ https://en.wikipedia.org/wiki/Quantile#Specialized_quantiles | Percentiles/Deciles ]], so we can see the distribution.

Skills
Could be delivered as Excel/Calc, csv, RDataFrame

Coversation

On a diff you have a "previous" and "next" edit button.
Would tracking how often these were used be of use?

Yes, that is a reasonable proxy for what interests me, I suppose.

Details

Subject	Repo	Branch	Lines +/-
Add dewiki_diffstats to wmgMonologChannels	operations/mediawiki-config	master	+1 -0
Make dewiki_diffstats debug instead of info	mediawiki/extensions/WikimediaEvents	master	+1 -1
Track dewiki diff page usage	mediawiki/extensions/WikimediaEvents	master	+55 -0

Customize query in gerrit

Related Objects
Search...

		Status	Subtype	Assigned	Task
		Resolved		Addshore	T135751 Workable CSV for Data need: Explore range of article revision comparisons
		Resolved		Addshore	T134861 Data need: Explore range of article revision comparisons

Event Timeline

Jan_Dittrich created this task.May 10 2016, 10:26 AM

Restricted Application added subscribers: Zppix, Aklapper. · View Herald TranscriptMay 10 2016, 10:26 AM

Tobi_WMDE_SW moved this task from Incoming to Backlog on the Revision-Slider board.May 11 2016, 11:52 AM

Lea_WMDE moved this task from Incoming to Revision Slider on the TCB-Team (now WMDE-TechWish) board.May 11 2016, 12:22 PM

Change 287647 had a related patch set uploaded (by Addshore):
Track dewiki diff page usage

https://gerrit.wikimedia.org/r/287647

Change 288158 had a related patch set uploaded (by Addshore):
Don't log dewiki_diffstats to logstash

https://gerrit.wikimedia.org/r/288158

The above change would provide a JSON log of data that we could then work with.
A blob would be logged every time the diff view was loaded on dewiki.
The blob would contain:

current timestamp
revision ids for the diff
page id
Total number of revisions of the page
Number of revisions between the compared revisions
Number of revisions back in time of the latest revision being compared.

This would likely need review / approval from someone at the WMF to ensure this is okay and not going to stress anything too much.

Just a summary of the data Jan and I are interested in:
For the revision view as is (without the new revision slider):

date and position of older revision (as in "the nth revision")
date and position of younger revision
total number of revisions of this article
maybe: article id

So the patch above collects the oldid and newid, from this we can get the timestamp of each and the position in history.
It also directly collects the number of revisions of the page.
Article id can be collected using either of the revision ids, but it is also included in the patch

• Nuria moved this task from Incoming to Radar on the Analytics board.May 16 2016, 4:47 PM

Lea_WMDE added a project: TCB-Team-Sprint-2016-05-19.May 17 2016, 2:49 PM

Tobi_WMDE_SW moved this task from Proposed to Review on the TCB-Team-Sprint-2016-05-19 board.May 19 2016, 2:32 PM

Tobi_WMDE_SW set the point value for this task to 2.

Addshore added a project: WMDE-Analytics-Engineering.May 19 2016, 2:35 PM

Addshore moved this task from Incoming to Doing on the WMDE-Analytics-Engineering board.

Tobi_WMDE_SW triaged this task as Medium priority.May 19 2016, 4:00 PM

Lea_WMDE moved this task from Backlog to Doing on the Revision-Slider board.May 19 2016, 4:10 PM

Lea_WMDE mentioned this in T135751: Workable CSV for Data need: Explore range of article revision comparisons .May 19 2016, 4:15 PM

Tobi_WMDE_SW added a project: TCB-Team-Sprint-2016-06-02.Jun 2 2016, 1:28 PM

Tobi_WMDE_SW moved this task from Proposed to Review on the TCB-Team-Sprint-2016-06-02 board.

Lea_WMDE renamed this task from Data need: User Behaviour when comparing article revisions to Data need: Data need: Explore range of article revision comparisons .Jun 6 2016, 3:50 PM

Lea_WMDE updated the task description. (Show Details)

Tobi_WMDE_SW added a project: TCB-Team-Sprint-2016-06-16.Jun 16 2016, 1:49 PM

Tobi_WMDE_SW moved this task from Proposed to Review on the TCB-Team-Sprint-2016-06-16 board.

WMDE-leszek renamed this task from Data need: Data need: Explore range of article revision comparisons to Data need: Explore range of article revision comparisons .Jun 27 2016, 11:35 AM

Tobi_WMDE_SW added a project: TCB-Team-Sprint-2016-06-29.Jun 29 2016, 9:43 AM

Tobi_WMDE_SW moved this task from Proposed to Review on the TCB-Team-Sprint-2016-06-29 board.Jun 29 2016, 9:44 AM

Addshore added a parent task: T135751: Workable CSV for Data need: Explore range of article revision comparisons .Jul 11 2016, 9:07 AM

Addshore added a project: User-Addshore.Jul 12 2016, 1:01 PM

Addshore moved this task from Unsorted 💣 to Active 🚁 on the User-Addshore board.Jul 12 2016, 1:04 PM

WMDE-Fisch added a project: TCB-Team-Sprint-2016-07-14.Jul 14 2016, 1:57 PM

WMDE-Fisch moved this task from Proposed to Review on the TCB-Team-Sprint-2016-07-14 board.Jul 14 2016, 2:02 PM

Change 287647 merged by jenkins-bot:
Track dewiki diff page usage

https://gerrit.wikimedia.org/r/287647

ReleaseTaggerBot added a project: MW-1.28-release (WMF-deploy-2016-07-19_(1.28.0-wmf.11)).Jul 14 2016, 6:00 PM

Addshore moved this task from Active 🚁 to Back Burner 🏛️ on the User-Addshore board.Jul 19 2016, 12:18 PM

Change 288158 abandoned by Addshore:
Don't log dewiki_diffstats to logstash

https://gerrit.wikimedia.org/r/288158

Change 299744 had a related patch set (by Addshore) published:
Make dewiki_diffstats debug instead of info

https://gerrit.wikimedia.org/r/299744

Change 299744 merged by jenkins-bot:
Make dewiki_diffstats debug instead of info

https://gerrit.wikimedia.org/r/299744

ReleaseTaggerBot added a project: MW-1.28-release (WMF-deploy-2016-07-26_(1.28.0-wmf.12)).Jul 19 2016, 8:00 PM

Addshore moved this task from Review to Done on the TCB-Team-Sprint-2016-07-14 board.Jul 20 2016, 9:24 AM

Change 288158 restored by Addshore:
Don't log dewiki_diffstats to logstash

https://gerrit.wikimedia.org/r/288158

Addshore moved this task from Doing to Done on the Revision-Slider board.Jul 25 2016, 8:23 PM

Addshore moved this task from Back Burner 🏛️ to Closing ✔️ on the User-Addshore board.Jul 25 2016, 8:28 PM

Change 288158 merged by jenkins-bot:
Add dewiki_diffstats to wmgMonologChannels

https://gerrit.wikimedia.org/r/288158

Changed merged and the logs are now accessible on fluorine

Addshore moved this task from Doing to Done on the WMDE-Analytics-Engineering board.Jul 25 2016, 11:10 PM

Mentioned in SAL [2016-07-25T23:11:15Z] <dereckson@tin> Synchronized wmf-config/InitialiseSettings.php: Add dewiki_diffstats to wmgMonologChannels ([[Gerrit:288158]], T134861) (duration: 00m 25s)

Aklapper edited projects, added Analytics-Radar; removed MW-1.28-release (WMF-deploy-2016-07-26_(1.28.0-wmf.12)), Analytics.Jun 10 2020, 6:44 AM

thiemowmde removed a project: TCB-Team (now WMDE-TechWish).Jan 7 2022, 3:15 PM

Data need: Explore range of article revision comparisons Closed, ResolvedPublic2 Estimated Story PointsActions

Description

Details

Related ObjectsSearch...

Event Timeline

Data need: Explore range of article revision comparisons
Closed, ResolvedPublic2 Estimated Story Points
Actions

Related Objects
Search...