T315510 will start a maintenance script to populate the talk page comment database.
This task covers analyzing that database to learn what percentage of the comments stored in it are duplicates.
Knowing this percentage will let us estimate the likelihood that someone tapping/clicking on a permalink will be directed to Special:GoToComment rather than taken directly to the comment they expect to see.
With that probability in hand, we'll be able to decide whether any adjustments need to be made to:
- A) How we generate permalinks, in order to lower the rate of duplicates
- B) How the user experience looks/functions to help people develop more accurate expectations for what is likely to occur when they tap a permalink
Requirements
- Once the talk page comment database contains a sufficiently large and representative number of comments, calculate the percentage of those comments that are duplicates of one another
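
As a rough illustration of the calculation in the requirement above, here is a minimal sketch. It assumes each stored comment can be reduced to a single key and that two comments count as duplicates when their keys match. The key format and the `duplicate_percentage` helper are hypothetical and do not reflect the actual database schema or how permalinks are resolved.

```python
from collections import Counter


def duplicate_percentage(comment_keys):
    """Return the percentage of comments whose key is shared with
    at least one other comment in the sample."""
    keys = list(comment_keys)
    if not keys:
        return 0.0
    counts = Counter(keys)
    # Every comment whose key occurs more than once is counted as a duplicate.
    duplicated = sum(n for n in counts.values() if n > 1)
    return 100.0 * duplicated / len(keys)


# Hypothetical sample: the first and third comments share a key.
sample = [
    "c-Alice-2022-08-01",
    "c-Bob-2022-08-02",
    "c-Alice-2022-08-01",
    "c-Carol-2022-08-03",
]
print(duplicate_percentage(sample))
```

If permalinks were followed uniformly at random across comments, this percentage would approximate the share of permalink taps that cannot resolve to a single comment. Real traffic is unlikely to be uniform, so treat it as a first estimate only.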
Open questions
- 1. When and how will we know the comment database holds a large and representative enough sample of comments for us to analyze its contents and draw conclusions from that analysis?
Done
- Answers to all Open questions are documented
- Requirements are met