Unusually high page views on Chinese Wikipedia
Closed, Duplicate, Public

Description

The article 中華電信MOD has been on the top-read list for months in a row, usually in first place. Nobody knows exactly why.

By inspecting page view statistics, I discovered that the mobile access percentage of this page is also ridiculously high (99.7%).
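
The mobile share quoted above can be recomputed from per-access view counts. This is a minimal sketch, assuming the counts have already been fetched per access method (for example from the Wikimedia Pageviews API). The function name `mobile_share` and the sample numbers are hypothetical and are not data from this task.

```python
def mobile_share(views_by_access):
    """Return the percentage of views that came from mobile access methods.

    views_by_access maps an access method name to a view count. The keys
    "desktop", "mobile-web" and "mobile-app" follow the access values used
    by the Wikimedia Pageviews API.
    """
    total = sum(views_by_access.values())
    if total == 0:
        # Avoid dividing by zero when there are no views at all.
        return 0.0
    mobile = views_by_access.get("mobile-web", 0) + views_by_access.get("mobile-app", 0)
    return 100.0 * mobile / total


# Hypothetical counts, chosen only to illustrate the calculation.
sample = {"desktop": 30, "mobile-web": 9960, "mobile-app": 10}
print(f"{mobile_share(sample):.1f}% mobile")
```

A share this close to 100% stands out only relative to other articles on the same wiki, so a comparison against a few unrelated popular pages would be a reasonable next step.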

Having unrelated pages on the top-read list makes patrolling popular pages harder, so I would like to request an investigation into whether this is a software bug or some sort of planned mass viewing. If possible, please also correct the reported view count.

Event Timeline

Hi @MilkyDefer, thanks for taking the time to report this. Could you elaborate on what you expect from this task, and from whom? Thanks. :)

I think there are two possibilities:

  1. Pageviews tool is broken; or
  2. Someone is sending frequent page queries, which can easily turn into a DoS attack.

I would like the Pageviews developers to check whether there is a bug in the tool or whether this is a false positive. I also request that someone run a User-Agent sampling for this page so that we can find out whether possibility No. 2 is correct.

Several editors from the community, including admins, have expressed their concern about this. I am representing them.

If a separate UA sampling cannot be conducted, I would like to turn this into a feature request: for most-read pages, Pageviews should show the browser, OS, and hardware information of the reader group, provided WMF's privacy policy permits it.

In T269065#6659583, @MilkyDefer wrote:

I think there are two possibilities:

  1. Pageviews tool is broken; or
  2. Someone is sending frequent page queries, which can easily turn into a DoS attack.

See https://wikimedia.org/api/rest_v1/metrics/pageviews/top/zh.wikipedia/all-access/2020/11/30

      {
        "article": "中華電信MOD",
        "views": 32425,
        "rank": 3
      },
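
The entry above can be looked up programmatically once the response has been downloaded. This is a minimal sketch, assuming the top-pageviews response has the shape `{"items": [{"articles": [...]}]}`. The helper name `find_in_top` is hypothetical, and the payload below is trimmed to the single entry quoted in this task.

```python
def find_in_top(payload, title):
    """Return the entry for `title` from a top-pageviews payload, or None.

    Each entry is a dict with "article", "views" and "rank" keys.
    """
    for item in payload.get("items", []):
        for entry in item.get("articles", []):
            if entry.get("article") == title:
                return entry
    return None


# Trimmed payload: only the entry quoted in this task is included.
payload = {
    "items": [
        {"articles": [{"article": "中華電信MOD", "views": 32425, "rank": 3}]}
    ]
}

entry = find_in_top(payload, "中華電信MOD")
print(entry)
```

Running the same lookup over the responses for several consecutive days would show how long the article has stayed near the top.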

https://zh.wikipedia.org/api/rest_v1/feed/featured/2020/12/01

"mostread": {
    "date": "2020-11-30Z",
    "articles": [
      {
        "views": 32425,
        "rank": 3,
        "view_history": [
          {
            "date": "2020-11-26Z",
            "views": 36255
          },
          {
            "date": "2020-11-27Z",
            "views": 34462
          },
          {
            "date": "2020-11-28Z",
            "views": 35287
          },
          {
            "date": "2020-11-29Z",
            "views": 34254
          },
          {
            "date": "2020-11-30Z",
            "views": 32425
          }
        ],
        "type": "standard",
        "title": "中華電信MOD",
        "displaytitle": "中华电信MOD",
        "namespace": {
          "id": 0,
          "text": ""
        },
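
The five daily counts in the `view_history` above vary little from day to day. This is a minimal sketch that quantifies the spread with the coefficient of variation. Treating a flat series as a hint of automated traffic is an assumption for illustration, not a conclusion reached in this task, and the helper name is hypothetical.

```python
from statistics import mean, pstdev

# Daily views copied from the view_history fragment above.
view_history = [
    ("2020-11-26Z", 36255),
    ("2020-11-27Z", 34462),
    ("2020-11-28Z", 35287),
    ("2020-11-29Z", 34254),
    ("2020-11-30Z", 32425),
]


def coefficient_of_variation(counts):
    """Population standard deviation divided by the mean (0.0 if the mean is 0)."""
    avg = mean(counts)
    return pstdev(counts) / avg if avg else 0.0


counts = [views for _, views in view_history]
print(f"mean={mean(counts):.1f} cv={coefficient_of_variation(counts):.3f}")
```

Five days is a very small sample, so the figure is only useful alongside the same calculation for other top-read articles.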

Perhaps the pageview counting does not recognize some kind of bot traffic?

Milimetric triaged this task as High priority.
Milimetric moved this task from Incoming to Data Quality on the Analytics board.
Milimetric added a project: Analytics-Kanban.