Add terms of use to https://dumps.wikimedia.org/index.html and https://dumps.wikimedia.org/backup-index.html
Closed, Resolved, Public

Description

Requested change

Add a new line within the Privacy section below the privacy line:

The terms of use for Wikimedia Foundation websites.

Scope

Both the static index.html landing page and the backup-index.html page have to be adjusted.

index.html is Puppet-controlled.
backup-index.html is templated and generated as part of the dumps 1 process.
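
Because the same line has to land in two pages that are updated through different pathways, a small check can help confirm that each page ended up with the terms of use below the privacy line. The following is only a minimal sketch: the function name `terms_follow_privacy` is hypothetical, and it assumes the pages contain the literal words "privacy" and "terms of use" (case-insensitive), which may not match the final markup.

```python
def terms_follow_privacy(html: str) -> bool:
    """Return True if "terms of use" appears at or after the first "privacy" mention.

    Assumption: both phrases appear as plain text in the page source.
    This is a plain substring check, not an HTML parser.
    """
    text = html.lower()
    privacy_at = text.find("privacy")
    if privacy_at == -1:
        # No privacy section found, so the new line cannot be below it.
        return False
    return "terms of use" in text[privacy_at:]


if __name__ == "__main__":
    # Hypothetical page fragment, for illustration only.
    sample = (
        "<h2>Privacy</h2>"
        "<p>The privacy policy.</p>"
        "<p>The terms of use for Wikimedia Foundation websites.</p>"
    )
    print(terms_follow_privacy(sample))
```

To check the live pages, the decoded body of each URL (for example, fetched with `urllib.request.urlopen`) could be passed to the same function.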

Event Timeline

Mock up
{F68889201}

(updated)

The backup-index.html page is generated from the template file "download-index.html".

The file is generated in this dumps DAG on test_k8s:

https://airflow-test-k8s.wikimedia.org/dags/mediawiki_sql_xml_monitor/grid

with the config path:

/etc/dumps/confs/wikidump.conf.dumps:monitor

defined here:

https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/blob/main/test_k8s/dags/dumps/common.py?ref_type=heads#L37

(mediawiki-dumps-legacy-configs Ceph image)

Please also edit index.html directly in the dumps directory on the clouddumps1002 server to make the change visible as soon as possible:

ahoelzl@clouddumps1002:/srv/dumps/xmldatadumps/public$ ls -l index.html
-rw-r--r-- 1 root root 3866 Aug 27 14:30 index.html

Change #1199783 had a related patch set uploaded (by Xcollazo; author: Xcollazo):

[operations/puppet@production] dumps: Release the new MW Content File Export. Deprecate legacy XML dumps.

https://gerrit.wikimedia.org/r/1199783

Change #1200379 merged by CDanis:

[operations/puppet@production] dumps: Add Terms of use to the index page.

https://gerrit.wikimedia.org/r/1200379

Change #1200192 had a related patch set uploaded (by Xcollazo; author: Ahoelzl):

[operations/deployment-charts@master] Adding terms of use for download-index.html

https://gerrit.wikimedia.org/r/1200192

Ahoelzl renamed this task from "Add terms of use to https://dumps.wikimedia.org/index.html" to "Add terms of use to https://dumps.wikimedia.org/index.html and https://dumps.wikimedia.org/backup-index.html". (Edited Oct 31 2025, 4:53 PM)
Ahoelzl updated the task description.

Updated title and description to clarify the two different update pathways.

The https://dumps.wikimedia.org/index.html change is live.

Change #1200192 merged by Bking:

[operations/deployment-charts@master] Adding terms of use for download-index.html

https://gerrit.wikimedia.org/r/1200192

@BTullis or @brouberol: Would you be so kind as to push this mediawiki-dumps-legacy patch to prod?

Change #1199783 merged by Brouberol:

[operations/puppet@production] dumps: Release the new MW Content File Export. Deprecate legacy XML dumps.

https://gerrit.wikimedia.org/r/1199783