Page MenuHomePhabricator

XML-Dumps should include namespacealiases
Closed, ResolvedPublic

Description

Author: fgwiki

Description:
Now XML-Dumps include only <namespaces>. They should also include <namespacealiases>, because it is a necessary information to work with the file.

There is no knowledge within a dump file which <namespacealiases> are in the system. To resolve this someone must have a online connection to the server to ask the api or ask the administrator.

i.e. The German Wiktionary changed 2009 the name space 104 from "WikiSaurus" to "Thesaurus" and made an namespacealias. Many links go today to WikiSaurus. In the system there is the alias and all is OK. But if you download the dump and after this have no internet connection / or you get the dump file per harddrive, you have no idea how to handle links to the namespace "WikiSaurus". Or with an offline-reader which import the dumpfile (like Wikitaxi). There are many links to this namespace and all articles to which the link should go are there. But they can not come together.

If there is the information of <namespacealiases> within the XML-Dump, the interpretation software can read this information and remap the namespace in their system.


Version: unspecified
Severity: enhancement

Details

Reference
bz34218
TitleReferenceAuthorSource BranchDest Branch
Enable TLS for connections to the aqs cassandra clusterrepos/data-engineering/airflow-dags!678btullisupdate_default_cassandra_tlsmain
Customize query in GitLab

Event Timeline

bzimport raised the priority of this task from to Low.Nov 22 2014, 12:12 AM
bzimport set Reference to bz34218.

jcsahnwaldt wrote:

Similar to bug 31955 and bug 21200.

jcsahnwaldt wrote:

Similar to bug 36178.

Change 338899 had a related patch set uploaded (by ArielGlenn):
add api job handler, config file in yaml, siteinfo props jobs

https://gerrit.wikimedia.org/r/338899

The changeset is ready to deploy and will go out for the next dump run before Mar 20th.

Change 338899 merged by ArielGlenn:
[operations/dumps] add api job handler, config file in yaml, siteinfo props jobs

https://gerrit.wikimedia.org/r/338899

These dumps appeared in the current dump run. Closing.