enwiki dump does not contain data for "Authority control"
Closed, Invalid (Public)

Description

After downloading enwiki-20160601-pages-articles-multistream.xml.bz2, I found that the data for "Authority control" is empty in the dump but filled in on the corresponding pages on en.wikipedia.org. For example:

For the article Anarchism in the dump I find:

<title>Anarchism</title>
<ns>0</ns>
<id>12</id>
<revision>
<id>721573764</id>
<parentid>719202660</parentid>
<timestamp>2016-05-22T19:25:12Z</timestamp>
[...]
--&gt;
{{Anarchism}}
{{Philosophy topics}}
{{Political culture}}
{{Political ideologies}}
{{Social and political philosophy}}
{{Aspects of Capitalism}}

{{Portal bar|Anarchism|Social and political philosophy|Social movements}}

{{Good article}}

{{Authority control}} <------------- No data in dump, but on en.wikipedia.org web page (see below)

[[Category:Anarchism| ]]

[[Category:Political culture]]

On the web page, the Authority control box is filled in:
[...]
BlackFlagSymbol.svg Anarchism portal P derecho.svg Social and political philosophy portal Peace sign.svg Social movements portal
Authority control
LCCN: sh85004812 GND: 4001887-8 HDS: 17399 NKC: ph118417
Categories:
[...]

In the dump for the German Wikipedia, the equivalent template "Normdaten" is filled in, just as on the web page:

<title>Alan Smithee</title>
[...]

Referenzen

&lt;references /&gt;

{{Normdaten|TYP=p|GND=123396956}} <--------- O.K. and identical to de.wikipedia.org

[[Kategorie:Fiktive Person|Smithee, Alan]]

Event Timeline

Recommend closing as invalid. The information that the user wants is pulled from Wikidata via a module; see https://en.wikipedia.org/wiki/Template:Authority_control

Is the template itself in enwiki-20160601-pages-articles-multistream.xml.bz2? If it is, then you should look at whether Wikidata is used as the source for the de page or whether the data is in the wikitext. If the template itself is missing entirely from the pages-articles dump, then the next step would be to see whether the stub for it is present in the stubs dump.
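
One way to run that check is to stream through the dump and look for the template's page title. The following is only a minimal sketch: the helper name `find_page_text` is made up for illustration, and it assumes the usual MediaWiki export layout in which each `<page>` holds a `<title>` followed by a `<revision>` containing `<text>`.

```python
import bz2
import xml.etree.ElementTree as ET


def local(tag):
    """Drop the XML namespace prefix, e.g. '{http://...}title' -> 'title'."""
    return tag.rsplit("}", 1)[-1]


def find_page_text(stream, wanted_title):
    """Return the wikitext of the page titled `wanted_title`, or None if absent.

    `stream` is any binary file-like object holding MediaWiki export XML.
    (Hypothetical helper, not part of any dump tooling.)
    """
    title = None
    for _event, elem in ET.iterparse(stream, events=("end",)):
        name = local(elem.tag)
        if name == "title":
            title = elem.text
        elif name == "text" and title == wanted_title:
            return elem.text or ""
        elif name == "page":
            elem.clear()  # free memory held by pages we have already passed
    return None


# Usage against the real dump (slow, since it scans the whole file):
# with bz2.open("enwiki-20160601-pages-articles-multistream.xml.bz2", "rb") as f:
#     print(find_page_text(f, "Template:Authority control") is not None)
```

If the function returns text for "Template:Authority control", the template is in the dump, and its wikitext shows whether the values come from parameters or from a module.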

Looks like the user isn't comparing apples to apples: they are comparing the HTML output to the database dump. See my comment above.

The dump for the German Wikipedia contains the data for the logically identical template "Normdaten", so I thought it should be the same for the "en" dump.

What @Betacommand said above...

{{Authority Control}} was deprecated, so the actual information pulls from / lives on Wikidata.

If you do a SPARQL query you might have better luck.

Erika
User:BrillLyle
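
For anyone who wants to try the SPARQL route, here is a rough sketch of what such a query could look like. It assumes the Wikidata Query Service at https://query.wikidata.org/sparql with its predefined `schema:` and `wdt:` prefixes, and it only asks for three identifiers (P244 for LCCN, P227 for GND, P214 for VIAF). The function name `authority_query` is made up for illustration, and the block only builds the query text; it does not send it.

```python
def authority_query(title, site="https://en.wikipedia.org/", lang="en"):
    """Build a SPARQL query for the authority IDs of a Wikipedia article.

    Hypothetical helper: looks up the Wikidata item through its sitelink,
    then optionally fetches LCCN (P244), GND (P227) and VIAF (P214).
    """
    # Escape backslashes and double quotes so the title is a valid literal.
    safe = title.replace("\\", "\\\\").replace('"', '\\"')
    return f"""SELECT ?item ?lccn ?gnd ?viaf WHERE {{
  ?article schema:about ?item ;
           schema:isPartOf <{site}> ;
           schema:name "{safe}"@{lang} .
  OPTIONAL {{ ?item wdt:P244 ?lccn . }}
  OPTIONAL {{ ?item wdt:P227 ?gnd . }}
  OPTIONAL {{ ?item wdt:P214 ?viaf . }}
}}"""


# Paste the result into the query service UI, or send it as the `query`
# parameter of a GET request to the endpoint named above.
print(authority_query("Anarchism"))
```

The OPTIONAL clauses keep the item in the results even when one of the identifiers is missing on Wikidata.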

@BrillLyle Thanks for your patience. I just want to understand.
I found that some data types in the template are deprecated, but nothing more.

@ArielGlenn: I have found some pages with filled-in Authority control templates in the dump, e.g.
<title>Clare Maclean</title>
[...]
{{Authority control|VIAF=21412343}}

If you look at the source of the page (view source) for Anarchism, you will see

{{Authority control}}

near the bottom.

If you look at the source of the page for Clare Maclean, you will see

{{Authority control|VIAF=21412343}}

near the bottom.

The contents of each page show up in exactly this fashion in the dumps. The template itself is also contained in the dumps, so I think that covers everything. I'm going to close this as invalid (that is, it's not actually a bug); if you have any more questions about the issue, feel free to respond here on the ticket.
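
To see the difference between the two pages programmatically, a small sketch like the one below pulls the parameters out of the template call in a page's wikitext. The name `authority_control_params` is made up for illustration, and the regular expression assumes a simple call with no nested templates inside it.

```python
import re

# Matches {{Authority control}} or {{Authority control|K=V|...}} (no nesting).
AC_CALL = re.compile(
    r"\{\{\s*Authority[ _]control\s*((?:\|[^{}]*)?)\}\}", re.IGNORECASE
)


def authority_control_params(wikitext):
    """Return the template's named parameters as a dict.

    Returns {} for a bare {{Authority control}} and None when the
    template is not present at all. (Hypothetical helper.)
    """
    match = AC_CALL.search(wikitext)
    if match is None:
        return None
    params = {}
    # group(1) is "" or "|K=V|K2=V2"; the first split element is always empty.
    for part in match.group(1).split("|")[1:]:
        key, sep, value = part.partition("=")
        if sep:
            params[key.strip()] = value.strip()
    return params


print(authority_control_params("{{Authority control}}"))
print(authority_control_params("{{Authority control|VIAF=21412343}}"))
```

An empty result for Anarchism means the identifiers are not stored in the wikitext, so they cannot appear in the pages-articles dump; for Clare Maclean the VIAF value is in the wikitext and therefore in the dump.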