Page MenuHomePhabricator

Consider alternative "url" parameters
Closed, DeclinedPublic

Description

Our reference extraction script keeps (presumably citation) templates that meet the following criteria:

  1. Their name is listed in our list of citation templates, and
  2. The citation template has a value in the "url" field.

Otherwise, template is simply ignored.

However, some templates have aliases for the "url" parameter. For example, the French template "Lien web" has equivalent "url texte" and "lire en ligne" parameters.

Therefore, instead of looking for a URL in the "url" parameter alone to decide whether to keep or ignore a template, consider using the parameters declared in a "url" column in our list of citation templates.

Event Timeline

@Nidiah: Hi! This task has been assigned to you a while ago. Could you maybe share an update? Do you still plan to work on this task, or do you need any help? Thanks!

Hi @Aklapper, sorry for the delayed response, I was out of town for a couple of weeks. Thank you for your reminder and for the link!

Since we already have a good number of URLs (~461k), I decided to focus on moving forward with the processing. But I will certainly get back to this task when I have the time!

Our script is using this column from the list of citation templates already.

Pending confirmation that the parameters in this column are OK (no unrelated parameters) @Gimenadelrioriande.