Page MenuHomePhabricator

Relax author and year match?
Open, LowPublic

Description

After T228666 we're missing some useful matches, for instance

{{cite journal | vauthors = Angeloni D, Lee JD, Johnson BE, Teh BT, Dean M, Lerman MI, Sterneck E | title = C306A single nucleotide polymorphism in the human CEBPD gene that maps at 8p11.1-p11.2 | journal = Molecular and Cellular Probes | volume = 15 | issue = 6 | pages = 395\u20137 | date = Dec 2001 | pmid = 11851384 | doi = 10.1006/mcpr.2001.0377 }}

does not match https://dissem.in/p/26445613/ aka https://www.iris.sssup.it/handle/11382/3170 because that record has a diffent year (January 2002 instead of December 2001, coming from ResearchGate) and because the template does not have the surname of the first author in a structured way.

It's not a tragedy but we might consider to relax the criteria a bit, e.g. by allowing a difference of one year (to account for delays between deposits/preprints and the official publication date) or by comparing the author names in a different way (difflib to compare arrays? check the presence of N surnames in the on-wiki citation?).