Page MenuHomePhabricator

pagegenerators: follow redirects, intersection, exclusion
Open, Needs TriagePublic

Description

Originally from: http://sourceforge.net/p/pywikipediabot/patches/625/
Reported by: andreasjs
Created on: 2013-08-24 21:57:56.794000
Subject: Pagegenerator: follow redirects, intersection, exclusion
Original description:
I added three new arguments:

-followredirects  
Used with other arguments that specify a set of pages. 
                  If a specified page is a redirect page, work on its 
                  target page.

-intersecting     
Argument to be used between two other arguments. 
                  Work only on pages normally specified by both the
                  previous and the next argument.

-excluding        
Argument to be used between two other arguments. 
                  Work only on pages normally specified by the
                  previous argument but not by the next argument.

For example, one could want to find the pages edited by a specific user that contain a certain keyword in a title.

A few other suggestions:
Exclude sections, even on files.
Compare pages via the Page.\_\_cmp\_\_ property to exclude duplicate pages instead of
u"%s:%s:%s" % (page._site.family.name, page._site.lang, page._title).
(more transparent and easier to maintain).


Version: unspecified
Severity: enhancement
See Also:
https://sourceforge.net/p/pywikipediabot/patches/625

Details

Reference
bz54537

Event Timeline

bzimport raised the priority of this task from to Needs Triage.Nov 22 2014, 2:09 AM
bzimport set Reference to bz54537.
Legoktm created this task.Sep 24 2013, 10:03 PM

Created attachment 13365
Copy of attachment from http://sourceforge.net/p/pywikipediabot/patches/625/attachment/pagegenerators.py.diff

Attached:

Patch does not apply cleanly to either core or compat

jayvdb added a comment.Aug 5 2014, 1:22 AM

excluding would be very helpful to workaround bugs, where one page is causing a problem, such as bug 69133.

jayvdb added a comment.Nov 6 2014, 3:50 PM

Patch is by Andreas, if I am understanding the old sf.net history correctly.

Mpaa is creating 'intersecting' with https://gerrit.wikimedia.org/r/#/c/170832/

jayvdb updated the task description. (Show Details)Aug 15 2015, 3:55 AM
jayvdb set Security to None.
Restricted Application added a subscriber: Aklapper. · View Herald TranscriptAug 15 2015, 3:55 AM
Restricted Application added a subscriber: Cyberpower678. · View Herald TranscriptMay 18 2018, 8:38 AM
Xqt removed jayvdb as the assignee of this task.Tue, Sep 24, 3:53 PM