
Implement extracts in Pywikibot
Closed, Resolved | Public | Feature

Description

MediaWiki provides the extracts API module (prop=extracts). Pywikibot should support it too.

This module requires read rights
Parameters:

exchars             - How many characters to return, actual text returned might be slightly longer.
                      The value must be no less than 1
exsentences         - How many sentences to return
                      The value must be between 1 and 10
exlimit             - How many extracts to return
                      No more than 20 (20 for bots) allowed
                      Default: 1
exintro             - Return only content before the first section
explaintext         - Return extracts as plaintext instead of limited HTML
exsectionformat     - How to format sections in plaintext mode:
                       plain - No formatting
                       wiki - Wikitext-style formatting == like this ==
                       raw - This module's internal representation (section titles prefixed with <ASCII 1><ASCII 2><section level><ASCII 2><ASCII 1>)
                      One value: plain, wiki, raw
                      Default: wiki
excontinue          - When more results are available, use this to continue
exvariant           - Convert content into this language variant
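
The limits listed above could be checked on the client before a request is sent. The following is only a sketch, not Pywikibot code: the function name `build_extract_params` and its keyword arguments are made up for illustration, and it assumes boolean API flags are switched on simply by being present in the query.

```python
from typing import Dict, Optional

# Allowed values for exsectionformat, as listed in the parameter help.
SECTION_FORMATS = ("plain", "wiki", "raw")


def build_extract_params(
    titles: str,
    exchars: Optional[int] = None,
    exsentences: Optional[int] = None,
    exlimit: int = 1,
    exintro: bool = False,
    explaintext: bool = False,
    exsectionformat: str = "wiki",
) -> Dict[str, str]:
    """Return a query dict for prop=extracts (hypothetical helper)."""
    if exchars is not None and exchars < 1:
        raise ValueError("exchars must be no less than 1")
    if exsentences is not None and not 1 <= exsentences <= 10:
        raise ValueError("exsentences must be between 1 and 10")
    if exlimit > 20:
        raise ValueError("exlimit must be no more than 20")
    if exsectionformat not in SECTION_FORMATS:
        raise ValueError("exsectionformat must be plain, wiki or raw")

    params = {
        "action": "query",
        "prop": "extracts",
        "titles": titles,
        "exlimit": str(exlimit),
        "exsectionformat": exsectionformat,
    }
    if exchars is not None:
        params["exchars"] = str(exchars)
    if exsentences is not None:
        params["exsentences"] = str(exsentences)
    # Assumption: a boolean flag is sent only when it is enabled.
    if exintro:
        params["exintro"] = "1"
    if explaintext:
        params["explaintext"] = "1"
    return params
```

For instance, `build_extract_params("Therion", exchars=175)` yields a dict containing `exchars=175` and no `exintro` key, while `exsentences=11` raises `ValueError`.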

Example:

Get a 175-character extract:
  api.php?action=query&prop=extracts&exchars=175&titles=Therion

https://nl.wikipedia.org/w/api.php?action=query&prop=extracts&exchars=175&titles=Nicolaas_IJzendoorn&format=json
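
As a rough illustration of the request above, the URL can be assembled with the Python standard library and the extract read from the decoded JSON. This is a sketch only: `extract_url`, `extracts_from_response` and the `SAMPLE` payload are invented for this example, and the response layout (`query` → `pages` → page id → `extract`) is assumed to be the default JSON format.

```python
import json
from typing import Dict
from urllib.parse import urlencode

API_URL = "https://nl.wikipedia.org/w/api.php"


def extract_url(title: str, chars: int) -> str:
    """Build the request URL for an extract of roughly `chars` characters."""
    query = {
        "action": "query",
        "prop": "extracts",
        "exchars": chars,
        "titles": title,
        "format": "json",
    }
    return API_URL + "?" + urlencode(query)


def extracts_from_response(body: str) -> Dict[str, str]:
    """Map page title to extract text; pages without an extract map to ''."""
    pages = json.loads(body).get("query", {}).get("pages", {})
    return {
        page.get("title", page_id): page.get("extract", "")
        for page_id, page in pages.items()
    }


# Made-up payload in the assumed response shape, not real API output.
SAMPLE = json.dumps({
    "query": {"pages": {"1": {"pageid": 1, "ns": 0, "title": "Example",
                              "extract": "<p>Example text...</p>"}}}
})

if __name__ == "__main__":
    print(extract_url("Nicolaas_IJzendoorn", 175))
    print(extracts_from_response(SAMPLE))
```

Fetching the URL (for example with `urllib.request.urlopen`) is left out so that the sketch runs without network access.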


Version: core-(2.0)
Severity: enhancement
See Also:
T56569: Retrieve / edit the section

Details

Reference
bz70682

Event Timeline

bzimport raised the priority of this task from to Needs Triage. Nov 22 2014, 3:50 AM
bzimport set Reference to bz70682.
bzimport added a subscriber: Unknown Object (????).

How do you intend to use this?

I'm already using it to extract date of birth and date of death. The extract already leaves out the infobox template or image, so I don't have to strip those myself.

Why not extract those dates from the infobox?

A lot of articles don't have an infobox with this information.

Xqt triaged this task as Low priority. Jan 13 2019, 1:30 PM
Aklapper changed the subtype of this task from "Task" to "Feature Request". Feb 4 2022, 11:12 AM

Change 770098 had a related patch set uploaded (by Xqt; author: Xqt):

[pywikibot/core@master] [IMPR] Add TextExtracts support

https://gerrit.wikimedia.org/r/770098

Change 770098 merged by Xqt:

[pywikibot/core@master] [IMPR] Add TextExtracts support

https://gerrit.wikimedia.org/r/770098