
In this task, we want to exercise simple parsing of a Wikipedia article and classification of some of its sentences. Please write a program or script in your preferred language that:

1. Receives as input the title of an English Wikipedia article.
2. Retrieves the text of that article from the MediaWiki API. If using Python, consider using [python-mwapi](https://github.com/mediawiki-utilities/python-mwapi) for this.
3. Identifies individual sentences within that text, along with the corresponding section titles. If using Python, [mwparserfromhell](https://github.com/earwig/mwparserfromhell/) can help you work with wiki markup.
4. Runs those sentences through the model to classify them.
5. Outputs the sentences, one per line, sorted by the score given by the model.

This is similar to the run_citation_need_model.py script in the model repository, but that one loads its input statements from an already structured file, whereas you have to extract that information directly from a Wikipedia article.

Please create a GitHub (or similar, such as Bitbucket) repository with your code and send us a link to it in a comment on this Phabricator entry.

**Deadline**: This task has no deadline of its own, other than the [November 5th deadline for contributions in Outreachy](https://www.outreachy.org/apply/project-selection/). The sooner the better, though, as we would like to look at your code and possibly file an issue or discuss design decisions before the actual deadline.

Feel free to ping @Miriam or myself if you have questions.
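
To make the expected shape of the pipeline a little more concrete, here is a minimal sketch of steps 2, 3 and 5 in Python. It is only an illustration under several assumptions, not a reference solution: it uses the standard library and crude regular expressions in place of python-mwapi and mwparserfromhell, the names `fetch_wikitext`, `split_sections`, `strip_markup`, `split_sentences` and `rank_sentences` are made up for this example, the label `"Lead"` for text before the first heading is an arbitrary choice, sorting in descending order is assumed, and `score_fn` is a placeholder callable standing in for the real model (step 4), whose loading and input format are not shown here.

```python
import json
import re
import urllib.parse
import urllib.request

API_URL = "https://en.wikipedia.org/w/api.php"

# Matches wikitext headings such as "== History ==" (levels 2 to 6).
HEADING = re.compile(r"^(={2,6})[ \t]*(.*?)[ \t]*\1[ \t]*$", re.MULTILINE)
# Naive sentence boundary: end punctuation, whitespace, then a capital letter.
SENTENCE_END = re.compile(r"(?<=[.!?])\s+(?=[A-Z])")


def fetch_wikitext(title):
    """Fetch raw wikitext via the MediaWiki Action API (performs a network call)."""
    params = urllib.parse.urlencode({
        "action": "parse", "page": title, "prop": "wikitext",
        "format": "json", "formatversion": "2", "redirects": "1",
    })
    request = urllib.request.Request(
        f"{API_URL}?{params}",
        headers={"User-Agent": "citation-need-sketch/0.1 (example)"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["parse"]["wikitext"]


def split_sections(wikitext, lead_title="Lead"):
    """Return (section_title, section_body) pairs in document order."""
    sections, title, start = [], lead_title, 0
    for match in HEADING.finditer(wikitext):
        sections.append((title, wikitext[start:match.start()]))
        title, start = match.group(2), match.end()
    sections.append((title, wikitext[start:]))
    return sections


def strip_markup(text):
    """Crude cleanup; a real parser such as mwparserfromhell handles far more."""
    text = re.sub(r"<ref[^>]*/>", "", text)                           # <ref ... />
    text = re.sub(r"<ref[^>]*>.*?</ref>", "", text, flags=re.DOTALL)  # <ref>...</ref>
    text = re.sub(r"\{\{[^{}]*\}\}", "", text)                        # non-nested templates
    text = re.sub(r"\[\[(?:[^|\]]*\|)?([^\]]*)\]\]", r"\1", text)     # [[target|label]]
    return re.sub(r"'{2,}", "", text)                                 # bold/italic quotes


def split_sentences(text):
    """Split cleaned text into sentences using the naive boundary rule above."""
    flat = " ".join(text.split())
    return [sentence for sentence in SENTENCE_END.split(flat) if sentence]


def rank_sentences(wikitext, score_fn):
    """Score every sentence with score_fn(sentence, section); highest score first."""
    rows = []
    for title, body in split_sections(wikitext):
        for sentence in split_sentences(strip_markup(body)):
            rows.append((score_fn(sentence, title), title, sentence))
    return sorted(rows, key=lambda row: row[0], reverse=True)
```

With a real scoring function in place of the placeholder, the final output step could be as simple as looping over `rank_sentences(fetch_wikitext(title), score_fn)` and printing the score, section title and sentence on one line each. The regex-based cleanup and sentence splitting will miss nested templates, tables, abbreviations and similar cases, which is exactly where a proper wikitext parser and sentence tokenizer are worth using.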