Maniphest T106765

migrate textlib parser functionality to mwparserfromhell
Open, MediumPublic
Actions

Assigned To

None

Authored By

	jayvdb
	Jul 23 2015, 11:57 PM

Tags

Referenced Files

None

Subscribers

pywikibot-bugs-list

Tokens

"Like" token, awarded by Ricordisamoa.

Description

textlib contains a lot of parsing functionality, built using regex. This usually works well, however it becomes complicated / error prone especially with nested templates.

However, as mwparserfromhell tokenizer doesn't expose start & end of each token, the textlib algorithms will need to be heavily revised, or mwparserfromhell needs improvements.

Related Objects
Search...

		Status	Subtype	Assigned	Task
		Open		None	T229723 Insufficient wikitext regex parser functions in textlib (tracking)
		Resolved		Xqt	T110529 template.py does not recognize nested templates
		Open		None	T106765 migrate textlib parser functionality to mwparserfromhell
		Resolved		Xqt	T106763 Mandatory dependency on mwparserfromhell
		Resolved		Ladsgroup	T88069 Add pure python mwparserfromhell to nightlies
		Resolved		Xqt	T71384 extract_templates_and_params parser bugs loading w:en:Main_Page with mwparserfromhell
		Open		None	T279005 Use textlib.extract_templates_and_param or Page.templatesWithParams in commons_information.py

Event Timeline

jayvdb created this task.Jul 23 2015, 11:57 PM

jayvdb raised the priority of this task from to Medium.

jayvdb updated the task description. (Show Details)

jayvdb added projects: Pywikibot, Pywikibot-textlib.py.

jayvdb added a subtask: T106763: Mandatory dependency on mwparserfromhell.

jayvdb added subscribers: Xqt, Earwig, Ricordisamoa and 3 others.

Also reflinks.py

Dalba added a parent task: T110529: template.py does not recognize nested templates.Apr 11 2016, 11:51 PM

Dalba subscribed.

Dvorapa added a subtask: T71384: extract_templates_and_params parser bugs loading w:en:Main_Page with mwparserfromhell.May 14 2019, 2:44 PM

Xqt added a parent task: T229723: Insufficient wikitext regex parser functions in textlib (tracking).Aug 3 2019, 9:57 AM

Xqt closed subtask T71384: extract_templates_and_params parser bugs loading w:en:Main_Page with mwparserfromhell as Resolved.Mar 30 2021, 8:39 AM

Xqt added a subtask: T279005: Use textlib.extract_templates_and_param or Page.templatesWithParams in commons_information.py.Apr 1 2021, 6:25 AM

JJMC89 closed subtask T106763: Mandatory dependency on mwparserfromhell as Resolved.Apr 13 2021, 4:05 PM

Izno removed a parent task: T229723: Insufficient wikitext regex parser functions in textlib (tracking).Oct 30 2021, 6:48 PM