
Add Support for Parsing Wiki Tables in ParseWiki
Open, In Progress, Medium, Public

Description

Currently, ParseWiki does not support parsing MediaWiki tables ({| ... |}). This task aims to extend the parser to support table syntax and include tables in the generated parse tree.

Requirements:

  • Detect and parse table syntax:
    • Table start {| and end |}
    • Row separator |-
    • Header cells !
    • Regular cells |
    • Attributes and classes (e.g., class="wikitable")
  • Add unit tests for various table cases (simple, nested, with attributes, etc.).
  • Ensure the code follows the ParseWiki project’s current structure and style.
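
As a rough illustration of the syntax elements listed above, here is a minimal sketch in Python. It is not ParseWiki's actual code or API: the names `Cell`, `Table`, `_split_cells`, and `parse_table` are hypothetical, and the choice of Python is an assumption. The sketch handles a single, non-nested table, skips caption lines (`|+`), and only splits header cells on `!!` and regular cells on `||`; nested tables and multi-line cell content would need a real recursive parser.

```python
from dataclasses import dataclass, field


@dataclass
class Cell:
    text: str
    header: bool = False
    attrs: str = ""


@dataclass
class Table:
    attrs: str = ""
    rows: list = field(default_factory=list)


def _split_cells(body, sep, header):
    """Split one line's content into cells on the inline separator (|| or !!)."""
    cells = []
    for part in body.split(sep):
        attrs, text = "", part
        # A single pipe separates optional cell attributes from the content.
        # Cells containing wiki links ([[a|b]]) are left unsplit in this sketch.
        if "|" in part and "[[" not in part:
            attrs, text = part.split("|", 1)
        cells.append(Cell(text.strip(), header, attrs.strip()))
    return cells


def parse_table(wikitext):
    """Parse the first non-nested table; return a Table, or None if none is found."""
    table, row = None, []
    for raw in wikitext.splitlines():
        line = raw.strip()
        if table is None:
            if line.startswith("{|"):          # table start, optional attributes
                table, row = Table(attrs=line[2:].strip()), []
            continue
        if line.startswith("|}"):              # table end
            break
        if line.startswith("|-"):              # row separator
            if row:
                table.rows.append(row)
            row = []
        elif line.startswith("|+"):            # caption: ignored in this sketch
            continue
        elif line.startswith("!"):             # header cells
            row.extend(_split_cells(line[1:], "!!", header=True))
        elif line.startswith("|"):             # regular cells
            row.extend(_split_cells(line[1:], "||", header=False))
    if table is not None and row:
        table.rows.append(row)                 # flush the last (or unterminated) row
    return table
```

The order of the checks matters: `|}`, `|-`, and `|+` all begin with `|`, so they have to be tested before the generic cell case. A unit test for the "simple" and "with attributes" cases could build a short table string, call `parse_table`, and compare the cell texts and attribute strings row by row.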

Resources:

Impact:
Supporting tables will significantly improve the accuracy and completeness of ParseWiki output, especially for use cases involving data extraction or wiki content analysis.

Event Timeline

Gerges triaged this task as Medium priority. Jul 15 2025, 12:32 PM

Thank you for tagging this task with good first task for Wikimedia newcomers!

Newcomers often may not be aware of things that may seem obvious to seasoned contributors, so please take a moment to reflect on how this task might look to somebody who has never contributed to Wikimedia projects.

A good first task is a self-contained, non-controversial task with a clear approach. It should be well-described with pointers to help a completely new contributor, for example it should clearly point to the codebase URL and provide clear steps to help a contributor get set up for success. We've included some guidelines at https://phabricator.wikimedia.org/tag/good_first_task/ !

Thank you for helping us drive new contributions to our projects <3

Tejashxv removed a project: RoadToWiki.
Tejashxv subscribed.
sanjaisid renamed this task from Add Support for Parsing Wiki Tables in ParseWiki to Add Support for Parsing Wiki Tables in ParseWiki. Oct 5 2025, 12:32 PM
sanjaisid claimed this task.

Change #1194683 had a related patch set uploaded (by Sanjai Siddharthan; author: Sanjai Siddharthan):

[mediawiki/tools/ParseWiki@main] Add functionality to parse wiki tables in ParseWiki

https://gerrit.wikimedia.org/r/1194683

sanjaisid changed the task status from Open to In Progress. Oct 10 2025, 7:35 PM

@Gerges I submitted a patch 10 days ago. Please review it when you are free.

@Gerges I have fixed the pipeline issues and the main build succeeded.

Hi @sanjaisid, thanks for your contribution, and sorry for the delay. I've been busy with several projects outside of Wikimedia. I'll try my best to review your patch.