SDelbecque-WMF (Stephanie Delbecque)
User

User Details

User Since
Apr 20 2022, 11:57 AM (198 w, 3 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
SDelbecque-WMF

Recent Activity

Wed, Feb 4

SDelbecque-WMF updated the task description for T416344: Prod Deploy Images, Lists and Attribution Signals.
Wed, Feb 4, 10:24 AM · Wikimedia Enterprise (Sprint 85)
SDelbecque-WMF updated the task description for T416344: Prod Deploy Images, Lists and Attribution Signals.
Wed, Feb 4, 10:23 AM · Wikimedia Enterprise (Sprint 85)
SDelbecque-WMF updated the task description for T416344: Prod Deploy Images, Lists and Attribution Signals.
Wed, Feb 4, 10:22 AM · Wikimedia Enterprise (Sprint 85)

Wed, Jan 28

SDelbecque-WMF renamed T415763: Add Welsh Wikipedia to Structured Contents snapshots for NLW Hackathon [2 days] from Welsh Wikipedia Structured Contents dataset for NLW Hackathon [2 days] to Add Welsh Wikipedia to Structured Contents snapshots for NLW Hackathon [2 days].
Wed, Jan 28, 2:21 PM · Wikimedia Enterprise (Sprint 85), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated the task description for T415763: Add Welsh Wikipedia to Structured Contents snapshots for NLW Hackathon [2 days].
Wed, Jan 28, 2:20 PM · Wikimedia Enterprise (Sprint 85), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated the task description for T415763: Add Welsh Wikipedia to Structured Contents snapshots for NLW Hackathon [2 days].
Wed, Jan 28, 10:16 AM · Wikimedia Enterprise (Sprint 85), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF created T415763: Add Welsh Wikipedia to Structured Contents snapshots for NLW Hackathon [2 days].
Wed, Jan 28, 10:12 AM · Wikimedia Enterprise (Sprint 85), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF added a project to T415191: Estimate Wikidata entity accumulation: Wikimedia Enterprise - Machine Readability.
Wed, Jan 28, 9:58 AM · Wikimedia-Enterprise-Kanban-On-Call, Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated the task description for T415191: Estimate Wikidata entity accumulation.
Wed, Jan 28, 9:57 AM · Wikimedia-Enterprise-Kanban-On-Call, Wikimedia Enterprise - Machine Readability

Wed, Jan 21

SDelbecque-WMF created T415191: Estimate Wikidata entity accumulation.
Wed, Jan 21, 12:55 PM · Wikimedia-Enterprise-Kanban-On-Call, Wikimedia Enterprise - Machine Readability

Wed, Jan 14

SDelbecque-WMF updated the task description for T383331: {Machine Readability}{lists} Improve List Parsing in Structured Contents.
Wed, Jan 14, 3:13 PM · Wikimedia Enterprise (Sprint 85), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated the task description for T414575: {Machine Readability}{Images} Parsed Images in article sections for Structured Contents.
Wed, Jan 14, 3:13 PM · Wikimedia Enterprise (Sprint 85)
SDelbecque-WMF updated the task description for T414575: {Machine Readability}{Images} Parsed Images in article sections for Structured Contents.
Wed, Jan 14, 3:12 PM · Wikimedia Enterprise (Sprint 85)
SDelbecque-WMF updated the task description for T414575: {Machine Readability}{Images} Parsed Images in article sections for Structured Contents.
Wed, Jan 14, 3:12 PM · Wikimedia Enterprise (Sprint 85)
SDelbecque-WMF updated the task description for T414575: {Machine Readability}{Images} Parsed Images in article sections for Structured Contents.
Wed, Jan 14, 3:10 PM · Wikimedia Enterprise (Sprint 85)
SDelbecque-WMF created T414575: {Machine Readability}{Images} Parsed Images in article sections for Structured Contents.
Wed, Jan 14, 1:04 PM · Wikimedia Enterprise (Sprint 85)
SDelbecque-WMF moved T383331: {Machine Readability}{lists} Improve List Parsing in Structured Contents from Machine Readability PB to To Be Estimated/To Be Discussed on the Wikimedia Enterprise board.
Wed, Jan 14, 11:18 AM · Wikimedia Enterprise (Sprint 85), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF renamed T383331: {Machine Readability}{lists} Improve List Parsing in Structured Contents from {Machine Readability}{lists} Feasibility of releasing parsed lists to {Machine Readability}{lists} Improve List Parsing in Structured Contents.
Wed, Jan 14, 8:02 AM · Wikimedia Enterprise (Sprint 85), Wikimedia Enterprise - Machine Readability

Tue, Jan 13

SDelbecque-WMF closed T386276: {Machine Readability} Abstract contains incorrect data on structured data fetch as Resolved.
Tue, Jan 13, 11:48 AM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise

Jan 7 2026

SDelbecque-WMF updated the task description for T412840: Schema feedback.
Jan 7 2026, 11:05 AM · Wikimedia Enterprise (Sprint 84)
SDelbecque-WMF added a comment to T412840: Schema feedback.

This has been split off into its own ticket, as it requires more investigation: https://phabricator.wikimedia.org/T413096

Jan 7 2026, 10:35 AM · Wikimedia Enterprise (Sprint 84)

Dec 18 2025

SDelbecque-WMF updated the task description for T412840: Schema feedback.
Dec 18 2025, 3:20 PM · Wikimedia Enterprise (Sprint 84)
SDelbecque-WMF created T413096: "date_created" shows same value as "date_modified" instead of original creation date.
Dec 18 2025, 3:18 PM · Wikimedia Enterprise (Sprint 84)
SDelbecque-WMF created T413072: Scholarly items where only "main" Wikidata items are expected.
Dec 18 2025, 11:44 AM · Wikimedia Enterprise (WME Kanban)

Dec 17 2025

SDelbecque-WMF created T412963: Roll out Structured Content snapshots for Indonesian Wikipedia for Semantic Search MVP.
Dec 17 2025, 3:16 PM · Wikimedia Enterprise (Sprint 84)

Dec 16 2025

SDelbecque-WMF created T412840: Schema feedback.
Dec 16 2025, 3:52 PM · Wikimedia Enterprise (Sprint 84)

Dec 11 2025

SDelbecque-WMF updated the task description for T412110: Schema documentation for the Release Readiness Marketing Checklist.
Dec 11 2025, 2:41 PM · Wikimedia Enterprise (WME Kanban)

Dec 9 2025

SDelbecque-WMF updated the task description for T412116: Change wikidata endpoint naming.
Dec 9 2025, 3:16 PM · Wikimedia Enterprise (WME Kanban)
SDelbecque-WMF created T412116: Change wikidata endpoint naming.
Dec 9 2025, 2:08 PM · Wikimedia Enterprise (WME Kanban)

Dec 8 2025

SDelbecque-WMF created T412021: Fix Structured Contents headings.
Dec 8 2025, 2:02 PM · Wikimedia Enterprise (Sprint 84), Wikimedia Enterprise - Machine Readability

Oct 16 2025

SDelbecque-WMF renamed T407386: Create EN Structured Contents Sample Set (with and without tables) for Reuser Feedback from Create Structured Contents Sample Set (with and without tables) for Reuser Feedback to Create EN Structured Contents Sample Set (with and without tables) for Reuser Feedback.
Oct 16 2025, 1:33 PM · Wikimedia Enterprise, Wikimedia Enterprise - Machine Readability

Oct 15 2025

SDelbecque-WMF created T407386: Create EN Structured Contents Sample Set (with and without tables) for Reuser Feedback.
Oct 15 2025, 3:23 PM · Wikimedia Enterprise, Wikimedia Enterprise - Machine Readability

Oct 14 2025

SDelbecque-WMF updated the task description for T407255: Payload example of a regular item for wikidata.
Oct 14 2025, 3:51 PM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise (Sprint 83)
SDelbecque-WMF created T407255: Payload example of a regular item for wikidata.
Oct 14 2025, 3:42 PM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise (Sprint 83)

Sep 25 2025

SDelbecque-WMF updated the task description for T401936: Assess impact of sub-references on our Parsed references: .
Sep 25 2025, 2:48 PM · Wikimedia Enterprise (Sprint 82)

Aug 7 2025

SDelbecque-WMF added a parent task for T396588: Prod deploy Tables Parsing: T391361: [APP: Objective 3 Term 3.2 - WME OKR TBD - Q1 FY26/26] - Parsed tables are deployed in beta on the structured contents endpoints.
Aug 7 2025, 2:57 PM · Wikimedia Enterprise (Sprint 81), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF added a subtask for T391361: [APP: Objective 3 Term 3.2 - WME OKR TBD - Q1 FY26/26] - Parsed tables are deployed in beta on the structured contents endpoints: T396588: Prod deploy Tables Parsing.
Aug 7 2025, 2:57 PM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise

Aug 4 2025

SDelbecque-WMF created T401119: Structured Contents snapshots size increase with tables.
Aug 4 2025, 2:16 PM · Wikimedia-Enterprise-Kanban-On-Call

Jun 18 2025

SDelbecque-WMF updated the task description for T396604: Tables On-demand Automated QA.
Jun 18 2025, 1:45 PM · Wikimedia Enterprise (Sprint 78), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF created T397285: Access to references codebase.
Jun 18 2025, 8:05 AM · Wikimedia-Enterprise-Kanban-On-Call

May 7 2025

SDelbecque-WMF added a comment to T383834: Migrate Wikipedia Datasets to Kaggle.

Available here: https://www.kaggle.com/datasets/wikimedia-foundation/wikipedia-structured-contents!

May 7 2025, 9:07 AM · Wikimedia Enterprise
SDelbecque-WMF closed T383834: Migrate Wikipedia Datasets to Kaggle, a subtask of T387578: P1- Set up initial Wikimedia datasets on Kaggle, as Resolved.
May 7 2025, 9:06 AM · Wikimedia Enterprise (Sprint 75)
SDelbecque-WMF closed T383834: Migrate Wikipedia Datasets to Kaggle as Resolved.
May 7 2025, 9:06 AM · Wikimedia Enterprise

May 6 2025

SDelbecque-WMF created T393452: Cached vandalized Wikidata description persists on PCS Summary endpoint.
May 6 2025, 11:51 AM · Content-Transform-Team (Work In Progress)

Apr 8 2025

SDelbecque-WMF updated the task description for T391250: Bug Report: Poetry "abstract" includes wrong sentence in Structured Contents.
Apr 8 2025, 11:42 AM · Wikimedia-Enterprise-Kanban-On-Call, Wikimedia Enterprise - Machine Readability

Apr 7 2025

SDelbecque-WMF updated the task description for T391250: Bug Report: Poetry "abstract" includes wrong sentence in Structured Contents.
Apr 7 2025, 2:09 PM · Wikimedia-Enterprise-Kanban-On-Call, Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated the task description for T391250: Bug Report: Poetry "abstract" includes wrong sentence in Structured Contents.
Apr 7 2025, 1:25 PM · Wikimedia-Enterprise-Kanban-On-Call, Wikimedia Enterprise - Machine Readability
SDelbecque-WMF created T391250: Bug Report: Poetry "abstract" includes wrong sentence in Structured Contents.
Apr 7 2025, 12:02 PM · Wikimedia-Enterprise-Kanban-On-Call, Wikimedia Enterprise - Machine Readability

Apr 1 2025

SDelbecque-WMF triaged T389925: Bug: structured-contents description incorrect in on-demand API if it contains double quotes as High priority.
Apr 1 2025, 8:52 AM · Wikimedia Enterprise (Sprint 76), Wikimedia Enterprise - Machine Readability

Mar 31 2025

SDelbecque-WMF moved T389879: Provide more detail on text position in links from Incoming to Machine Readability PB on the Wikimedia Enterprise board.
Mar 31 2025, 11:53 AM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise
SDelbecque-WMF added a project to T389879: Provide more detail on text position in links: Wikimedia Enterprise - Machine Readability.
Mar 31 2025, 11:53 AM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise
SDelbecque-WMF moved T390533: Add Dutch to Structured Contents Snapshot beta endpoint from Incoming to To Be Estimated/To Be Discussed on the Wikimedia Enterprise board.
Mar 31 2025, 11:53 AM · Wikimedia Enterprise (Sprint 75), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF created T390533: Add Dutch to Structured Contents Snapshot beta endpoint.
Mar 31 2025, 11:51 AM · Wikimedia Enterprise (Sprint 75), Wikimedia Enterprise - Machine Readability

Mar 13 2025

SDelbecque-WMF moved T388790: {Tables} [Investigation]: Evaluate table parser code [10 days] from Incoming to To Be Estimated/To Be Discussed on the Wikimedia Enterprise board.
Mar 13 2025, 1:37 PM · Wikimedia Enterprise (Sprint 75), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF created T388790: {Tables} [Investigation]: Evaluate table parser code [10 days].
Mar 13 2025, 1:36 PM · Wikimedia Enterprise (Sprint 75), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF moved T388788: Add Structured Contents schema to json-schema from Machine Readability PB to To Be Estimated/To Be Discussed on the Wikimedia Enterprise board.
Mar 13 2025, 1:29 PM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise
SDelbecque-WMF created T388788: Add Structured Contents schema to json-schema.
Mar 13 2025, 1:19 PM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise
SDelbecque-WMF updated the task description for T388779: Update Structured Contents Usecase demo.
Mar 13 2025, 12:53 PM · Wikimedia Enterprise (Sprint 75), Wikimedia Enterprise - Machine Readability

Feb 28 2025

SDelbecque-WMF updated the task description for T384448: {Parsed References} Deploy SC to prod and Run Re-ingestion.
Feb 28 2025, 10:30 AM · Wikimedia Enterprise (sprint 73), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated the task description for T384448: {Parsed References} Deploy SC to prod and Run Re-ingestion.
Feb 28 2025, 10:30 AM · Wikimedia Enterprise (sprint 73), Wikimedia Enterprise - Machine Readability

Feb 26 2025

SDelbecque-WMF updated the task description for T384448: {Parsed References} Deploy SC to prod and Run Re-ingestion.
Feb 26 2025, 1:22 PM · Wikimedia Enterprise (sprint 73), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated subscribers of T384448: {Parsed References} Deploy SC to prod and Run Re-ingestion.

@JArguello-WMF updated the description to reflect the dark launch date

Feb 26 2025, 12:25 PM · Wikimedia Enterprise (sprint 73), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated the task description for T384448: {Parsed References} Deploy SC to prod and Run Re-ingestion.
Feb 26 2025, 12:25 PM · Wikimedia Enterprise (sprint 73), Wikimedia Enterprise - Machine Readability

Feb 20 2025

SDelbecque-WMF updated the task description for T371015: {Machine Readability} {tables} Selection of tables sample set.
Feb 20 2025, 1:33 PM · Wikimedia Enterprise (sprint 73), Wikimedia Enterprise - Machine Readability

Feb 13 2025

SDelbecque-WMF added a comment to T386276: {Machine Readability} Abstract contains incorrect data on structured data fetch.

side note: summary endpoint also includes some special characters and styling: https://en.wikipedia.org/api/rest_v1/page/summary/List_of_The_Elusive_Samurai_chapters

Feb 13 2025, 9:18 AM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise

Feb 12 2025

SDelbecque-WMF created T386197: {Machine Readability} [Investigation] Replace SummaryAPI call for description [2 days].
Feb 12 2025, 1:36 PM · Wikimedia Enterprise (sprint 73)

Jan 29 2025

SDelbecque-WMF added a comment to T365369: Investigation: understand internet archive dump of wiki references.

Link to tool in description doesn't work for me anymore, but this one does: https://internetarchive.github.io/iare/?url=https://en.wikipedia.org/wiki/Lionel_Messi&method=IABOT

Jan 29 2025, 10:09 AM · Wikimedia Enterprise - Machine Readability, Internet-Archive, Wikimedia Enterprise - Content Integrity, Wikimedia Enterprise
SDelbecque-WMF added a comment to T365369: Investigation: understand internet archive dump of wiki references.

and https://github.com/internetarchive/wiki-references-db, https://github.com/internetarchive/wiki-references-extractor

Jan 29 2025, 9:32 AM · Wikimedia Enterprise - Machine Readability, Internet-Archive, Wikimedia Enterprise - Content Integrity, Wikimedia Enterprise

Jan 22 2025

SDelbecque-WMF added a comment to T365369: Investigation: understand internet archive dump of wiki references.

code: https://github.com/internetarchive/iari

Jan 22 2025, 1:56 PM · Wikimedia Enterprise - Machine Readability, Internet-Archive, Wikimedia Enterprise - Content Integrity, Wikimedia Enterprise
SDelbecque-WMF renamed T384447: {Parsed References} Structured Contents Reference Parsing QA from Structured Contents Parsing QA to Structured Contents Reference Parsing QA.
Jan 22 2025, 1:51 PM · Wikimedia Enterprise (Sprint 72), Wikimedia Enterprise - Machine Readability

Jan 15 2025

SDelbecque-WMF updated the task description for T383680: {Parsed references} Help craft release notes for references products.
Jan 15 2025, 12:42 PM · Wikimedia Enterprise (sprint 73), Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise - Content Integrity

Jan 9 2025

SDelbecque-WMF moved T383331: {Machine Readability}{lists} Improve List Parsing in Structured Contents from Machine Readability PB to To Be Estimated/To Be Discussed on the Wikimedia Enterprise board.
Jan 9 2025, 2:14 PM · Wikimedia Enterprise (Sprint 85), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF added a project to T383331: {Machine Readability}{lists} Improve List Parsing in Structured Contents: Wikimedia Enterprise - Machine Readability.
Jan 9 2025, 2:13 PM · Wikimedia Enterprise (Sprint 85), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF created T383331: {Machine Readability}{lists} Improve List Parsing in Structured Contents.
Jan 9 2025, 2:13 PM · Wikimedia Enterprise (Sprint 85), Wikimedia Enterprise - Machine Readability

Dec 5 2024

SDelbecque-WMF added a comment to T381439: Investigation: Should we grab data from xtools? [timebox 6 days].

More info on API: https://xtools.wmcloud.org/api and https://www.mediawiki.org/wiki/XTools/API

Dec 5 2024, 3:52 PM · Wikimedia Enterprise (Sprint 76), Wikimedia Enterprise - Content Integrity

Oct 30 2024

SDelbecque-WMF updated the task description for T376375: Create a unified list of wikitext citation templates across 6 project/languages.
Oct 30 2024, 3:00 PM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise
SDelbecque-WMF moved T376375: Create a unified list of wikitext citation templates across 6 project/languages from Incoming to To Be Estimated/To Be Discussed on the Wikimedia Enterprise board.
Oct 30 2024, 11:29 AM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise

Oct 23 2024

SDelbecque-WMF added a comment to T377188: Reference items' order is different in Parsoid and legacy HTML.

Ok, thanks a lot for the quick help and clarification, @cscott!

Oct 23 2024, 4:40 PM · Parsoid, Wikimedia Enterprise
SDelbecque-WMF added a comment to T377192: Parsoid title has underscore between words.

great, thanks!

Oct 23 2024, 10:05 AM · Content-Transform-Team-WIP, Essential-Work, Parsoid, Wikimedia Enterprise
SDelbecque-WMF added a comment to T375462: {HuggingFace} Check if the schema is up to date (schema.yaml).

More feedback:

Oct 23 2024, 9:03 AM · Wikimedia Enterprise (Sprint 69), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated the task description for T375462: {HuggingFace} Check if the schema is up to date (schema.yaml).
Oct 23 2024, 8:57 AM · Wikimedia Enterprise (Sprint 69), Wikimedia Enterprise - Machine Readability

Oct 22 2024

SDelbecque-WMF added a comment to T377189: Parsoid title first char is upper case, while legacy HTML title first char is lower case.

Thanks @cscott!

Oct 22 2024, 2:26 PM · Documentation, Content-Transform-Team-WIP, Parsoid, Essential-Work, Wikimedia Enterprise
SDelbecque-WMF added a comment to T377192: Parsoid title has underscore between words.

Thanks @ABreault-WMF and @cscott! The expected behaviour is spaces?

Oct 22 2024, 2:12 PM · Content-Transform-Team-WIP, Essential-Work, Parsoid, Wikimedia Enterprise
SDelbecque-WMF added a comment to T377188: Reference items' order is different in Parsoid and legacy HTML.

Thanks @cscott, is the different order due to the same issue as the missing reference? For this particular page, the missing reference was reference no. 9, but the order already differs from no. 3 onward, and the inline citations now point to different references in legacy HTML vs. Parsoid for the same statement (see screenshot).

Oct 22 2024, 1:35 PM · Parsoid, Wikimedia Enterprise

Oct 15 2024

SDelbecque-WMF created T377194: tag inside Parsoid <title> becomes text: e.g: <i>..
Oct 15 2024, 10:13 AM · Wikimedia Enterprise, Content-Transform-Team
SDelbecque-WMF created T377192: Parsoid title has underscore between words.
Oct 15 2024, 10:08 AM · Content-Transform-Team-WIP, Essential-Work, Parsoid, Wikimedia Enterprise
SDelbecque-WMF created T377189: Parsoid title first char is upper case, while legacy HTML title first char is lower case.
Oct 15 2024, 10:05 AM · Documentation, Content-Transform-Team-WIP, Parsoid, Essential-Work, Wikimedia Enterprise
SDelbecque-WMF created T377188: Reference items' order is different in Parsoid and legacy HTML.
Oct 15 2024, 10:02 AM · Parsoid, Wikimedia Enterprise

Oct 7 2024

SDelbecque-WMF moved T376372: {References} [Investigation] Wikitext vs HTML Parsoid for references [3 weeks-end of sprint 68] from Machine Readability PB to To Be Estimated/To Be Discussed on the Wikimedia Enterprise board.
Oct 7 2024, 11:32 AM · Wikimedia Enterprise (Sprint 68), Wikimedia Enterprise - Machine Readability

Oct 3 2024

SDelbecque-WMF updated the task description for T372645: empty "values" defect in Infobox in Structured Contents endpoints (On-demand & Snapshot).
Oct 3 2024, 1:45 PM · Wikimedia Enterprise (Sprint 67), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated the task description for T372645: empty "values" defect in Infobox in Structured Contents endpoints (On-demand & Snapshot).
Oct 3 2024, 1:42 PM · Wikimedia Enterprise (Sprint 67), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF renamed T372645: empty "values" defect in Infobox in Structured Contents endpoints (On-demand & Snapshot) from empty "values" defect in Infobox in Structured Contents On-demand endpoint to empty "values" defect in Infobox in Structured Contents endpoints (On-demand & Snapshot).
Oct 3 2024, 1:22 PM · Wikimedia Enterprise (Sprint 67), Wikimedia Enterprise - Machine Readability

Sep 26 2024

SDelbecque-WMF renamed T375462: {HuggingFace} Check if the schema is up to date (schema.yaml) from Expand Hugging Face dataset card with metadata to Update schema.yaml.
Sep 26 2024, 1:44 PM · Wikimedia Enterprise (Sprint 69), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF closed T373429: Prepare sample datasets from Structured Contents Snapshot endpoint as Resolved.
Sep 26 2024, 1:35 PM · Wikimedia Enterprise - Machine Readability, Wikimedia Enterprise

Sep 25 2024

SDelbecque-WMF added a comment to T375462: {HuggingFace} Check if the schema is up to date (schema.yaml).

and
sections.has_parts.has_parts.has_parts.has_parts.name
sections.has_parts.has_parts.has_parts.has_parts.has_parts.links.images
sections.has_parts.has_parts.has_parts.has_parts.has_parts.has_parts
sections.has_parts.has_parts.has_parts.has_parts.has_parts.has_parts.links

Sep 25 2024, 8:46 AM · Wikimedia Enterprise (Sprint 69), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF added a comment to T375462: {HuggingFace} Check if the schema is up to date (schema.yaml).

Albert also added missing field infoboxes.has_parts.images

Sep 25 2024, 6:34 AM · Wikimedia Enterprise (Sprint 69), Wikimedia Enterprise - Machine Readability

Sep 24 2024

SDelbecque-WMF added a comment to T375462: {HuggingFace} Check if the schema is up to date (schema.yaml).

More feedback:

Sep 24 2024, 9:17 AM · Wikimedia Enterprise (Sprint 69), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated the task description for T375462: {HuggingFace} Check if the schema is up to date (schema.yaml).
Sep 24 2024, 9:07 AM · Wikimedia Enterprise (Sprint 69), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF added a comment to T375462: {HuggingFace} Check if the schema is up to date (schema.yaml).

First changes are done.
Feedback from Albert:
After implementing the schema, I am finding some additional issues because the data is not aligned with the schema. For example:

  • I had to fix the names false and true to 'false' and 'true' (with quotes)
  • I had to rename the field no_index to noindex (without underscore)
  • I had to add the missing field event.date_published

Also I discovered the root field in_language is duplicated.
And now I am also facing a new missing field called "images": I am investigating in which super-field.
So I was wondering if you have a newer version of the complete schema. Otherwise, I should continue trying locally until I get the complete schema.
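
The kind of mismatch described above (fields present in the data but absent from the schema) could be checked along these lines. This is only a minimal sketch, not the approach Albert actually used: the helper names `field_paths` and `missing_from_schema` are hypothetical, the sample record is invented for illustration, and it assumes the schema is available as a flat set of dotted paths such as `infoboxes.has_parts.images`.

```python
from typing import Any, Iterable, Iterator, List


def field_paths(value: Any, prefix: str = "") -> Iterator[str]:
    """Yield a dotted path for every key found in a nested JSON-like value."""
    if isinstance(value, dict):
        for key, child in value.items():
            path = f"{prefix}.{key}" if prefix else key
            yield path
            yield from field_paths(child, path)
    elif isinstance(value, list):
        # Lists add no path segment: each element is walked under the same prefix.
        for item in value:
            yield from field_paths(item, prefix)


def missing_from_schema(record: dict, schema_paths: Iterable[str]) -> List[str]:
    """Return the dotted paths that occur in the record but not in the schema."""
    return sorted(set(field_paths(record)) - set(schema_paths))


# Hypothetical record and schema, for illustration only.
record = {
    "name": "Example",
    "event": {"date_published": "2024-09-24"},
    "infoboxes": [{"has_parts": [{"images": []}]}],
}
schema = {"name", "event", "infoboxes", "infoboxes.has_parts"}

print(missing_from_schema(record, schema))
```

Running a check like this over a sample of records would surface every undeclared field in one pass instead of one field per failed load.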

Sep 24 2024, 9:05 AM · Wikimedia Enterprise (Sprint 69), Wikimedia Enterprise - Machine Readability
SDelbecque-WMF updated the task description for T375462: {HuggingFace} Check if the schema is up to date (schema.yaml).
Sep 24 2024, 9:04 AM · Wikimedia Enterprise (Sprint 69), Wikimedia Enterprise - Machine Readability