Page MenuHomePhabricator

Top article-level topics with schema.org markup
Open, Needs TriagePublic

Description

Currently deployed schema.org metadata on a Wikipedia article such as Joey Ramone looks like this:

{
  "@context": "https://schema.org",
  "@type": "Article",
  "name": "Joey Ramone",
  "url": "https://en.wikipedia.org/wiki/Joey_Ramone",
  "sameAs": "http://www.wikidata.org/entity/Q312387",
  "mainEntity": "http://www.wikidata.org/entity/Q312387",
  "author": {
    "@type": "Organization",
    "name": "Contributors to Wikimedia projects"
  },
  "publisher": {
    "@type": "Organization",
    "name": "Wikimedia Foundation, Inc.",
    "logo": {
      "@type": "ImageObject",
      "url": "https://www.wikimedia.org/static/images/wmf-hor-googpub.png"
    }
  },
  "datePublished": "2001-06-28T21:48:04Z",
  "dateModified": "2022-09-18T15:33:56Z",
  "image": "https://upload.wikimedia.org/wikipedia/commons/1/15/Joey_Ramone.jpg",
  "headline": "American punk rock singer (1951–2001)"
}

We can enhance it with the top-N topic QIDs ranked by descending article-level relevance score (see T316925#8208516) via the keywords property as follows (top-3 example here):

"keywords": [
  "https://www.wikidata.org/wiki/Q483407",
  "https://www.wikidata.org/wiki/Q317262",
  "https://www.wikidata.org/wiki/Q2335089"
]

See also T302735: [SPIKE] Investigate effects of schema.org section metadata on Google search results