The current RDF Serializer and Generator use EaseRDF for RDF output. This requires us to build an in-memory model of the RDF first. This is highly inefficient. A streaming RDF output interface would be much preferrable.
Description
Description
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Open | None | T88728 Improve Wikimedia dumping infrastructure | |||
Open | None | T88991 improve Wikidata dumps [tracking] | |||
Open | None | T46581 Partial dumps | |||
Resolved | Smalyshev | T46580 Script for creating RDF dumps of all entities | |||
Duplicate | None | T211495 Dump(s) of Wikidata classes | |||
Duplicate | None | T211497 Dump(s) of Wikidata instances of Q5 | |||
Open | None | T162351 Create a "page prop" RDF dump for Wikidata entities ("pagePropertiesRdf") | |||
Open | None | T98320 [Task] Create dump of entity redirects (JSON or n-triples) | |||
Open | None | T285307 Create randomly split partial entity dumps | |||
Open | None | T44063 [Epic] Provide a plain linked data interface for accessing entities | |||
Resolved | hoo | T101837 [Story] switch default rdf format to full (include statements) | |||
Open | None | T50143 Implement complete RDF mapping for entities (tracking) | |||
Resolved | daniel | T92523 Implement fast sequential RDF output generation |
Event Timeline
This comment was removed by daniel.
Comment Actions
Change 195185 had a related patch set uploaded (by JanZerebecki):
Introduce fast RDF writer