Implement processing of large dataset
Closed, Resolved (Public)

Description

As we don't have access to the entire authorities dataset yet, the current implementation can only process single posts (read from a local file or downloaded via URI) for test purposes.

According to the information I received, the dump will be formatted as one JSON-LD file per post, so the script will have to be adjusted to handle that.
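A rough sketch of what the adjustment could look like, assuming the dump is unpacked into a directory tree with one JSON-LD file per post. The `.jsonld` extension, the function names `iter_posts` and `process_dump`, and the `process_post` callback (standing in for the existing single-post logic) are all hypothetical and not taken from the actual script.

```python
import json
from pathlib import Path
from typing import Any, Callable, Iterator, Tuple


def iter_posts(dump_dir: Path, pattern: str = "*.jsonld") -> Iterator[Tuple[Path, Any]]:
    """Yield (path, parsed document) for each matching file under dump_dir.

    The "*.jsonld" pattern is an assumption about how the dump names its files.
    """
    for path in sorted(dump_dir.rglob(pattern)):
        try:
            with path.open(encoding="utf-8") as fh:
                yield path, json.load(fh)
        except (OSError, ValueError) as exc:
            # ValueError covers both malformed JSON and bad UTF-8;
            # one broken file should not abort the whole run.
            print(f"skipping {path}: {exc}")


def process_dump(dump_dir: Path, process_post: Callable[[Any], None]) -> int:
    """Run process_post on every readable post and return how many were handled."""
    count = 0
    for _path, doc in iter_posts(dump_dir):
        process_post(doc)
        count += 1
    return count
```

Reading the files lazily, one at a time, keeps memory use flat regardless of the size of the dump, and the existing single-post code path can be passed in as `process_post` without changes.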

Event Timeline

We have received the complete dataset today, so we can now start working on this.