As a user of the Wikidata triples database available in Hive, I want all triples to be unique so that analyses are more accurate.
Because of how the RDF dumps are generated, they may contain duplicate triples. After discussing with @JAllemandou, we agreed to deduplicate early, when importing the data.
AC:
- all the triples are unique
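
A minimal sketch of the intended behaviour, not the actual import job. It assumes a triple is a plain `(subject, predicate, object)` tuple; the real table may carry extra columns (for example a context/graph column) that would need to be part of the key. The names `dedupe_triples` and `all_unique` are hypothetical, and the sample values are illustrative only.

```python
from typing import Iterable, Iterator, List, Tuple

# Assumption: a triple is (subject, predicate, object); the real schema may differ.
Triple = Tuple[str, str, str]


def dedupe_triples(triples: Iterable[Triple]) -> Iterator[Triple]:
    """Yield each distinct triple once, keeping first-seen order."""
    seen = set()
    for triple in triples:
        if triple not in seen:
            seen.add(triple)
            yield triple


def all_unique(triples: Iterable[Triple]) -> bool:
    """Check the acceptance criterion: no triple appears more than once."""
    materialized: List[Triple] = list(triples)
    return len(materialized) == len(set(materialized))


# Illustrative input with one duplicated triple.
raw = [
    ("wd:Q42", "wdt:P31", "wd:Q5"),
    ("wd:Q42", "wdt:P31", "wd:Q5"),
    ("wd:Q42", "rdfs:label", '"Douglas Adams"@en'),
]
imported = list(dedupe_triples(raw))
```

On a distributed engine the same effect would normally come from a distinct-style operation over the full row at import time. The in-memory set above only illustrates the semantics and would not scale to the full dump.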