As a developer, I want the streaming updater to retry on recoverable errors instead of failing, so that the pipeline can continue whenever possible.
- The streaming updater pipeline does not fail because of recoverable errors
- Calls to Wikibase are retried (see the current implementation of WikibaseRepository, which already has a retry mechanism)
- If the error that fails the pipeline is unrecoverable, Flink shouldn't retry ad infinitum
- Data errors (e.g. revisions after a page delete, a constant 404 on a new revision) should be logged to a file on HDFS in a structured form
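
The criteria above could be sketched roughly as follows. This is only an illustration of the intended behaviour, written in Python for brevity; it is not the actual updater code or the WikibaseRepository API. All names here (`RecoverableError`, `DataError`, `FailedEvent`, `fetch_with_retry`, `side_output`) are hypothetical, the retry limit and backoff are assumed values, and the text stream stands in for a file on HDFS.

```python
import json
import time
from dataclasses import asdict, dataclass
from typing import Callable, Optional, TextIO


class RecoverableError(Exception):
    """Hypothetical: transient failure (e.g. timeout, HTTP 5xx) that may succeed on retry."""


class DataError(Exception):
    """Hypothetical: permanent problem with the event itself (e.g. constant 404 on a new revision)."""


@dataclass
class FailedEvent:
    """Structured record for a data error; the field names are an assumption."""
    entity_id: str
    revision: int
    error_type: str
    message: str


def fetch_with_retry(
    fetch: Callable[[str, int], str],
    entity_id: str,
    revision: int,
    side_output: TextIO,
    max_attempts: int = 3,
    base_delay_s: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[str]:
    """Call fetch, retrying recoverable errors a bounded number of times.

    Data errors are written to side_output as one JSON object per line and
    the event is skipped (None is returned) so the pipeline can continue.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return fetch(entity_id, revision)
        except RecoverableError:
            if attempt == max_attempts:
                # Retries exhausted: propagate instead of looping forever.
                raise
            # Exponential backoff between attempts (assumed policy).
            sleep(base_delay_s * 2 ** (attempt - 1))
        except DataError as e:
            record = FailedEvent(entity_id, revision, type(e).__name__, str(e))
            side_output.write(json.dumps(asdict(record)) + "\n")
            return None
    return None
```

In this sketch, a recoverable error that persists past `max_attempts` is re-raised, so the job's restart policy would need to be bounded as well for the "no retry ad infinitum" criterion to hold end to end. Whether exhausted retries should fail the job or be logged like a data error is a design decision left open here.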