Narratives
- As a PET data engineer, I want the ability to observe the ImageMatchingAlgorithm dataset metrics, so that I can be able to ensure higher data quality for the experience of the end user.
- As a Researcher, I want the ability to view the ImageMatchingAlgorithm dataset metrics, so that I am able to train the model to improve overtime.
- As a Product Manager, I want the ability to view the ImageMatchingAlgorithm dataset metrics, so that I am able to make design decisions based on how the algorithm performs.
Acceptance Criteria
- As an PET Data Engineer, I want the ability to generate a csv file with the following metrics, so that I can have a baseline of how the pipeline performs.
- Total number of records (per wiki)
- Total number of images per page
- Per Wiki
- Summary of population statistics
- Size and counts of intermediate and final datasets
Notes
- This is the initial part of how we start to collect metrics. We will iterate so that we have designated stores for our collected metrics.