**Goal**
- Surface metric numbers from scraper data
- Think about how we support self-serve for accessing scraper data
**Steps**
- Retrieve the data from the results
- Document instructions on how the data can be fund/extracted
**Metrics**:
Should be retrievable from current scraper results
[] # of duplicate (identical) refs in a given wiki
[] # of articles with at least one identical ref
[] # of articles with more than 25 refs and have at least one identical reference,
[] proportion of duplicate refs in articles with >25 refs vs. proportion of duplicates in articles <25 refs, split by wiki.
- Assumption: longer reference lists have more duplicates because hard to find and manage
[] # of articles without references
[] ratio of reference to paragraph per wiki **( TBD: Can we even do that without a code change to the scraper and a re-run? )**