Description
Description
Related Objects
Related Objects
Event Timeline
Comment Actions
Running a preliminary proof of concept on a dataset of about 500,000 entities, reveals that <~0.02 % are actually being excluded by this condition (471524 out of 471591 are not instances of scholarly articles or astronomical objects). Should we consider checking for instances of transitive subclasses as well (direct subclasses of Q13442814 or Q6999 as well as subclasses of those subclasses). The following SPARQL Query reveals that there are 1577 of those: https://w.wiki/F2fy.
Comment Actions
I ran some presto queries on the full dump in the analytics cluster, here are the actual entity counts as of 2025-07-28:
| count | % of total | |
|---|---|---|
| Total | 116955797 | 100.00% |
| Humans | 12504270 | 10.69% |
| Subclasses of Humans | 183 | 0.00% |
| Astronomical Objects | 25892 | 0.02% |
| Subclasses of Astronomical Objects | 8387361 | 7.17% |
| Scholarly Articles | 45193358 | 38.64% |
| Subclasses of Scholarly Articles | 362353 | 0.31% |