Week: Dec 7 - Dec 13
Task: Identify the best strategy/APIs to find Commons categories that are within a certain radius of the specified GPS coordinates
Deliverable: Wiki memo: cURL requests that provide the right categories (30% false positives OK) for all possible use cases and edge cases
I will be using https://github.com/nicolas-raoul/apps-android-commons/wiki/Location-based-category-search to document the results of testing the categories obtained via different APIs/strategies against the benchmark of categories that have been manually entered (by the Commons community or by myself) for each picture.
Pictures are found by:
- Visiting https://commons.wikimedia.org/wiki/Special:Random/File
- Eliminating files that are not photos or could not possibly be obtained via a smartphone
- File must have location data available
For each picture, I aim to perform a comparison for instance:
- Manually: x0 good categories
- WikiData API: x1 good categories, y1 false positives
- Commons API: x2 good category, y2 false positive
- "Existing pics at that location" strategy: x3 good category, y3 false positives
WikiData API: I am running queries via TABernacle for instance
claim[373] AND around[625,49.27066666666666,14.073769444444444,0.1]
Property 373 signifies the Commons category. I start with radius 0.1km and increase the number if no categories are found.
Method C: Search for existing pics at that location" strategy
This is described in more detail at https://etherpad.wikimedia.org/p/commons-app-android-nearby-categories
Method D - Same as Method C, except we increase radius until at least 5 unique categories are found. Results on GitHub wiki.
Conclusion: We will go with Method D