Project title: Improve Commons Android app using privacy-friendly edge AI
Brief summary:
Privacy-friendly on-device AI could make the app more user-friendly and improve the quality of the metadata of uploaded pictures.
Tasks:
- Detect meaningless captions, such as camera-generated file names (e.g. DSCF1234)
- Detect joke/unwanted pictures
- Detect wrongly-set caption language
- Add illustrations to the warning pop-ups
- Use the caption that the user entered to suggest depictions (classification)
- Show metadata translations when needed (for instance if a picture only has an Arabic caption and the user can only read Spanish)
- Brainstorm and find other areas where AI/LLMs can really improve the quality of the pictures/metadata that users upload to Commons, then implement them. If none are found, implement phases 1 and 2 of Nearby notable things from OpenStreetMap instead.
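As an illustration of the first task, a cheap rule-based check could run on-device before (or alongside) any model. The sketch below is only a baseline under stated assumptions: the class name `CaptionCheck`, the method `isLikelyMeaningless`, and the regular expression are hypothetical and are not taken from the app's code base. It flags captions that look like camera-generated file names (a short letter prefix followed by digits) or that contain no letters at all.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of the Commons app: a rule-based baseline
// for spotting captions that carry no descriptive information.
final class CaptionCheck {

    // Assumed pattern: optional 2-5 letter prefix, optional '_' or '-',
    // at least 3 digits, optional file extension (e.g. DSCF1234, IMG_0042.jpg).
    private static final Pattern CAMERA_NAME = Pattern.compile(
            "^(?:[A-Za-z]{2,5}[_-]?)?\\d{3,}(?:\\.[A-Za-z0-9]{2,4})?$");

    private CaptionCheck() { }

    static boolean isLikelyMeaningless(String caption) {
        if (caption == null) {
            return true;
        }
        String trimmed = caption.trim();
        if (trimmed.isEmpty()) {
            return true;
        }
        if (CAMERA_NAME.matcher(trimmed).matches()) {
            return true;
        }
        // Captions without a single letter (timestamps, counters) describe nothing.
        return trimmed.codePoints().noneMatch(Character::isLetter);
    }

    public static void main(String[] args) {
        String[] samples = {
            "DSCF1234", "IMG_0042.jpg", "20240101_123456",
            "Eiffel Tower at night", "Route 66"
        };
        for (String sample : samples) {
            System.out.println(sample + " -> " + isLikelyMeaningless(sample));
        }
    }
}
```

A heuristic like this will produce false positives (a caption such as "abc123" would be flagged), so it is better suited to triggering a warning pop-up than to blocking an upload. An on-device model could then handle the cases that simple rules miss.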
Expected outcomes: AI-enabled version available at least to a test group on Play Store. GitHub article detailing pros/cons, technical challenges and early feedback received.
Skills required/preferred: Android programming. If using vibe-coding, the ability to understand what the code is doing and critically evaluate changes is required. Experience with LLM/ML coding is not strictly necessary, but a strong willingness to experiment with these technologies is needed.
Possible mentors: @Nicolas_Raoul (past maintainer), @RitikaPahwa4444 (current maintainer)
Expected size of the project: 350 hours
Rating: Intermediate
Microtasks: Please read gsoc.md, which explains how to choose an issue and perform a microtask.
Any other additional information for contributors: https://github.com/commons-app/apps-android-commons
Why are you proposing this project? What needs are you aiming to meet? Is it for your Wiki chapter, your community, etc?
To improve both the usability of the Commons Android app, which we have been maintaining for years (for its user community), and the quality of its output (for the wikis using the pictures).
What is the expected impact? What does success look like? How will this affect the needs you have identified?
20% fewer metadata quality issues in files uploaded to Wikimedia Commons.