IMPORTANT: Make sure to read the [Outreachy participant instructions](https://www.mediawiki.org/wiki/Outreachy/Participants) and [communication guidelines](https://www.mediawiki.org/wiki/New_Developers/Communication_tips) thoroughly before commenting on this task. This space is for project-specific questions, so avoid asking questions about getting started, setting up Gerrit, etc. When in doubt, ask your question on [Zulip](https://wikimedia.zulipchat.com/#narrow/stream/365030-gsoc23-outreachy26/topic/welcome) first!
===Brief summary
This mentorship is a component of an active [[ https://meta.wikimedia.org/wiki/Research:Content_Translation_language_imbalances | research project ]] about translation imbalances.
When we compare the number of translations made between pairs of languages, we find that far more articles are translated from languages with a larger wiki presence into languages with a smaller presence than in the reverse direction. English alone is the source language for 70% of all published translations, and the pattern seems to repeat for other colonial languages.
We would like to understand why this is. We've begun to find explanations in software design choices, but there are many potential influences behind each translator's choice of article and languages. Some of these factors might be: the number of articles available in each language, cultural richness and blind spots, suggestions made by software, the availability and quality of machine translation, and more.
The Outreachy component of our project will follow one of these possible avenues for investigation.
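As a rough illustration of the pairwise comparison described in this summary, the sketch below counts translations per (source, target) pair and computes the ratio between the two directions, plus the share of translations that use a given source language. The list of pairs, the function names `pair_ratios` and `source_share`, and all counts are hypothetical placeholders invented for this example. They are not project data and not the research project's actual method.

```python
from collections import Counter

# Hypothetical (source, target) language pairs, one entry per published
# translation. These values are invented for illustration only.
translations = [
    ("en", "sw"), ("en", "sw"), ("en", "sw"), ("sw", "en"),
    ("en", "yo"), ("en", "yo"),
    ("fr", "wo"), ("wo", "fr"), ("fr", "wo"),
]


def pair_ratios(pairs):
    """Map (a, b) to count(a->b) / count(b->a).

    Directions whose reverse count is zero are skipped, because the
    ratio is undefined there.
    """
    counts = Counter(pairs)
    ratios = {}
    for (src, dst), n in counts.items():
        reverse = counts.get((dst, src), 0)
        if reverse:
            ratios[(src, dst)] = n / reverse
    return ratios


def source_share(pairs, lang):
    """Fraction of all translations whose source language is `lang`."""
    if not pairs:
        return 0.0
    return sum(1 for src, _ in pairs if src == lang) / len(pairs)


ratios = pair_ratios(translations)
english_share = source_share(translations, "en")
```

Pairs with no translations in the reverse direction are left out of the ratio table rather than reported as infinite. A real analysis would probably want to flag those pairs separately, since they may be the most imbalanced ones.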
===Suggested skills
There are many entry points into this topic area, and candidates can choose where they want to engage. The areas we will work in include:
* User experience research
* Data engineering and analysis
* Node.js backend programming
* PHP backend programming
* Vue.js frontend programming
===Mentors
@awight, @Simulo
===Microtasks
Please feel free to work on tasks even if another candidate has started commenting, since there could be many ways of addressing each question and duplicated work is not wasted.
Initial tasks (mentors will continue to add tasks here throughout the contribution period):
* {T331199}
* {T331200}
* {T331201}
* {T331202}
* {T331204}
* {T331207}
* {T332643}