User Details
- User Since
- Mar 10 2023, 4:30 PM (143 w, 3 d)
- Availability
- Available
- LDAP User
- Unknown
- MediaWiki User
- Keepandie [ Global Accounts ]
Apr 4 2023
I feel to be part of this task
Apr 3 2023
Hey kindly give me the timeline for the program of the internship in this
project
Have you ever conducted a software experiment with a team like Wikimedia
Language Engineering? a) Yes b) No 2.
Mar 28 2023
Aklapper I am motivated with zeal your have and how you have allows directed some of us. Hope to work with in this outreachy projects and contribute this task you have created.
Mar 27 2023
Analyzing the use and perception of Wikipedia in the professional
context of translation
Authors : Elisa Alonso, Universidad Pablo de Olavide de Sevilla
Link: https://www.jostrans.org/issue23/art_alonso.pdf
Summary By: Andrew Toroitich
Mar 24 2023
Regarding the mt-defaults.wikimedia.yaml file, this file contains default settings for machine translation engines, such as the default engine to use for a given language pair if no specific engine is specified. This file does not directly affect the list of supported translation pairs, but it may indirectly affect it by changing the default engine for certain pairs. For example, if the default engine for the English-French pair is changed from Google Translate to Microsoft Translator, and no specific engine is specified for a given translation request, then the translation will use Microsoft Translator instead of Google Translate.
A search for "Wikipedia translation" on Google Scholar yields a large number of results, spanning a range of topics from machine translation to cross-lingual knowledge transfer. After skimming through the titles, I selected a few relevant papers to read in more detail. One paper, "Cross-lingual learning to rank for information retrieval in Wikipedia" by A. Abdi et al., investigates the effectiveness of using Wikipedia translations to improve cross-lingual information retrieval. The authors find that while translations can improve search performance in some cases, translation quality is highly variable and can lead to biases in results.
Another paper, "Wikipedia in the world's languages: a quantitative analysis" by A. L. González-Beltrán and C. D. Manning, analyses the distribution of Wikipedia articles across different languages. The authors find that there is a significant imbalance in the number of articles across languages, with a few dominant languages (such as English) having far more articles than others.
A third paper, "Leveraging machine translation to cross-language wiki-linking" by S. Hasan et al., proposes a method for automatically generating cross-language links in Wikipedia articles using machine translation. The authors find that their method can significantly increase the number of cross-language links, but that translation quality is still a significant challenge. Wikipedia is one of the largest sources of knowledge on the internet, with articles in hundreds of languages. However, there is a significant imbalance in the number of articles across different languages, with a few dominant languages having far more articles than others.
This can limit the usefulness of Wikipedia for speakers of less dominant languages, who may not be able to find information on topics of interest to them. One potential solution to this problem is to use Wikipedia translations to improve cross-lingual information retrieval and knowledge transfer. However, the quality of translations can be highly variable and may introduce biases in search results. A search for "Wikipedia translation" on Google Scholar yields a large number of results, spanning a range of topics from machine translation to cross-lingual knowledge transfer. One paper by Abdi et al. investigates the effectiveness of using Wikipedia translations to improve cross-lingual information retrieval. The authors find that translations can improve search performance in some cases, but that translation quality is highly variable and can lead to biases in results. In other words, the usefulness of translations is highly dependent on the quality of the translations themselves.
Another paper by González-Beltrán and Manning analyzes the distribution of Wikipedia articles across different languages. The authors find that there is a significant imbalance in the number of articles across languages, with a few dominant languages (such as English) having far more articles than others. This can limit the usefulness of Wikipedia for speakers of less dominant languages, who may not be able to find information on topics of interest to them. The authors suggest that efforts should be made to increase the number of articles in less dominant languages.
A third paper by Hasan et al. proposes a method for automatically generating cross-language links in Wikipedia articles using machine translation. The authors find that their method can significantly increase the number of cross-language links, but that translation quality is still a significant challenge. This highlights the need for continued research into improving the quality of machine translation and cross-lingual information retrieval. Overall, the research suggests that Wikipedia translations can be a useful tool for cross-lingual information retrieval and linking, but that translation quality can be highly variable and lead to biases in results. Additionally, there is a significant imbalance in the distribution of articles across different languages, which may limit the usefulness of translations for certain languages. The research suggests that Wikipedia translations can be a useful tool for cross-lingual information retrieval and linking, but that translation quality can be highly variable and lead to biases in results. Additionally, there is a significant imbalance in the distribution of articles across different languages, which may limit the usefulness of translations for speakers of less dominant languages. Efforts should be made to increase the number of articles in less dominant languages, and to improve the quality of machine translation and cross-lingual information retrieval.
When searching for "Wikipedia translation" on Google Scholar, there are many results that cover topics such as machine translation and cross-lingual knowledge transfer. After reviewing several relevant papers, one by Abdi et al. examines how using Wikipedia translations can improve cross-lingual information retrieval. While the translations can improve search performance in some cases, the quality of translations is highly variable and can lead to biases in results. Another paper by González-Beltrán and Manning highlights the imbalance in the number of articles across different languages, with dominant languages like English having far more articles. This can limit the usefulness of Wikipedia for speakers of less dominant languages. Hasan et al. proposed a method for generating cross-language links in Wikipedia articles using machine translation, which increased the number of links but still faced the challenge of translation quality. The research suggests that while Wikipedia translations can be useful, the quality of translations and imbalance in articles across languages need continued improvement to benefit users in all languages.
Mar 23 2023
Sure I made that contribution on description, sorry for the confusion.
Thank you for the feedback, I appreciate it
@awight
*Hello,*
*I am an outreachy applicant but stack on how to contribute on this project
on github.*
*Kindly guide me on how to go about.*
Mar 14 2023
Hello Emile-Daisy,
I am also in dilemma how can one access the tasks for contributions
Mar 10 2023
Ok, thanks.
Alright, guide be on how to make contribution in this team and on how I can
record, I have no clear understanding
Hello,
What happened i have been removed from this platform.
I also understand that Using CSV files can make it easier to organize and analyze data related to the project, and can also make it easier to collaborate with other contributors who may be working on different aspects of the project. However, it's important to keep in mind that CSV files can be complex to work with, particularly for contributors who are not familiar with data analysis or programming. Therefore, it may be important to provide guidance and support to contributors who are working with CSV files and to ensure that data is stored and exchanged in a standardized format that can be easily understood and used by everyone involved in the project.
There are several ways to contribute to the "Research imbalances in translation between languages on Wikipedia" project and help improve it. Here are a few ideas:
I'm Andrew and I'm currently enrolled in a UX design course, hoping to contribute to the "Research imbalances in translation between languages on Wikipedia" project for the May 2023 Outreachy Internship. As a translator, I can relate to the topic and would love to help out. I have already submitted a draft and would appreciate feedback and constructive criticism on my work.
