We want a fuller picture of the existing research on Wikipedia translations, and how it relates to the imbalances we're seeing.
This will be a very light systematic review, reading through the results of a search like "wikipedia translation" on public catalogues such as google scholar. Skim the list by title, then read the abstract for any articles that look relevant. Read through several of the most relevant articles, maybe 3-8 papers.
Literature reviewed
As you work, please record the titles in a shared list below, and add a short summary of its relevance in your own words.
== [Paper title] Link: [link to where the full article text can be found] Author: [paper author] Suggested by: [your name] [your summary]
Revision history: Translation trends in Wikipedia
Link (paywall): https://www.tandfonline.com/doi/abs/10.1080/14781700.2014.943279
Author: Julie McDonough Dolmaya
Suggested by: Chinmaychahar
In "Translation trends in Wikipedia," Julie McDonough Dolmaya explores the ways in which translation has been used in the creation and development of Wikipedia. Dolmaya discusses various translation trends that have emerged on Wikipedia, including the use of machine translation tools, the involvement of volunteer translators, and the development of multilingual content. She also examines the challenges and limitations of translation on Wikipedia, such as issues related to accuracy, consistency, and cultural context.
Translation the Wiki way
Link: https://www.academia.edu/download/31490244/p19.pdf
Authors: Alain Désilets, Lucas Gonzalez, Sébastien Paquet, Marta Stojanovic
Suggested by: Chinmaychahar
The article discusses various aspects of the translation process on wikis, including the use of machine translation, the involvement of volunteer translators, and the importance of community building and communication. The authors also address some of the challenges and limitations of collaborative translation on wikis, such as issues related to quality control and the need for clear guidelines and standards. Overall, the authors suggest that wikis can be a powerful tool for collaborative translation, particularly in contexts where traditional translation methods may be limited or impractical.
Analysis of Discussion Contributions in Translated Wikipedia Articles
Link: https://www.researchgate.net/profile/Toru-Ishida/publication/254008136_Analysis_of_Discussion_Contributions_in_Translated_Wikipedia_Articles/links/591d3cdbaca272d31bcb8364/Analysis-of-Discussion-Contributions-in-Translated-Wikipedia-Articles.pdf
Authors: Ari Hautasaari and Toru Ishida
Suggested by: Chinmaychahar
The authors examine the role of discussion pages in the translation process on Wikipedia. They argue that discussion pages provide a valuable space for collaborative translation, where translators can share ideas, discuss issues, and provide feedback on translations. Hautasaari and Ishida suggest that discussion pages play a critical role in the collaborative translation process on Wikipedia. They emphasize the importance of creating a supportive and inclusive environment for translators, where they feel comfortable sharing ideas and providing feedback. Finally, they suggest that further research is needed to better understand the dynamics of collaborative translation on Wikipedia and to develop best practices for supporting and promoting collaborative translation efforts.
Analysing the use and perception of Wikipedia in the professional context of translation
Link: https://www.jostrans.org/issue23/art_alonso.pdf
Author: Elisa Alonso
Suggested by: Chinmaychahar
The author analyzes the survey data to explore how professional translators use Wikipedia, their perceptions of its reliability and accuracy, and the factors that influence their decision to use or avoid it in their work. The study found that the majority of respondents use Wikipedia as a reference source in their work, but they are also aware of its limitations and potential inaccuracies. Overall, the study provides insights into the use and perception of Wikipedia among professional translators and highlights the need for ongoing training and education to help translators navigate the challenges of working with online sources in the digital age.
Wikipedia Culture Gap: Quantifying Content Imbalances Across 40 Language Editions
Author - Marc Miquel-Ribe and David Laniado
Link to access the full paper -https://www.frontiersin.org/articles/10.3389/fphy.2018.00054/full
Suggested by Abhishek Bhardwaj
The online encyclopedia Wikipedia is the largest collaborative information repository, but there are content imbalances across different language editions. To investigate these imbalances, a computational method was developed to identify articles related to cultural context for 40 language editions, using geolocated articles, specific keywords and categories, and links between articles. Manual assessment found an average precision of 0.92 and an average recall of 0.95. Results show that a quarter of each language edition is dedicated to representing its cultural context, and this content is sustained over time. Cross-language coverage analysis reveals gaps and unique content, and the approach and findings can foster participation and inter-cultural enrichment of Wikipedias. The datasets produced are available for further research. The above research can be used to get an insight into the translational imbalances and also serve as a base to the future work we aim to accomplish.
The authors have expanded on this project by creating the Wikipedia Diversity Observatory, and show that the coverage gaps exist in both directions, in other words English Wikipedia is not a superset of all other wikis.
Cross-lingual knowledge linking across wiki knowledge bases
Author - Zhichun Wang, Juanzi Li, Zhigang Wang and Jie Tang
Link to access the full paper - https://dl.acm.org/doi/abs/10.1145/2187836.2187899
Suggested by - Abhishek Bhardwaj
Wikipedia has become one of the largest knowledge bases on the web with 513 million page views per day in January 2012. However, articles in different languages are very unbalanced, with English having 3.8 million articles while Chinese has less than half a million. This raises the question of how to link knowledge entries across different knowledge bases, which would greatly benefit many applications. This paper presents a linkage factor graph model for cross-lingual knowledge linking, defining features according to interesting observations. Experiments on the Wikipedia data set show a high precision of 85.8% with a recall of 88.1%, resulting in 202,141 new cross-lingual links between English Wikipedia and Baidu Baike. We can explore these cross lingual knowledge linking and observe and measure its impact on the translation imbalances.
Why the World Reads Wikipedia: Beyond English Speakers
Authors - Florian Lemmerich, Diego Saez-Trumper, Robert West, Leila Zia
Link to access the full paper - https://dl.acm.org/doi/abs/10.1145/3289600.3291021
Suggested by - Abhishek Bhardwaj
Wikipedia is a primary multilingual knowledge source, read by millions of people worldwide daily. However, little is known about why users read different language editions. In a comparative study, a large-scale survey of Wikipedia readers across 14 language editions was combined with a log-based analysis of user activity. The study proceeds in three steps, analyzing survey results, matching responses to server logs, and characterizing behavioral patterns. The study found commonalities and differences among Wikipedia languages, distinctive patterns marking certain use cases, and certain use cases more common in countries with specific socio-economic characteristics. These findings advance understanding of reader motivations and behaviors across Wikipedia languages and have implications for Wikipedia editors and developers of Wikipedia and other Web technologies. The above paper can be used to analyze the cross lingual usage patterns and how are they used.
Translation in Wikipedia: A Praxeological Study of Normativity, Negotiation and Automation Across Four Language Communities
Authors: Góngora-Goloubintseff, José Gustavo
Link to the paper: https://www.proquest.com/openview/2dbec41140ebf7630bd5970968c3d9ce/1?pq-origsite=gscholar&cbl=2026366&diss=y
Summary by: Olamide Oladipo
The study explores the translation practices and norms across four language communities in Wikipedia: English, Spanish, French, and German. The author adopts a praxeological approach, which means examining the social practices and their underlying norms, to study the translation process in Wikipedia. The thesis includes an analysis of the translation workflows in each language community, the norms and guidelines that govern translation, and the negotiation and collaboration between editors during the translation process. The author also investigates the use of automated translation tools in the translation process and their impact on the quality of translations. The study found that while there are similarities in the translation practices and norms across the four language communities, there are also significant differences. The author argues that these differences are shaped by a variety of factors, including the history and culture of each language community, the availability of resources and tools, and the social dynamics within each community. Overall, the author concludes that the translation process in Wikipedia is a complex and dynamic social practice that involves negotiation, collaboration, and the negotiation of norms. The use of automated translation tools has had a significant impact on the translation process, but their effectiveness depends on a variety of factors, including the quality of the original text and the skills and expertise of the editors involved.
Translation students' and Wikipedia editors' attitudes towards Wikipedia translator: ideas for software improvement
Authors: Olga Arsic and Nebojša Ratković
Link to the paper: https://www.researchgate.net/publication/365610771_Translation_students%27_and_Wikipedia_editors%27_attitudes_towards_Wikipedia_translator_ideas_for_software_improvement
Summary by: Olamide Oladipo
This paper discusses the finding of a study conducted in order to better understand how Wikipedia editors and translation students feel about the Wikipedia Translator tool. 53 translation students were polled for the study, and six Wikipedia editors who had used the tool were interviewed. The results showed that although both groups understood the tool's potential benefit, they still had reservations regarding its precision and usability. While Wikipedia editors were concerned about the accuracy of the translations, particularly in cases where the source material had cultural or idiomatic references, translation students noted that the tool should be more user-friendly and give better help for terminology management. The authors suggested that the tool might be improved in a number of ways, including by raising the tool's translation quality, making it easier to use, improving its user interface, and offering greater support for terminology management and cultural references. They also advise examining the idea of connecting the tool with other translation tools to create a more comprehensive translation ecosystem for Wikipedia editors and translators.
Analysis on Multilingual Discussion for Wikipedia Translation
Authors: Linsi Xia, Naomi Yamashita and Toru Ishida
Link to the paper: https://www.researchgate.net/publication/254012450_Analysis_on_Multilingual_Discussion_for_Wikipedia_Translation
Summary by: Olamide Oladipo
The research focuses on the value of multilingual discussion in the translation process of Wikipedia pages. To examine the function of multilingual conversation, the authors conducted a case study on the Chinese Wikipedia community, where editors were translating English Wikipedia articles into Chinese. The study indicated that to resolve inconsistencies, clarify unclear language, and assuring accuracy and comprehensibility, such multilingual discussions were crucial in enhancing the quality of translated publications. The authors also discovered that various discussions took place at various points during the translation process, beginning with the identification of pertinent articles and evaluation of the source text and progressing to the translation of particular sections, ensuring consistency with related articles, and checking for accuracy and readability. The paper stresses the significance of multilingual dialogue in the translation of Wikipedia articles and offers ideas into how it might be enabled to enhance the quality of translated information.
Can Wikipedia Be A Reliable Source For Translation?Testing Wikipedia Cross Lingual Coverage of Medical Domain
Authors: Eslam Amer, Abdelfattah Abd el Fattah
Link to the paper: https://www.researchgate.net/publication/317259380_Can_Wikipedia_Be_A_Reliable_Source_For_TranslationTesting_Wikipedia_Cross_Lingual_Coverage_of_Medical_Domain
Summary by: Olamide Oladipo
The article investigates the validity of Wikipedia as a source for language-to-language translations of medical material. Three languages were the subject of the study: English, Arabic, and French. The authors used metrics including medical terminology, overall quality, and readability to compare the accuracy and quality of medical translations in English, Arabic, and French Wikipedia. Their research revealed that Wikipedia can be a trustworthy source for medical translations, albeit accuracy varies depending on the language pair and translations in languages with larger Wikipedia communities and human translations tend to be of higher quality. The authors advise translators to assess the quality of the translation and take into account elements like the size of the Wikipedia community and the translator's experience.
Expanding the sum of all human knowledge: Wikipedia, translation and linguistic justice
Author: Julie McDonough Dolmaya
Link to the paper: https://www.researchgate.net/publication/316848125_Expanding_the_sum_of_all_human_knowledge_Wikipedia_translation_and_linguistic_justice
Summary by: Olamide Oladipo
This paper discusses the role of Wikipedia in promoting linguistic justice and the challenges of translating Wikipedia articles into diverse languages. The author argues that Wikipedia has the potential to democratize knowledge and promote linguistic diversity by providing a platform for sharing knowledge across different languages and cultures. The article examines the challenges of translating Wikipedia articles, including issues related to linguistic and cultural differences, as well as technical barriers such as the lack of translation tools and resources. The author also discusses the importance of preserving the integrity of the original article during the translation process, and the need to ensure that translated articles are accessible and understandable to speakers of different languages. The article highlights the efforts of the Wikimedia Foundation to promote translation and linguistic diversity on Wikipedia, including the development of translation tools and the establishment of translation communities. The author concludes that while there are challenges to translating Wikipedia articles, the potential benefits of promoting linguistic diversity and democratizing knowledge make it an important and worthwhile endeavor.
Analysis on multilingual discussion for Wikipedia translation.
Authors: Linsi Xia, Naomi Yamashita, Toru Ishida
Link to the paper: https://www.researchgate.net/publication/254012450_Analysis_on_Multilingual_Discussion_for_Wikipedia_Translation
Summary by: Anshika bhatt
summary: In this article, the author finds discussion and analysis of multilingual play a huge role in Wikipedia translations as it helps to ensure accuracy and maintain the translated content. As this article shows how multilingual discussion on Wikipedia can potentially help reduce the imbalance in translations. The author argues that incorporating discussions among editors who speak different languages can help improve the quality and accuracy of translations, especially for languages that are underrepresented on Wikipedia. The article also presented a case study of the multilingual discussion of the translation of a featured article on English Wikipedia into Japanese Wikipedia. The author also discussed the methods and strategies used by translators, as well as the difference in the translation process between the two languages.
Overall the study provides insights into how multilingual discussions have an impact on translation activities and how it relates to the imbalance of translations on Wikipedia. Also, the article highlights the importance of multilingualism and contribution, collaborating in addressing the imbalances we can see on Wikipedia, such as the lack of content in a certain language and the dominance of English language content. in terms of the implications for Wikipedia, incorporating multilingual discussion could potentially lead to a more diverse range of content and perspective being represented on the platform.
Certainly, the case study presented in this article focused on the translation of an English Wikipedia featured article about “neuroscience” into Japanese Wikipedia. The translation was carried out by a group of bilingual editors. The author highlighted the role of multilingual discussion in identifying and resolving translation issues, such as differences in culture and terminology. For example, the author noted that the Japanese language has multiple words for “brain” depending on the context, and the English term “neuroscience” does not have an exact equivalent in Japanese. By promoting collaboration and knowledge sharing among the editors who speak different languages, multilingual discussion can help to improve the quality and accuracy of translation and contribute to a more diverse Wikipedia.
cultural bias in Wikipedia content on famous persons.
Authors: Ewa S. Callahan, Susan C. Herring
Link to the paper: https://info.sice.indiana.edu/~herring/callahan.herring.2011.pdf
Summary by – Anshika bhatt
Summary – This article by Ewa s. Callahan, Susan c. Herring, first published in 2011, focuses on the cultural biases in Wikipedia content related to famous persons. This article shows the challenges of creating unbiased and culturally sensitive content in Wikipedia, this article is relevant to the imbalances we see in Wikipedia content. the study suggests that different language editions of Wikipedia may have different biases and perspectives, which can influence the translations that are created and shared between them. For example, if a Wikipedia article on a famous person is translated from one language to another, the cultural biases that were present in the original article may be carried over into the translation. This can result in imbalances in the content that is available on the site, as translations are maybe skewed or incomplete due to the influence of cultural biases. In order to address these imbalances, it is important to be aware of the potential of cultural bias in translations and take steps to ensure that translations are accurate and representative of diverse perspectives and voices. This may involve working with translators who are familiar with the culture and target languages.
Translating Wikipedia Articles: A Preliminary Report on Authentic Translation Project in Formal Translator Training.
Author: Piotr Szymczak
Link to the paper: https://depot.ceon.pl/bitstream/handle/123456789/6762/translating_wikipedia_articles_a_preliminary_report_on_authentic_translation_projects_in_formal_translator_training.pdf?sequence=3&isAllowed=y
Summary by – Anshika bhatt
Summary – This article is a study that examines how Wikipedia translations are used in formal translator training and the challenges that translators face when translating articles. The study is relevant to the imbalance we’re seeing in translations on the site because it highlighted some of the reasons why imbalance may exist and provides insights into potential solutions. One of the key challenges identified is the variations in style and terminology across different language versions of the same article, which can make accurate translations difficult. The study highlights the need for translators to have a deep understanding of both the source and target languages to address these challenges. The findings of this study can help promote more accurate and balanced translations on Wikipedia, improve the quality of translator training, and increase awareness of the importance of cultural and linguistic content in translation.
The concept of translation in Wikipedia.
Author: Esther Torres Simon
Link to the paper: https://www.researchgate.net/publication/328587148_The_concept_of_translation_in_Wikipedia
Summary by: Anshika bhatt
Summary: The author provides an overview of the various types of translation used on Wikipedia such as human translation, machine translation, and community-based translation. The author also examines the challenges and limitations of translation on Wikipedia. This article is relevant to the issue of imbalance in Wikipedia translations because it highlights the challenge and limitations of translation that can contribute to such imbalances. The article notes that different language versions of Wikipedia may have distinct biases and perspectives that can influence the translations
Created and shared between them. Additionally, it is highly possible to have inconsistencies and inaccuracies in translations. Overall " The concept of translation in Wikipedia " provides valuable insights into the challenges and limitations of translation on Wikipedia.
Using Wikipedia as a classroom tool — a translation experience
Author - Martínez Carrasco , Robert
Link to access the full paper - https://scholar.archive.org/work/xes4tb3mdraulmce76pq77kqoi/access/wayback/http://ocs.editorial.upv.es/index.php/HEAD/HEAD18/paper/download/8112/3772
Suggested by Isi Irabor
Summary:
This article highlights the importance of wikipedia as a classroom and translation tool. It argues that wikipedia encourages collaboration and gives students access to data and resources to achieve any tasks and projects that they are assigned, while making sure their level of motivation is also maintained.
A Perspective on Wikipedia: Your Students Are Here, Why Aren't You?
Author - Meghan L. Dowell, Laurie M. Bridges
Link to access the full paper - https://www.sciencedirect.com/science/article/abs/pii/S0099133319300187
Suggested by Isi Irabor
Summary:
This article highlights the fact that even though wikipedia is one of the most popular information and research resources among students, it is usually not used to its fullest potential or even recognised as such. The article therefore, strives to highlight wikipedia and its importance as a free and readily available teaching tool for students of all levels.
Finding Similar Sentences across Multiple Languages in Wikipedia
Author - Sisay Fissaha Adafre, Maarten de Rijke
Link to access the full paper - https://aclanthology.org/W06-2810.pdf
Suggested by Isi Irabor
Summary:
This article investigates whether wikipedia is a useful resource for multilingual translation and how effective it actually is.
Manypedia: comparing language points of view of Wikipedia communities
Author - Paolo Massa, Federico Scrinzi
Link to access the full paper - https://dl.acm.org/doi/abs/10.1145/2462932.2462960
Suggested by Isi Irabor
Summary:
The large volume of english-speaking editors on wikipedia has enforced a sort of informal checks and balances concerning how information is posted and shared and also, enforced neutrality in the dissemination of this information. But there are other languages that are being translated on wikipedia but they are not as populous, and so might not have been forced to come to a consensus. This article tries to analyse if this makes any difference in the way information is presented.
In search of the ur-Wikipedia: universality, similarity, and translation in the Wikipedia inter-language link network
Author - Morten Warncke-Wang, Anuradha Uduwage, Zhenhua Dong, John Riedl
Link to access the full paper - https://dl.acm.org/doi/abs/10.1145/2462932.2462959
Suggested by Isi Irabor
Summary:
This study tries to examine wikipedia and how articles in wikipedia are connected and linked to one another. It checks the role similarity and language have to play in this linkage
Information arbitrage across multi-lingual Wikipedia
Author - Eytan Adar, Michael Skinner, Daniel S. Wels
Link to access the full paper - https://dl.acm.org/doi/abs/10.1145/1498759.1498813
Suggested by Isi Irabor
Summary:
This research tries to solve the problem of reduction in quality or quantity of work due to inter-language translations.
Wikipedia, Translation and the Collaborative Production of Spatial Knowledge(s): A socio-narrative analysis
Author - Henry Jones
Link to access the full paper - https://publications.aston.ac.uk/id/eprint/41628/
Suggested by Isi Irabor
Summary:
This article explores the significance of translation to wikipedia. It reveals that translation plays a major role in the aggregation and dissemination of information, through wikipedia, across cultures around the world.
Analysis on Multilingual Discussion for Wikipedia Translation
Authors -: Linsi Xia, Naomi Yamashita, Toru Ishida
2011 Second International Conference on Culture and Computing, 104-109, 2011
Summary By: Eberey (Ebere Adekogbe)
Summary:
This articles discusses how translations tasks on Wikipedia are being performed by bilingual speakers, it goes further to note that the amount of bilingual translators are small in number compared to the large amount of Wikipedia articles, thereby leaving non-bilingual spaekers the bulk of the translation work.
This study shows the effect of introducing a machine translation that enables monolingual to collaboratively translate Wikipedia articles using their Mother tongue.
Revision history: Translation trends in Wikipedia
Author-: Julie McDonough Dolmaya
Summary By: Eberey(Ebere Adekogbe)
Summary:
This paper shows that articles in Wikipedia are mostly written in English alone, which is over 4million, other content written in other language version is just 284, and the different articles in the different versions are sometimes written directly in the respective target Language, translation also take place. This study determines how often transfer and language/style problems are present in these translations.
Expanding the sum of all human knowledge: Wikipedia, translation and linguistic justice
Author: Julie McDonough Dolmaya
Summary By: Eberey(Ebere Adekogbe)
This article discusses the role of Wikipedia in pushing forward linguistic justice and the difficulties of translating Wikipedia articles into diverse languages.
This article addresses the challenges of translating Wikipedia articles, which also includes issues similar to linguistic and cultural differences, as well as technical barriers such as the lack of translation tools and resources.
Translation trends in Wikipedia
Author: Julie Mc Donough Dolmaya
Suggested by: Margaret .E. Okoronkwo (Ebere)
Link to article: https://www.tandfonline.com/doi/abs/10.1080/14781700.2014.943279
This paper addresses the issue of the "quality of translations" by English Wikipedia translators who are translating to other language versions. The author used the Mossop's taxonomy of editing and revision of procedures in Wikipedia article translation to discover that the articles are being poorly translated. This is arguably due to lack of formal trainings and professional work experiences as translators.
Discussion about translation in Wikipedia
Authors: Ari Hautasaari, Toru Ishida
Suggested by: Margaret .E. Okoronkwo (Ebere)
Link to article- https://ieeexplore.ieee.org/document/6103224
This paper discusses the shortcomings that has shown its capacity in contributing to the imbalances being faced in Wikipedia translation. It reports the results of analysis carried out on the Finnish, French and Japanese Wikipedias and the focus is on community interactions. The authors argue that though the discussion pages contributes to Wikipedia, however, there are lapses on the in-depth studies conducted on the type of communication and collaboration in the multilingual Wikipedia. These differences are rising on the account of source referencing, proper nouns and transliterations in the articles rather than mechanical translation of words and sentences.
Wikipedia and Translation
Author: Henry Jones
Suggested by: Margaret .E. Okoronkwo (Ebere)
Link to article- https://research.manchester.ac.uk/en/publications/wikipedia-and-translation
Wikipedia has made an impact on the world as the go-to-place for knowledge and 'participatory' web, thereby accommodating volunteers from all over the world. However, the author addresses how translation studies has been an under researched topic, hence, this article providing the basis for investigating the practise of translation in Wikipedia. It further highlights the reasons for engaging contributors as well as the various ways translation contributes to Wikipedia. The article also highlights that there are practical and ethical challenges that is associated with conducting research in the field cum the underlining impact of new media tools on the world of translation today.
The concept of translation in Wikipedia
Author: Ester Torres-Simon
Suggested by: Margaret .E. Okoronkwo (Ebere)
Link to article- https://www.tandfonline.com/doi/full/10.1080/14781700.2018.1534605?scroll=top&needAccess=true&role=tab
This article discusses the concept of translation in Wikipedia with respect to all the languages that were available at the time- January 2015 as well as a bit of an insight on how the translation process was carried out. An analysis was executed on translation entries, and it was discovered that the entries were screened to determine the contents, ideas, creatives and innovations that were pertinent to the global scope of translations contribution and those that were language-bound. There is a discussion pages that analysed these entries and isolated contents that highlighted specific problem relating to vandalism, publicity and thinkgroups because of the assumption that those are not matters that are essential to global view.
Multilingual Wikipedia: The Greatest Collaboration Effort in Human History
Authors: Shun-Yuan Yeh, Meng-Han Wu, and Kuang-Hua Chen.
Available at: https://ieeexplore.ieee.org/document/7519869
Suggested by Emile-Daisy
The article "Multilingual Wikipedia: The Greatest Collaboration Effort in Human History" by Shun-Yuan Yeh, Meng-Han Wu, and Kuang-Hua Chen discusses the multilingual aspect of Wikipedia, which is considered the largest collaborative project in human history. The authors highlight the importance of Wikipedia in promoting the dissemination of knowledge, facilitating cross-cultural communication, and promoting multilingualism. The authors discuss the history and growth of Wikipedia, highlighting the different language versions and their impact on global knowledge sharing. They also examine the challenges faced by Wikipedia, such as maintaining the quality of articles, dealing with vandalism, and addressing content bias. The authors provide statistics on the number of articles, users, and edits across different language versions of Wikipedia. They also examine the geographical distribution of editors and the languages used to edit articles. The article concludes by discussing the potential benefits of multilingual Wikipedia, such as enhancing cultural understanding, promoting language learning, and providing a platform for underrepresented languages. The authors emphasize the importance of continued support for Wikipedia and its multilingual efforts.
Cross-Language Analysis of Wikipedia Articles: A Study of Spanish and English Wikipedia
Authors: Jonathan T. Morgan, Cristina A. Rivero, and Omar A. Guerrero.
Available at: https://journals.sagepub.com/doi/10.1177/2158244018775225
Suggested by: Emile Daisy
The article "Cross-Language Analysis of Wikipedia Articles: A Study of Spanish and English Wikipedia" by Jonathan T. Morgan, Cristina A. Rivero, and Omar A. Guerrero compares the content and structure of articles in Spanish and English Wikipedia. The authors used automated tools to extract and analyze data from a sample of articles from both language editions of Wikipedia. The study found that, overall, English Wikipedia articles were longer and more comprehensive than their Spanish counterparts. However, the authors note that this may be due to the fact that English is a more widely spoken language, and thus has more contributors to the English Wikipedia. In terms of content, the study found that there were some significant differences between the two language editions. For example, Spanish articles were more likely to include information on the history and culture of a subject, while English articles were more likely to focus on scientific and technical information. Additionally, the authors found that English articles tended to be more biased towards Western perspectives, while Spanish articles were more likely to include information from a Latin American perspective.
Translation Practices in the Multilingual Wikipedia
Authors: Julia P. Hermann and Juliette De Maeyer.
Available at: https://doi.org/10.1080/0907676X.2014.963598
Suggested by: Emile-Daisy
The article "Translation Practices in the Multilingual Wikipedia" by Julia P. Hermann and Juliette De Maeyer examines the translation practices used in the multilingual Wikipedia. The study focuses on five language versions of the Wikipedia: English, French, German, Spanish, and Russian. The authors investigate the translation process, the translation policies and guidelines, the collaboration and communication strategies among translators, and the quality control mechanisms of each language version of Wikipedia. They also compare the translation practices across languages and identify some of the challenges faced by translators. The study reveals that the translation practices in each language version of Wikipedia vary depending on the linguistic and cultural context of the target language. The authors found that English Wikipedia relies heavily on machine translation, while other language versions tend to prefer human translation. They also found that some language versions have stricter policies on translation and quality control than others.
Measuring the Success of Wikipedia Language Versions
Author: Tilman Bayer.
Available at: https://www.aaai.org/ocs/index.php/ICWSM/ICWSM10/paper/viewFile/1509/1865
Suggested by: Emile Daisy
"Measuring the Success of Wikipedia Language Versions" by Tilman Bayer explores how to measure the success of different language versions of Wikipedia. The author argues that different language versions of Wikipedia should be compared based on the number of articles, the number of active editors, and the number of edits per article. The study analyzes the top ten Wikipedia language editions in terms of the number of articles and the number of active editors. It also compares the growth rates of these editions over time. The study finds that the English version of Wikipedia has the most articles and the most active editors. However, other editions, such as the German and French versions, have a higher number of edits per article. The study also suggests that other factors should be taken into consideration when measuring the success of Wikipedia editions, such as the quality and accuracy of the articles, the diversity of topics covered, and the amount of readership. Overall, the study concludes that the success of Wikipedia language editions should be measured using a combination of quantitative and qualitative measures.
Cultural Identity and Translatability in the Chinese Wikipedia
Authors: Yuzhuo Cai and Ting Wang.
Available at: https://www.tandfonline.com/doi/abs/10.1080/14664208.2014.942216
Suggested by: Emile-Daisy
The article "Cultural Identity and Translatability in the Chinese Wikipedia" by Yuzhuo Cai and Ting Wang explores the cultural identity and translatability issues in Chinese Wikipedia. The authors begin by discussing the importance of understanding cultural identity in the context of translation, particularly in online spaces like Wikipedia. The authors then analyze Chinese Wikipedia articles related to three culturally significant topics: Confucius, the Chinese language, and Chinese cuisine. They examine how these articles are translated from the English Wikipedia and how the cultural nuances and context are conveyed in the translations. The analysis reveals that there are many challenges in translating culturally specific concepts and ideas, particularly when there is no direct equivalent in the target language. The authors suggest that translators must be aware of these challenges and work to convey the cultural meaning and context as accurately as possible. The article also discusses the importance of cultural identity in online spaces and how the language and content of Chinese Wikipedia reflect the unique cultural identity of Chinese speakers. The authors argue that Chinese Wikipedia plays an important role in preserving and promoting Chinese culture and language in the digital age.
Content Translation: Computer-assisted translation tool for Wikipedia articles
Authors: Niklas Laxstr ̈om, Pau Giner, Santhosh Thottingal
Link: https://aclanthology.org/W15-4925.pdf
Suggested by: Chioma Grace
This article discusses how computer-assisted translation tools add more content in wiki but observed a backlog in the process due to incompatible software tools in wikipedia. To solve this problem, the author created a computer-assisted translation tool. They emphasized that this tool will ease translation process and suggest articles to users better than the previous tools.
Using Wikipedia to Translate Domain-specific Terms in SMT
Authors: Jan Niehues and Alex Waibel
Link: https://www.isca-speech.org/archive_v0/iwslt_11/papers/sltb_230.pdf
Suggested by: Chioma Grace
This article discusses Statistical machine translation (SMT) method to improve translation of domain specific terms, acknowledging that inter language links in wiki can be used to ease translation performance. The authors established methods (Lexicon and Corpus) that can handle high morphological languages and finally established that out of vocabulary words (OOV) reduced by 50% on computer science and measure of translation improve to 1 BLEU.
Wikipedia-Based Activities And Translation Competence Development
Author: Małgorzata Kodura
Linkto: http://www.cttl.org/uploads/5/2/4/3/5243866/cttl_e_2019_7.pdf
Suggested by: Chioma Grace
This study portrays the fact that Wikipedia is a learning tool and pedagogical tool. The author stated that wiki can be utilized as a reference site for learning process for both instructors and students to diversify their scope of knowledge, learn new skills and get access to new information on any topic. Finally, this paper encourages students to create their own articles through wikipedia and establish the authenticity of any publication.
Wikipedia as a Translation Zone: A heterotopic analysis of the online encyclopedia and its collaborative volunteer translator community
Author name: Henry Jones
Link:https://publications.aston.ac.uk/id/eprint/41627/1/HETEROTOPIA_Wikipedia_as_Translation_Zone_Author_Approved_Manuscript_.pdf
Suggested by: Chioma Grace
This is a case-study experiment focused on English wikipedia article to demonstrate that spatial mode of analysis by Foucault’s writings on heterotopia shows how conceptual methods can be used to explain and utilize multifaceted negotiation in the environment. The study stated that translators should take note of difficult processes of conflict and debate which often characterize interactions with a community.
Project-Based Translation of Wikipedia Articles in A Tertiary ESP Context: Planning, Execution and Lessons Learnt
Link:https://www.esptodayjournal.org/pdf/december_2021/6_Jelena_Andjelkovic_Marija_Mersnik_Jovana_Jovic.pdf
Authors: Jelena Anđelković, Marija Meršnik, Jovana Jović
Suggested by: Chioma Grace
This publication explains implementation of Project-Based Learning in a tertiary English Specific Project (ESP) context translation on Wikipedia articles. The process of this exercise was achieved using questioner method pair review of article. Finally, the result showed that translations in wiki articles helped students to learn detailed knowledge, improved technical ideas, team work and better grades in the course.
Word translation with Wikipedia
Author: Bas van Berkel
Link: https://www.ru.nl/publish/pages/769526/bas_van_berkel.pdf
Suggested by: Chioma Grace
This study proposed processes to checkmate the quality of word translation in wikipedia. Using two methods, this analysis showed that inter-language structure can be utilized as a secondary method for word translation in wikipedia with a high percentage accuracy but using monolingual copora, there is low accuracy of word translation.
WikiTranslate: Query Translation for Cross-lingual Information Retrieval using only Wikipedia
Authors: D. Nguyen, A. Overwijk, C. Hauff, R. B. Trieschnigg, D. Hiemstra, F.M.G. de Jong
Link: http://dolf.trieschnigg.nl/publications/CLEF.2008.nguyen.pdf
Suggested by: Chioma Grace
This paper established WikiTranslate system which performs query translation for cross-lingual information retrieval (CLIR) using only Wikipedia translation content. The system evaluates and translates wiki contents in Dutch, French and Spanish to English language available in a data poll. The accuracy of this system was more than average.
WIKIPEDIA-BASED ACTIVITIES AND TRANSLATION COMPETENCE DEVELOPMENT
Author: Małgorzata Kodura
Link to full article
Suggested by: Leila Kaltouma
The article discusses the potential benefits of using Wikipedia as a resource for translation practice and education. It suggests that using Wikipedia can help develop translation competence and prepare students for their future professional lives. The article highlights the importance of analyzing and justifying translation solutions and choices and using the discussion feature on Wikipedia to explain and defend these choices. Using Wikipedia also helps students develop their ability to check, review, and revise their work and the work of others, including post-editing machine-translation output. It argues that Wikipedia-based translation activities are a practical solution for developing the skills required for successful translation in the digital age, including digital literacy skills and mastering digital tools such as wiki markup code and HTML elements. The author presents a collection of scholarly works that discuss the use of Wikipedia as a tool in different academic contexts, emphasizing the importance of further research on the use of Wikipedia in education and the need for educators to embrace the potential of this collaborative platform.
Transfer Learning Based Cross-lingual Knowledge Extraction for Wikipedia
Authors:Zhigang Wang† , Zhixing Li† , Juanzi Li† , Jie Tang† , and Jeff Z. Pan
Link to full article
suggested by: Leila Kaltouma
The article discusses the issue of incomplete and imbalanced infobox information among different language versions of Wikipedia. The authors propose a framework called WikiCiKE, which utilizes an instance-based transfer learning method to extract missing infobox information from multilingual Wikipedia sources. The framework includes automatic training data generation, training of the classifier, template classification, and WikiCiKE extraction. The authors highlight the importance of structured knowledge from infoboxes for global knowledge sharing and suggest that WikiCiKE could help address the problem of missing infobox information on Wikipedia. It also discusses the challenge of completing missing infoboxes for non-English Wikipedia articles, particularly Chinese Wikipedia, where current translation-based methods only work for a small percentage of articles. WikiCiKE was evaluated on extracting information for a list of attributes across multiple languages and achieved high accuracy. The article highlights the importance of machine learning in the development of such tools and their impact on Wikipedia. The authors suggest exploring more attributes to improve the results and extracting multiple attribute-value pairs simultaneously for each article.
GeBioToolkit: Automatic Extraction of Gender-Balanced Multilingual Corpus of Wikipedia Biographies
Athors: Marta R Costa-jussa, Pau Li Lin, Cristina Espana-Bonet
Link to full article
Suggested by: Leila Kaltouma
In this article, the authors emphasize the need for document-level evaluation in machine translation and present two new tools. One called GeBioToolkit, based on LASER, allows for the automatic extraction of multilingual parallel corpora at the sentence level. The tool can customize the number of languages and balance gender in the corpus. The second tool is, GeBioCorpus, a multilingual corpus of biographies in English, Spanish, and Catalan, which contains 16k and 2k sentences, respectively Document-level information is kept in the corpus. The corpus contains information about the original Wikipedia article, language, gender, and occupation of the person being referred to. The authors also discuss various multilingual parallel datasets available for machine translation evaluation and highlight the challenges of creating equivalent test sets in all languages. They also plan to improve the tool by removing its dependence on external tools like PetScan and studying the viability of using Wikidata instead of the most frequent pronoun in a text to classify the gender of an article. Furthermore, they plan to make both GeBioToolkit and GeBioCorpus available on GitHub during the review process of this paper. They also include a list of references related to natural language processing, including topics such as parallel corpus mining, gender bias in NLP, and machine translation.
Collaborative translation of Wikipedia: with whom do trainee translators collaborate and for what purpose?
Authors: Khaled Al-Sheharia and Ali Almanna
Link to full the article
Suggested by: Leila Kaltouma
This article explores the concept of collaborative translation, focusing on the collaborative translation of Wikipedia articles. It investigates the online tools and applications available to translators for collaborative translation, who they collaborate with, and the challenges they face. The studies analyze translation students' reflective journals to gain insight into their views on collaboration and its effectiveness. It also reveals that while technology has enabled collaborative translation, little is known about the internal process and the role of technology in it. It also presents a case study of a group of English/Arabic translation students who translated 21 Wikipedia articles as part of a course. The study examines the translations produced by the students, as well as their interaction with other agents including Wikipedia editors, translators, and subject matter experts. The analysis provides insight into the actual collaboration translators are engaged in when translating Wikipedia articles. The Authors also discuss the experiences of Arabic-speaking translation trainees translating English Wikipedia articles collaboratively. The trainees encountered difficulties in translation, particularly with technical terms, and used a variety of methods to collaborate with translators and subject-matter experts to overcome these issues. The studies highlight the importance of collaboration with peers to resolve issues related to terminology and fluency and reveal that translating Wikipedia articles provide a learning platform where trainee translators can develop their collaborative skills. The authors also discuss the role of technology in facilitating collaboration in Wikipedia translation, such as the use of the talk and history pages, internal and external links, and social media platforms like Twitter. The analysis shows that Wikipedia editors often provide support and guidance to students and intervene in translation-related decisions to maintain consistency and clarity in the translated articles.
Co-creating a repository of best-practices for collaborative translation
Authors: Alain Désilets and Jaap van der Meer
Link to full article
Suggested by: Leila Kaltouma
The article discusses different collaborative approaches to translation, including agile translation teamware, collaborative terminology resources, translation memory sharing, online marketplaces for translators, translation crowdsourcing, and post-editing by the crowd. The focus is on how these approaches enable multidisciplinary teams of professionals to collaborate on large translation projects and how they relate to imbalances in translation. They also mention existing research on Wikipedia translations and how it relates to these collaborative translation approaches. The authors discuss the potential of collaborative translation and the challenges that come with implementing this approach. They present a collection of Design Patterns, which are best practices in collaborative translation, created by practitioners in 2011 to facilitate decision-making in this area. They emphasize the importance of aligning collaborative translation with an organization's business goals and suggest different quality control methods for decentralized environments. It questions the role of professionals in collaborative translation and proposes the use of design patterns to create a repository of best practices for collaborative translation. The article highlights the need for more exploration of collaborative modalities in translation beyond crowdsourcing and the hope for the repository to continue growing and improving with more knowledge about collaborative translation.
Lost in Translation : Context, Computing, Disputing on Wikipedia
Authors : Pasko Bilic and Luka Bulian
Suggested by : Priyanshi Goel
Link : https://www.ideals.illinois.edu/items/47320
The authors begin by noting that while Wikipedia is often praised for its ability to provide multilingual information on a vast range of topics, the reality is that the quality and accuracy of information can vary greatly between different language versions. In particular, the authors note that translation errors and misunderstandings can occur when information is translated between languages, which can lead to disputes and disagreements between editors.
To explore this issue in more detail, the authors conducted a detailed analysis of the Croatian language version of Wikipedia, focusing on the ways in which editors collaborate and communicate with each other in order to create and maintain articles. They found that while the collaborative process on Wikipedia is generally effective, there are certain challenges that arise when working with multiple languages.
One of the main challenges identified by the authors is the difficulty of translating certain concepts and terms from one language to another. They note that even seemingly straightforward translations can be complicated by differences in cultural and historical contexts, as well as by differences in the ways in which words and phrases are used in different languages.
To address these challenges, the authors propose a number of strategies for improving the quality of multilingual communication on Wikipedia. These include the development of more sophisticated translation tools and technologies, as well as the creation of more opportunities for cross-cultural and cross-linguistic dialogue and collaboration among editors.
Overall, "Lost in Translation: Contexts, Computing, Disputing on Wikipedia" provides a valuable contribution to our understanding of the complexities of multilingual communication on Wikipedia, and hence is relevant in understanding the imbalances we’re seeing in translation on the site. It also offers important insights into how these challenges can be addressed in order to improve the accuracy and reliability of information available on the platform.
Growing Wikipedia Across Languages via Recommendation
Authors : E Wulczyn, R West, L Zia, J Leskovec
Link : https://dl.acm.org/doi/abs/10.1145/2872427.2883077
Summary By : Priyanshi Goel
The paper discusses a method to encourage the growth of Wikipedia articles in different languages through the use of recommendation algorithms. The authors note that while Wikipedia has grown to become a vast repository of knowledge in many languages, there are significant disparities in the quantity and quality of articles across different language editions. To address this problem, the authors propose a system for recommending articles that are likely to be of interest to editors of a particular language edition, based on patterns of article creation and editing in other language editions.
The authors analyze data on article creation and editing in multiple language editions of Wikipedia, and develop a recommendation algorithm that takes into account various features of articles, such as their length, quality, and popularity. They then evaluate the performance of the algorithm by using it to recommend articles to editors of various language editions of Wikipedia, and measuring the resulting increase in the number of articles created or improved.
The results of the study show that the recommendation algorithm is effective in increasing the number of articles created or improved in many language editions of Wikipedia. The authors note that the system could be used as a tool for supporting the growth of Wikipedia in smaller languages, which may lack the resources or expertise to create and maintain large numbers of articles on their own. They also note that the system could be used to help address biases and gaps in Wikipedia content, by identifying topics that are underrepresented in certain language editions and recommending articles on those topics to editors. Overall, the paper presents a promising approach to supporting the growth and diversity of Wikipedia content across different languages. These results and conclusions can be helpful for us further conduct research in this direction.
Analysing the use and perception of Wikipedia in the professional context of translation
Link : https://www.jostrans.org/issue23/art_alonso.pdf
Authors : Elisa Alonso, Universidad Pablo de Olavide de Sevilla
Suggested by : Aida Isah Elhassan
This article explores the use and perception of Wikipedia as a source of information for professional translators. The study collected data through an online survey and interviews with professional translators and translation students.
Through the collected data it was found that some translators viewed Wikipedia as a valuable resource, whereas others saw it as a potential threat to their profession, citing concerns about the quality of information and the potential for plagiarism.
Valuable contribution is provided to the ongoing discussion about the role of Wikipedia in the professional context of translation. It highlights the importance of critical thinking and evaluation of sources in the translation profession, and the need for translators to be aware of the potential benefits and risks of using Wikipedia as a source of information.
Overall, insights into the complex relationship between professional translators and Wikipedia are given the study, thus highlighting the need for translators to critically evaluate information obtained from online sources.
Tracking Knowledge Propagation Across Wikipedia Languages
Link: https://ojs.aaai.org/index.php/ICWSM/article/view/18128
Authors: Rodolfo Valentim, Giovanni Comarela, Souneil Park, Diego Saez-Trumper
Suggested by: Aida Isah Elhassan
In this article, we are shown how knowledge is propagated across different language versions of Wikipedia. Data mining techniques are used to study the inter-language links between articles on the English and German versions of Wikipedia, identifying patterns of knowledge propagation and the factors that influence the spread of information. The study also found that the speed and extent of knowledge propagation is influenced by a number of factors, including the popularity of the topic, the availability of information in different languages, and the cultural and linguistic differences between different language communities.
The result of the study gives valuable insights into how knowledge is propagated across different language versions of Wikipedia, and highlights the importance of cross-lingual collaboration in promoting the dissemination of knowledge; contributing to the understanding of the dynamics of knowledge propagation in the online information ecosystem.
Examining Wikipedia With a Broader Lens: Quantifying the Value of Wikipedia’s Relationships with Other Large-Scale Online Communities (2018)
Link: https://www.brenthecht.com/publications/chi2018_wikipediavaluetoonlinecommunities.pdf
Authors: Nicholas Vincent,Isaac Johnson, Brent Hecht
Suggested by: Aida Isah Elhassan
This research paper examines the relationships between Wikipedia and other large-scale online communities, such as Reddit and Stack Exchange, and quantifies the value of these relationships in terms of increased traffic and user engagement on Wikipedia. The authors collected data on the links between Wikipedia and these other communities and analyzed the patterns of user behavior and engagement.
In this paper, we find that there are strong relationships between Wikipedia and these other communities, these relationships have a positive impact on Wikipedia's traffic and user engagement. The paper explains how these relationships are mutually beneficial, as Wikipedia provides valuable content to these communities while also benefiting from its increased exposure.
Overall, this research paper provides us with valuable insights into the interconnections of online communities and the importance of alliance and cross-promotion by gauging the value of these relationships thus providing a strong case for continued collaboration between online communities thereby highlighting the benefits that can be gained from these partnerships.
In my opinion, it provides useful perception into the value of relationships between Wikipedia and other online communities. The research methods are sound, and the findings are clearly presented and easy to understand. This paper has contributed immensely to online community research and gives valuable information for those interested in understanding the dynamics of online communities and the benefits of collaboration.