May 21 2020
@Aklapper I thank you for your answer. As promised, I discussed with all users about the issue and I presented the outcomes of that discussion in Wikimania 2019 conference. You can find my presentation at https://commons.wikimedia.org/wiki/File:Languages_lightning_talks.webm and https://commons.wikimedia.org/wiki/File:Wikimania_2019_-_Arabic_Linguistics.pdf. The topic seems to be controversial. Middle Eastern users are struggling the idea despite all my attempts to convince them of the importance of this action. If all Arabic languages/dialects become recognized and supported, we will be able to enrich Wikidata with information about these languages. Please go through my Wikimania presentation and decide the best solution to the matter.
Apr 16 2020
Apr 8 2019
I am Houcemeddine Turki from Tunisia. I am a medical student. However, I like Computer Science. I have some skills of Python and SQL. I ask if I can enter as a student in your GSoC 2019 project.
Feb 26 2018
@Bassem I see, I do not know that as I am not Lebanese. Well, I am a Tunisian citizen :). Thank you for the information. Our proposal is not related to Said Akl's thoughts. It is just for linguistic purposes and to avoid spending time on creating tens of Wikipedias in Arabic dialects.
Feb 23 2018
@1339861mzb Done. Let us continue our discussion in https://meta.wikimedia.org/wiki/Talk:Wikimedia_Tunisie/WikiLingua_Maghreb.
@Bassem @Ibrahim.ID @1339861mzb
As you have seen, there is no political reason behind the proposal. The reason is linguistic. However, if you would like to further discuss with me about that, please keep in mind that phabricator is a website for Wikimedia technical discussions. That is why I invite you to continue our discussion in https://meta.wikimedia.org/wiki/Talk:Wikimedia_Tunisie/WikiLingua_Maghreb.
@Bassem Concerning the intelligibility of Modern Standard Arabic, I did not say that Arab people do not understand Modern Standard Arabic at all. What I said is "Due to the existence of False friends between Modern Standard Arabic and Arabic dialects and due to the existence of several morphological particularities within the Arabic dialects, some Arabs can misunderstand some words within a text in Modern Standard Arabic. Adding labels to Wikidata entities will prevent such misunderstandings".
@Bassem This is an article about Said Akl experience published in the Egyptian newspaper Al Youm Al Saba' in 2017.
If the project has already been finished since 20 years, I ask about why people still deal about it.
@Bassem You can contact Franck Salameh and discuss this with him. He can give you more details about the Said Akl experience:
@Bassem You can describe better than me the situation of Lebanese as you are one of its native speakers. However, from what I see, there are still some channels like MTV Lebanon that deals with the efforts of Said Akl and that still use Said Akl's Arabic script writing system.
@1339861mzb This is not always accurate. Moroccan Arabic is used in Education in Morocco. There are also efforts in this context in Algeria, in Tunisia, and in Lebanon. Furthermore, there are many resources that are written in Arabic dialects/languages since the early 20th Century. Scientists in Tunisia have even found school lessons, mystic poems, religious speeches, and administrative information written in Tunisian during the 18th and 19th Century. We will publish a paper about that. However, this is not the most important reason about adding labels in Arabic dialects to Wikidata. In fact, there are many false friends between Arabic dialects and Modern Standard Arabic. Adding labels in Arabic dialects to Wikidata will prevent users from misunderstanding the articles of the Arabic Wikipedia. Furthermore, there are many Arab people who do not have sufficient proficiency to read and understand Modern Standard Arabic as it should be due to the poor literacy rates in some Arab countries. Letting all human knowledge as provided by Wikidata converted into their Arabic dialect will help them to better understand the information they are searching for in Wikipedia editions.
@Nikerabbit If you need several other information, please write me soon.
- Geographical areas:
apc: Syria, Lebanon
ajp: Palestine, Jordan
afb: Saudi Arabia, Bahrain, UAE, Kuwait, Oman, and Qatar
acw: Saudi Arabia
ars: Saudi Arabia
ayh: Yemen, Saudi Arabia, Oman, United Arab Emirates, Qatar, Singapore, Somalia, Eritrea, Ethiopia, Kenya, Tanzania, Sudan, Indonesia, Malaysia
acq: Yemen, Djibouti, and Somalia
acm: Iraq, Syria, Iran, Turkey, Cyprus, and Armenia
avl: Egypt, Jordan, Palestine, Saudi Arabia, and Syria
abv: Bahrain, Oman, and Saudi Arabia
mea: Mauritania, Senegal, Mali, Morocco, and Algeria
aao: Algeria, Western Sahara, and Niger
shu: Chad, Cameroon, Nigeria, Niger, and Sudan
- Their script is Arabic script.
- Their language codes and English names are: North Levantine Arabic (apc), South Levantine Arabic (ajp), Gulf Arabic (afb), Hejazi Arabic (acw), Najdi Arabic (ars), Hadhrami Arabic (ayh), Sanaani Arabic (ayn), Ta'izzi-Adeni Arabic (acq), Mesopotamian Arabic (acm), Cypriot Arabic (acy), Egyptian Arabic (arz), Northwest Arabian Arabic (avl), Sudanese Arabic (apd), Bahrani Arabic (abv), Libyan Arabic (ayl), Tunisian Arabic (aeb-arab), Algerian Arabic (arq), Moroccan Arabic (ary), Hassaniya Arabic (mea), Saharan Arabic (aao), and Chadian Arabic (shu).
- Native names:
apc: الشامي الشمالي
ajp: الشامي الجنوبي
avl: الرقاوي, الشاوي, or البدوي
@Nikerabbit So, I ask if you can add these Arabic languages to ULS.
Feb 22 2018
@Baba_Tabita We will begin working on Tunisian, Algerian, Moroccan and Egyptian as they are already supported by ULS.
@Baba_Tabita That's clear. I will see what I can do.
Feb 19 2018
@Mbch331 I will not raise the priority again. However, please try to work on it
Feb 18 2018
If all Arabic languages will be added to Wikidata, I will try to add labels to all Wikidata entities in all Arabic languages very soon. Consequently, if we translate Mediawiki messages and the names of Wikidata entities to all Arabic languages, we will have Wikidata and consequently the sum of all human knowledge translated into all Arabic languages. Consequently, there will be no need to create stub Wikipedias and Wiktionaries in these Arabic languages
I ask if LangCom is for or against adding all Arabic dialects/languages to Wikidata.
Feb 17 2018
@Mbch331 Well, I discussed that with Wikidata community. I do not think that someone will be against this task.
Feb 16 2018
@Bassem I do not agree. Arabic dialects are not the same as Modern Standard Arabic. There are many false friends between them and each one of them has its linguistic particularities and this is what is proved in many research works.
@GerardM I know that. However, this is not what I meant.
@alanajjar I only proposed these dialects because they have Wikimedia language codes.
We are only waiting for the support to begin our work.
Feb 15 2018
This proposal is a part of the outcomes of the presentation about Wikidata in AICCSA 2017 conference.
If we add labels in Arabic dialects to all Wikidata entities, we will have Wikidata and consequently all the data existing in Wikipedia and Wiktionary translated to Arabic dialects. Consequently, we will not have to create Wikipedias or Wiktionaries in all the Arabic dialects. Who needs more information about an entity in Wikidata can search Arabic, French or English Wikipedia.
apc: الشامي الشمالي
ajp: الشامي الجنوبي
avl: الرقاوي, الشاوي, or البدوي
Feb 14 2018
The proposals are assigned to sessions in https://meta.wikimedia.org/wiki/Category:WikiIndaba_conference_2018/Accepted_submissions
Feb 10 2018
Notifications to authors not having a scholarship should be done soon.
Categories added. Please remove from them the presentations that will not be presented in WikiIndaba conference 2018.
Feb 9 2018
Aug 13 2015
Example of the converter:
It-Tūnsī huwwa il-dārja illī yitkallmūhā il-twānsa il-barra w fi- Tūnis. Tkawwan il-lsān il-Tūnsī baģd mā t'aŧŧar il-earbī b- il-lūġāt illī kānit mawjūda fi- Tūnis qbal eām 673 kīmā il-būniyya, il-amāzīġiyya… Il-cay hāđā baddal m- il-gṛammār mtāe il-lsān it-Tūnsī w xallāhā twallī beīda yāsir ea- il-Fuṣḥä… Ic-cay illī xallä barca erab mā yifhmūc il-lsān it-Tūnsi w barca twānsa yilqāw il-fuṣḥä ṣeība barca. Hađāka elāc emalnā Wīklam b- il-Tūnsī.
The source code is available here: https://phabricator.wikimedia.org/P1876
The additional Latin Letters were inserted into a special layout for Tunisian for the AZERTY and QWERTY Keyboards.
The AZERTY Keyboard Layout for Windows is available here: https://phabricator.wikimedia.org/F1371689
The QWERTY Keyboard Layout for Windows is available here: https://phabricator.wikimedia.org/F1371794
Aug 12 2015
The source code is available here: https://phabricator.wikimedia.org/P1871
This is the source code of the converter:
<html lang="en"><head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
<meta charset="utf-8"> <title>Tunisian Arabic Script Converter</title> <meta name="viewport" content="width=device-width, initial-scale=1.0, maximum-scale=1.0, user-scalable=no">
All problems had been solved excepting the one of the Shaddah for punctation and numerals.
The Source code is:
Aug 11 2015
May 6 2015
I will try to contact the admins of Incubator. Thank you.
The problem of the language name had been resolved. I just apply that I can edit Wp/aeb-latn and Wt/aeb-latn.
I have seen that you had recognized both scripts as AEB-LATN and AEB-ARAB. However, I cannot edit them in Incubator. So, please give me access to edit them in Incubator. Furthermore, the name of Tunisian in Latin Script is "Tûnsî" and not "Tûnsi". So, try to adjust this as soon as possible.
May 3 2015
I had joined Phabricator because we have to solve all the problems here.
Now, we have two scripts for Tunisian... AEB (AEB-ARAB) and AEB-LATN...
I ask if we can achieve a consensus with Language Committee about the recognition of both scripts for Tunisian.
We can use AEB in writing Wikisource and Wikibooks and AEB-LATN in other projects.
Furthermore, adjust Tûnsi as Tûnsî.