**Profile**
**Name:** Soumya Atul Gupta
**Email:** soumyag213@gmail.com
**IRC or IM networks/handle(s):** soum on freenode
**Web Page / Blog / Microblog / Portfolio:** My [[ https://github.com/soumyag213 | Github ]] profile and my [[ https://theendofthesemicolon.wordpress.com/ | personal blog ]]
**Location:** New Delhi/Mumbai
**Typical working hours:** 1000-1330 hours and 2030-0230 hours, GMT +5.30
**Synopsis**
The project aims to consolidate the documentation for Zotero's translation servers, which in turn will help [[ https://www.mediawiki.org/wiki/Citoid | Citoid ]] (a service that allows people to easily add references on Wikipedias). Currently, up-to-date documentation exists only for the browser plug-ins not the translation-server and not on wikipedia, but on [[ https://www.zotero.org/support/dev/how_to_write_a_zotero_translator_plusplus | Zotero ]]. This documentation will be present on mediawiki.org and and the translation of the documentation to Hindi and French will be done.
While writing the documentation, site-specific will be solved (in order to establish full rapport with it).
**Mentors:** @Mvolz and @czar
**Deliverables: **
1. Write documentation for Zotero plug-in and translation server
2. Write translators for Zotero
3. Cleaning up [[ https://phabricator.wikimedia.org/project/view/62/ | Citoid-related issues ]] site-specific Zotero bugs
**Proposed Timeline: **
| **Dates** |**Tasks** |
|Nov 8 – Dec 6|Community Bonding Period, familiarizing oneself with Citoid and Zotero|
|Dec 6 – Jan 6| Writing translators for Zotero; fix bugs related to Zotero/Citoid to understand Zotero/Citoid completely| Site specific issues |T137019, T121295, T113262, T106892, T99091, T98675, T105647, T87331|
|Jan 6 – Feb 15| Documentation for Zotero plug-ins, and translation-servers|
|Feb 15 - Feb 28 | T106200 and T137440|
|March 1 – March 6| Final revisions; Pencils Down|
Mid-term review: Basic Documentation begun, and 6/8 bugs solved
{F4612658}
{F4612663}
**Participation**
I've been a regular user of Git for quite a while now, so I've got the basics of code review firmly established.
While I'll be working on the project, I'll be backing up my code everyday to Git and for review to Gerrit. This regularity will help other developers keep a track of my progress, and to point out corrections at every stage.
In addition to this, other forums where I will be available are:
IRC: I'll stay online on freenode in #mediawiki and #wikimedia-services as soum during my working hours.
I'll post my progress (weekly summary) on my [[ https://theendofthesemicolon.wordpress.com/ | personal blog ]].
**About you**
**Education and Work Experience**
I am a final year student at Netaji Subhas Institute of Technology, New Delhi pursuing B.E in Electronics and Communication. Over the years, I've gleaned knowledge of not only Circuits and Systems, Electronics, Digital Signal Processing, Computer System Architecture, Microprocessors et al but Programming, Data Structures, Algorithms and Machine Learning. I want to pursue Natural Language Processing in the future with a side of Linguistics. My final year project is based on Image Captioning using Neural Networks. I am also a open-source contributor to CLTK (Classical Language Toolkit) (I've talked about this later)
Over the summer (2016), I interned with Royal Bank of Scotland as a Software Development Intern. My work was mainly centered around RESTful HTTP APIs using the language Groovy (a mixture of Java + Python) as a base. I also did a project on Data Analysis wherein I had to organize the messed-up data streaming in through my department, I used SQL and QlikView to streamline the data.
I am also comfortable coding in C++, Python and Java.
HTML is a language that I have been introduced to since 2007, ever since web design became a part of our curriculum.
I am the senior editor of the college newspaper, so I like to think that I would be able to get the documentation done easily. I am not alien to the open-source contribution culture having contributed to a FOSS organization before.
These three skills together, make me an ideal candidate for the project at hand which mainly requires documentation and web scraping.
**How did you hear about this program?**
I heard about this program from colleagues who have been participants in the previous rounds (Mozilla, Round 12 and The Perl Foundation, Round 9). The learning curve exhibited by both my colleagues is what primarily what inspired me to apply here.
**Will you have any other time commitments, such as school work, another job, planned vacation, etc., during the duration of the program?**
I will have college work from mid-Feb to March, but the work pressure will be minimal as it is mainly a research-based semester. I will easily be able to give 40+ hours a week to this project. Other than that, I will have a classical dance exam (Odissi, that's an Indian Classical Dance Form) and anthropology related NGO work (concentrating on the olden traditions of West Bengal).
**We don't just care about your project -- you are a person, and that matters to us! What drives you? What makes you want to make this the most awesomest wiki enhancement ever?**
I've been using Wikipedia ever since I've started using the Internet itself. That's 14 years out of a 21 year old existence. Needless to say, it's Google and Wikipedia which singlehandedly define the Internet today. They form the fundamental pillars of the information age. To be a part of something so dynamic and affecting would be unbelievable for me, I would extrapolate it as my way of changing the world (one wiki enhancement at a time).
**Past experience**
**Please describe your experience with any other FOSS projects as a user and as a contributor**
As a user, I've been using Firefox, Linux, Mediawiki, Python and OpenJDK. They seem to be miles ahead kind of development they bring about by just sharing all the knowledge that they possess. I think it's amazing how people all over the world can cohesively make such a difference in real time (for instance Mozilla nightly builds and Google Canary, it's improved everyday consistently, by the people who use it). This goes for EVERY FOSS organization, I think they grow exponentially not only in terms of the software, but in terms of the camaraderie and the culture they cultivate.
As a open-source contributor, I've been contributing to the [[ https://github.com/cltk/cltk | CLTK ]] Project (Classical Language Toolkit; this project concerns itself with the documentation with the ancient languages of the world, it employs extensive Natural Language Processing with Python as its base). CLTK was featured in GSoc 2016 for the first time, and that is how I got to know about its existence. I have been importing code from third party libraries into their source code, while converting it from Python 2.7 to Python 3. My contributions can be accessed [[ https://github.com/soumyag213/cltk-indica-nlp | here ]].