===Profile Information
**Name:** Akash
**Zulip username:** akashsuper2000
**Web Profile:** https://akashsuper2000.github.io/
**Resume:** https://akashsuper2000.github.io/resume.pdf
**Location:** Chennai, India
**Typical working hours:** IST
===Synopsis
=====Short summary describing your project and how it will benefit Wikimedia projects
**Project document link:** https://phabricator.wikimedia.org/T304826
**Proposal document link:** https://phabricator.wikimedia.org/T306268
Campaigns are an integral part of the Wikimedia community, aimed at encouraging new and existing users to contribute content to Wikimedia projects. It is therefore essential to understand the impact of such campaigns and how well they retain users. The goal of this project is to develop a metrics dashboard that provides insights into user retention over different time intervals.
For this, an ETL pipeline will be created that extracts data from sources such as MariaDB or data dumps and transforms it into a format suitable for plotting. Informative, readable graphs are then generated automatically from this data and served through a web application built on a framework such as Flask.
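As a rough illustration of the transform step, retention could be computed along the following lines. This is a minimal sketch, not the final design: the function name `retention_by_interval`, the `(username, edit_date)` record shape, the 30/60/90-day windows, and the definition of retention used here (a campaign participant who edits again within the window after the campaign ends) are all assumptions made for the example. The real pipeline would read these records from MariaDB or a data dump rather than from an in-memory list.

```python
from datetime import date, timedelta


def retention_by_interval(edits, campaign_end, intervals=(30, 60, 90)):
    """Return {days: fraction of participants retained within `days` days}.

    edits: iterable of (username, edit_date) tuples (hypothetical shape).
    A participant is any user with an edit on or before campaign_end.
    A participant counts as retained for a window if they have at least
    one edit after campaign_end and on or before campaign_end + days.
    """
    edits = list(edits)  # the records are scanned more than once
    participants = {user for user, day in edits if day <= campaign_end}
    if not participants:
        return {days: 0.0 for days in intervals}

    retention = {}
    for days in intervals:
        window_end = campaign_end + timedelta(days=days)
        retained = {
            user
            for user, day in edits
            if user in participants and campaign_end < day <= window_end
        }
        retention[days] = len(retained) / len(participants)
    return retention


# Made-up sample data, for illustration only.
sample_edits = [
    ("alice", date(2022, 3, 1)),
    ("alice", date(2022, 4, 10)),
    ("bob", date(2022, 3, 5)),
    ("bob", date(2022, 6, 20)),
    ("carol", date(2022, 3, 7)),
    ("dave", date(2022, 3, 9)),
]

result = retention_by_interval(sample_edits, campaign_end=date(2022, 3, 31))
print(result)
```

The resulting dictionary maps each interval to a ratio between 0 and 1, which is already in a shape that a plotting library can consume directly as a bar or line chart.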
=====Possible Mentors
@Jayprakash12345
@KCVelaga
@Sadads
=====Have you contacted your mentors already?
Yes, I have contacted the mentors through Wikimedia's Zulip chat.
===Deliverables
=====Describe the timeline of your work with deadlines and milestones, broken down week by week. Make sure to include the time you are planning to allocate for investigation, coding, deploying, testing, and documentation
===Participation
=====Describe how you plan to communicate progress and ask for help, where you plan to publish your source code, etc.
===About Me
=====Education
I completed my bachelor's in Computer Science with distinction from Amrita University, which hosts one of India's top computer science programs.
=====How did you hear about this program?
I have known about GSoC for a long time and even submitted a proposal last year: https://akashsuper2000.github.io/blog/gsoc-2020-proposal
=====Will you have any other time commitments, such as school work, another job, planned vacation, etc, during the duration of the program?
I recently started working as a Software Engineer, after graduating in 2021. I have also been accepted into a university in the United States for my Master's in Computer Science. I will be available for the entirety of the program except for one week (August 1st, 2022 to August 7th, 2022), when I will be busy with my relocation. Apart from that week, neither my job nor my relocation will affect my ability to contribute to the program.
=====We advise all candidates eligible for Google Summer of Code and Outreachy to apply for both programs. Are you planning to apply to both programs and, if so, with what organization(s)?
I am only applying through the Google Summer of Code program.
=====What does making this project happen mean to you?
Wikimedia's mission is to bring free education to the world, a mission that deeply resonates with me. This opportunity allows me to directly improve this system while learning new technologies, building critical infrastructure, and networking with people who share this vision. Specific to this project, I would be able to put my data science skills to good use by helping users understand the impact of various campaigns, which in turn allows the funds spent on growing this community to be used more efficiently. This is also my gateway to contributing to open source.
===Past Experience
=====Describe any relevant projects that you've worked on previously and what knowledge you gained from working on them.
======Web development
Throughout my undergraduate years, I worked on web development projects that let me build solutions with an immediate impact. These include the 'Faculty Dashboard', built using ReactJS, which addresses the need for a centralized portal for the faculty of my institution, and the 'Voice-based transport inquiry system', built using Java Spring MVC with an inbuilt voice I/O system. I have also used Python-based web frameworks like Flask to quickly build applications for deploying stats visualizations, running cron jobs, and hosting machine learning models.
**Links to applications that are hosted at the moment**
COVID-19 dashboard using Flask: https://akashsuper2000.pythonanywhere.com/
Python executor using Flask: https://akash2000.pythonanywhere.com/
Faculty dashboard using ReactJS: https://akashsuper2000.github.io/faculty-dashboard/
======Data Science
I have hands-on experience with a range of projects that apply data science concepts such as clustering, hypothesis testing, ranking, regression, and SVM, gained through the "Fundamentals of Data Science" course I took in college. As part of the course, I worked with tools like NumPy, Pandas, Matplotlib, Seaborn, Plotly, and Bokeh, which should allow me to ramp up quickly in Wikimedia's development ecosystem.
======Big data
Through the "Big Data" course I took in college and my work as a Software Engineer in a large organization, I have explored and worked with big data tools in the Apache Hadoop ecosystem such as MapReduce, Hive, and Pig.
======Databases
I have used a variety of databases extensively, including MySQL, MongoDB, Aurora RDS, DynamoDB, Cassandra, and Google BigQuery. I believe this experience will enable me to transition smoothly into the MariaDB ecosystem at Wikimedia.
=====Describe any open source projects you have contributed to as a user and contributor (include links).
While I have open-sourced a good number of my own projects, I do not yet have first-hand experience contributing to an open-source project that is widely used. I believe this program would be a good starting point for building valuable connections in the community and for beginning active open-source participation and contribution.
===Other Information
=====Experience continued - Hackathons
My work on a diverse set of projects is complemented by my involvement in hackathons and competitions. I have participated in numerous Kaggle competitions, securing multiple medals to rank among the top 200 globally. I have also participated in CTF contests, where my team ranked in the top 100 nationally for two consecutive years.