Page MenuHomePhabricator

[Proposal] Auto spell correction and maintaining the format of the text based on machine learning frameworks
Closed, DeclinedPublic

Description

Profile Information

Name- Aniket Agarwal
IRC nickname on Freenode - Aniket
Email - aniketkagarwal2312@gmail.com
Resume (optional) - https://drive.google.com/open?id=1VJaz1mAW73BmQsMob6oyWGExoFJZ6QKj
Github - https://github.com/aniketkagarwal
Location (country or state) - India
Typical working hours (include your timezone) - Between 5pm and 12 am UTC +5:30

Synopsis

The project deals with the effective use of machine learning in maintaining the format of the text. While any editor is writing a paragraph on wiki, it will itself help him to manage the functions so that the user can manage its time effectively. The functions may include capitalization of letters after every full stop (ending of the sentence), also the same lies with the nouns, based on the content the machine will itself prefer some of the images which the editor can place if he/she needs and also will involve auto correction feature.
Mentor- @Capt_Swing

Deliverables

Week 1 (30th May - 5th June)

  • Prepare the framework for the project, such as collection of resources.

Week 2 (6th June - 12th June)

  • Installation of Ubuntu and jupyter notebook.

Week 3 (13th June - 19th June)

  • Preparing data sets related to the problem.

Week 4 (20th June - 26th June)

  • Learning the machine based on the data sets and creating an algorithm for the successful completion of the first task. i.e., to manage the format of the content which include letter capitalization, auto spell correction and more.

Week 5 (27th June - 3rd July)

  • Phase 1 evaluation

Week 6 (4th July - 10th July)

  • Complete the data sets for the second task of the project. This will involve sets which will help the editor to choose images based on the content. The pictures will be shown automatically based on the content as provided by the user.

Week 7 (11th July - 17th July) + Week 8 (18th July - 24th July) + Week 9 (25th July - 31st July)

  • Preparing the algorithm for the task.

Week 10 (1st August - 7th August)

  • Phase 2 evaluation

Week 11 (8th August - 14th August)

  • Improvements based on the feedback and responses as received.

Week 12 (15th August - 21st August) + Week 13 (22nd August - 30th August)

  • Fix bugs. Writing documentation and updating appropriate guides. Code cleanup for final submission.

30th August and later

  • Submitting final evaluations and declaration of final results for GSoC 2020.

Participation

I will upload my daily progress on my github account. I will create pull requests as needed. Communications will be carried through commenting on the project created on Phabricator. I will also publish my weekly report. After the completion of each task, I will publish the summary on git.

About Me

I am currently pursuing my B.Tech degree from National Institute of Technology, Arunachal Pradesh in branch Computer Science and Engineering. I have heard about the program through social media profiles and had immersed myself in the groundbreaking contribution in open source community.
Our college will restart on the first week of August, but as there will be no exams at that moment, I will be able to complete my task as mentioned in the deliverables. Also I am eligible for both GSoC and Outreachy. I had already applied to it taking the same project.
I am very much excited to complete the project as this concept will take the efficiency to the next higher level. This task can be evaluated by the whole world not just by simply making a documentation and storing it.

Past Experience

I have prepared many projects involving machine learning and data collection. I had created an OCR system. Also I had designed many portals for my college. I had experienced Machine Learning Frameworks in a practical application. Also I had developed an algorithm which provides the machine to track objects having the highest efficiency, submitted as work in my previous internship. All the projects are available on my github repository.

Event Timeline

@Aniketkagarwal2312 Hello! Outreachy doesn't allow applicants to propose their own internship project. But, you can submit this proposal to GSoC! Remember to follow the application process steps here https://www.mediawiki.org/wiki/Google_Summer_of_Code/Participants#Application_process_steps.

I had read the steps and also had submitted the proposal as mentioned in GSoC.

QEDK renamed this task from Auto spell correction and maintaining the format of the text based on machine learning frameworks to [Proposal] Auto spell correction and maintaining the format of the text based on machine learning frameworks.Mar 25 2020, 7:17 PM
QEDK moved this task from Backlog to Accepted Proposals on the Google-Summer-of-Code (2020) board.
Pavithraes subscribed.

@Aniketkagarwal2312 We are sorry to say that we could not allocate a slot for you this time. Please do not consider the rejection to be an assessment of your proposal. We received over 100 quality applications, and we could only accept 14 students. We were not able to give all applicants a slot that would have deserved one, and these were some very tough decisions to make. Please know that you are still a valued member of our community and we by no means want to exclude you. Many students who we did not accept in 2019 have become Wikimedia maintainers, contractors and even GSoC students and mentors this year!

If you would like a de-brief on why your proposal was not accepted, please let me know as a reply to this comment or on the ‘Feeback on Proposals’ topic of the Zulip stream #gsoc20-outreachy20. I will respond to you within a week or so. :)

Your ideas and contributions to our projects are still welcome! As a next step, you could consider finishing up any pending pull requests or inform us that someone has to take them over. Here is the recommended place for you to get started as a newcomer: https://www.mediawiki.org/wiki/New_Developers.

If you would still be eligible for GSoC next year, we look forward to your participation!