Outreachy Application Task: Tutorial for Wikipedia language-agnostic article quality modeling data
Closed, Resolved (Public)

Assigned To
Authored By
Pablo
Feb 21 2024, 11:54 AM
Referenced Files
F44181989: Screenshot 2024-04-02 at 16.40.09.png
Apr 2 2024, 3:40 PM
F44057810: Screenshot (210).png
Apr 1 2024, 6:47 PM
F44030520: Screenshot 2024-04-01 at 14.51.31.png
Apr 1 2024, 1:57 PM
F43908327: image.png
Mar 31 2024, 7:07 AM
F42948581: image.png
Mar 21 2024, 1:28 PM
F42766313: Screenshot 2024-03-20 020007.png
Mar 19 2024, 8:33 PM
F42748553: Screenshot (72).png
Mar 19 2024, 10:47 AM
Restricted File
Mar 19 2024, 9:51 AM

Description

Overview

This task serves as a tutorial with microtasks for the Outreachy Project T356498 (Build a data visualization tool for the evolution of Wikipedia articles maintained by WikiProjects). Starting from this notebook, try to go through the steps and complete the different TODOs.

The full Outreachy project will involve more comprehensive coding than what is being asked for here (and some opportunities for additional explorations as desired). This task will introduce some of the basic concepts and give us a sense of your Python skills, how well you work with new data, documentation of your code, and description of your thinking and results. We are not expecting perfection -- give it your best shot! See this example of working with Wikidata data as an example of what a completed notebook tutorial might look like.

Set-up

  • Make sure that you can login to the PAWS service with your wiki account: https://paws.wmflabs.org/paws/hub
  • Using this notebook as a starting point, create your own notebook (see these instructions for forking the notebook to start with) and complete the functions / analyses. All PAWS notebooks have the option of generating a public link, which can be shared back so that we can evaluate what you did. Use a mixture of code cells and markdown to document what you find and your thoughts.
  • As you have questions, feel free to add comments to this task (and please don't hesitate to answer other applicants' questions if you can help).
  • If you feel you have completed your notebook, you may request feedback and we will provide high-level feedback on what is good and what is missing. To do so, send an email to your mentor with the link to your public PAWS notebook. We will try to make time to give this feedback at least once to anyone who would like it.

Evaluation

When you feel you are happy with your notebook, you should include the public link in your final Outreachy project application as a recorded contribution. You may record contributions as you go as well to track progress. The final notebook (created for the microtask) is our primary means of judging each applicant. For the notebook, we will evaluate it along a few criteria:

  • Quality of code: are there bugs? is your code well-structured, commented, and easy to understand?
  • Quality of notebook: is the notebook well-structured and easy to follow?
  • Creativity: did you try different approaches? did you come up with interesting ideas for future analyses?
  • Iteration: if you receive feedback, were you able to incorporate those changes?

Because we will likely only provide one round of feedback for each applicant, we recommend submitting the notebook when you are fairly confident it is in a complete state. As noted above, you are not evaluated on the quality of your initial notebook but rather on how you adapt to the feedback, so it's okay to have mistakes in your notebook at the initial review and you will not be penalized for that.

If you have received feedback from the mentors, we recommend that you include a section in your final notebook briefly listing all the changes you made to address our comments.

Please remember that April 2, 2024 4pm UTC is the deadline for ALL applicants to record contributions and create a final application.

Event Timeline

Can anyone help with the image file mentioned in the given notebook? I'm not able to open it.

Hi @Gungun_Singh, which image do you mean?

Hi all, I am getting the same error that @Jane_Ngethe was getting. Am I doing the right thing by just dismissing those messages and waiting for it to complete?

Hi @Komalverma148, if you are encountering that error and just dismissing those messages, your file probably won't be saved. You will have to check what is taking up space in your PAWS folder and remove the unnecessary files.

Hello there. This is right. I realized I had so many files uploaded trying to fork the main notebook. So I deleted all the unnecessary ones and today it's working fine. Autosave.

Screenshot (72).png (1,920×868 px, 128 KB)

The image file given here

OK, so this is the image you are looking at:

Hi @Gungun_Singh, which image file are you talking about? Is it this one?

image.png (1,753×529 px, 70 KB)

Yes, this one. Thanks, but why isn't it showing in mine?

That might be due to internet issues.

Message from the mentors: deadline to submit initial notebook this Friday

Due to the high number of interested applicants, we will be closing this project to new contributors on Friday March 22 at 9:00 PM UTC.

  • If you have not yet requested feedback, please email us the link to your PAWS notebook by this deadline.
  • Please complete your initial submission by this deadline.

We will provide first-round feedback to all notebooks emailed to us by this deadline. This can take up to 3 work days (Mon-Fri), so please be patient if you don't immediately hear back from us. You will have until April 2 to make updates to your notebook based on feedback that we provide.

Thank you!

Hi, can anyone help me, please?
Is the server down for the first part, which downloads the file using this code?

df_revisions = pd.read_csv('https://public-paws.wmcloud.org/User:Pablo%20(WMF)/outreachy/round28/features_scores_climatechange_2022.csv.zip')

I keep getting an error saying

"IncompleteRead: IncompleteRead(7208587 bytes read, 25985796 more expected)"

I just checked, it is indeed throwing an error. Perhaps @Pablo, could you please tell us how to fix it?

Yes, definitely a server issue. Clicking the link directly throws a "404 Not Found" error
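For transient failures like the IncompleteRead above, one option is to download the archive into the PAWS working directory first, retry if the transfer is cut short, and then point pd.read_csv at the local copy. The sketch below is only an illustration: download_with_retries, its parameters, and the injectable opener are hypothetical names, not part of the notebook. Retrying will not help when the server answers 404, since that means the file is missing rather than the connection being flaky.

```python
import http.client
import time
import urllib.error
import urllib.request


def download_with_retries(url, dest_path, attempts=3, wait_seconds=2.0,
                          opener=urllib.request.urlopen):
    """Download url to dest_path, retrying on interrupted or failed transfers.

    `opener` is injectable (hypothetical parameter) so the helper can be
    exercised without network access.
    """
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            with opener(url) as response:
                # read() raises IncompleteRead if the body is cut short
                data = response.read()
            with open(dest_path, "wb") as handle:
                handle.write(data)
            return dest_path
        except (http.client.IncompleteRead, urllib.error.URLError,
                ConnectionError) as error:
            last_error = error
            if attempt < attempts:
                time.sleep(wait_seconds)  # brief pause before the next try
    raise RuntimeError(f"download failed after {attempts} attempts") from last_error
```

Once the file is on disk, passing the returned path to pd.read_csv should behave like the original call, because pandas infers zip compression from the .zip extension.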

This file save error is popping up a lot. I can't reduce the code, as everything is important. If you have a solution, please let me know. Changes are not being reflected on the public PAWS link because the file is not getting saved.

How can I filter climate change articles from the dataset? Can anyone tell me?

The dataset of language-agnostic feature values and quality scores of Wikipedia articles available on Zenodo is too large to handle with a Jupyter notebook. For that reason, we have prepared a sample with the revisions of English Wikipedia articles maintained by WikiProject Climate change with the following columns:

I think you are already working with that data.

Okay. So, the dataset has climate change articles. Got it.

I am getting this on the PAWS Jupyter platform, and the files/notebook page is not opening. Does anyone know how to resolve this?

Screenshot 2024-03-20 020007.png (1,411×424 px, 16 KB)

Hi, are we supposed to return a data visualization for the first TODO? I was thinking of returning a list (or an error message if there isn't available data).

The first TODO asks you to return the pageview counts only, not to visualize them. You need to visualize the feature values and quality score data, which I think is the last TODO.

Thank you!

Hi, are we allowed to manipulate the data for part 2? If I wanted to display page_lengths with granularity monthly, can I group page_lengths as an average for a given month? For example, a bar chart for page_lengths for 2021 with granularity monthly. X axis would be months and Y axis would be average page lengths for a given month.

Hi, yes, we are allowed to manipulate the data for visualization.

Yes; however, we are encouraged to use insightful and appealing visualizations. You can do this with any parameter in the dataset.

You can try reloading your notebook. If the issue persists, then you have exceeded the server size limit, and I think @Pablo will have to assist.

Yes, that is the basis for executing the tasks. What matters is the reasoning behind your manipulations and visualizations.
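The monthly averaging described in the question can be sketched with a pandas groupby. This is a minimal illustration on toy data, not the notebook's code: the column names revision_timestamp and page_length are assumed to match the sample, and df_toy stands in for the real df_revisions.

```python
import pandas as pd

# Toy stand-in for the revisions sample; column names are assumptions.
df_toy = pd.DataFrame({
    "revision_timestamp": ["2021-01-05", "2021-01-20", "2021-02-10",
                           "2021-02-11", "2021-03-01"],
    "page_length": [1000, 1200, 1500, 1700, 2000],
})
df_toy["revision_timestamp"] = pd.to_datetime(df_toy["revision_timestamp"])

# Group revisions by calendar month and average the page length in each month.
monthly_mean = (
    df_toy
    .groupby(df_toy["revision_timestamp"].dt.to_period("M"))["page_length"]
    .mean()
)
print(monthly_mean)
```

The resulting series has one value per month, so it can be passed to a bar chart with months on the x axis. Averaging hides how many revisions each month contains, which is worth noting next to the plot.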

A "kernel died" error occurred and then this happened. It got back to normal after a few minutes and is working fine now.

This mostly happens when too many files are saved on PAWS, and our notebook's dataset is already very large. Try saving your changes in the same notebook and deleting any unnecessary files. I had this error for almost a week, and after some time it even stopped allowing me to save changes.
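To see which files are taking up space before deleting anything, a small helper along these lines could be run in a notebook cell. It is only a sketch using the standard library; largest_files is a hypothetical name, and the starting folder is whatever directory your notebooks live in.

```python
import os


def largest_files(root=".", top_n=10):
    """Return the top_n largest files under root as (size_in_bytes, path) pairs."""
    sizes = []
    for folder, _subfolders, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(folder, name)
            try:
                sizes.append((os.path.getsize(path), path))
            except OSError:
                continue  # skip files that vanish or cannot be read
    # Largest first; ties are broken by path.
    return sorted(sizes, reverse=True)[:top_n]
```

Calling largest_files(".") and printing each pair shows the biggest files first, which usually makes leftover copies of the forked notebook or downloaded datasets easy to spot.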

Okay. I have deleted the additional files. Now I have only two files: the forked one and my own.

While using mwviews.api to get the page views, I'm getting an error while fetching the data for some dates. Can you please mention the exact year from which the page view data are provided?

try/except blocks are essential for error handling here, since data availability can vary from one date to the next. Data is not guaranteed to be available for every date in a year, so errors indicating missing data may still arise for some dates.

Also, please help me with the future analyses: do we need to write proper code for them?

For the future analyses, we don't have to write code; we only have to describe our ideas.

Okay, got it! Thank you so much for your help.

According to the mentors, data is available from July 2015 onwards.

Even after July 2015, it's important to note that data for every date may not be available. This could be due to a variety of reasons, such as days when there were no edits or when pageview count data wasn't recorded.
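The error handling suggested in this thread can be sketched as follows. This is not the notebook's code: it calls the Wikimedia REST pageviews endpoint directly with the standard library instead of going through a client library, and fetch_pageviews, its parameters, and the User-Agent string are hypothetical. It assumes the API answers HTTP 404 when it holds no data for the requested range, and treats that case as an empty result instead of stopping the notebook.

```python
import json
import urllib.error
import urllib.parse
import urllib.request

API_ROOT = "https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article"


def fetch_pageviews(article, start, end, granularity="daily",
                    project="en.wikipedia", opener=urllib.request.urlopen):
    """Return {timestamp: views} for an article, or {} if the API has no data.

    start and end are YYYYMMDD strings. `opener` is injectable (hypothetical
    parameter) so the function can be tested offline.
    """
    title = urllib.parse.quote(article.replace(" ", "_"), safe="")
    url = f"{API_ROOT}/{project}/all-access/user/{title}/{granularity}/{start}/{end}"
    request = urllib.request.Request(
        url, headers={"User-Agent": "outreachy-notebook-example"})
    try:
        with opener(request) as response:
            payload = json.load(response)
    except urllib.error.HTTPError as error:
        if error.code == 404:  # assumed meaning: no data for this range
            return {}
        raise  # anything else is a real problem worth surfacing
    return {item["timestamp"]: item["views"] for item in payload.get("items", [])}
```

Returning an empty dictionary keeps a loop over many articles or date ranges running, while still letting unexpected errors (for example a 429 or 500 response) surface so they are not silently ignored.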

Hello everyone, I am getting this error while fetching from the API. Can you please assist?

image.png (1,634×1,011 px, 199 KB)

It is expected that for some date intervals, depending on the granularity used, the interval may be too narrow to return any data. You can therefore use a try/except block or a similar technique to handle such cases without terminating the execution.

Thanks @DevJames1, I will look at that.

Hello everyone, what is the email address we are supposed to send the link to our notebook to?

The mentors' email addresses are provided in the project description. You can email any one of them.

Hey, we just need to email the public PAWS link, right? Or is there anything else that should be done?

Yes, just the link. Also, I think it makes sense to introduce yourself if that's your first direct communication.

Reminder from the project mentors: deadline to submit initial notebook today

We will be closing this project to new contributors today, March 22, at 9:00 PM UTC. Please see our previous message for details.

After today's deadline,

  • If you have specific questions about your notebook you are welcome to ask.
  • You may continue working on your notebook/final applications until the Outreachy deadline of April 2.
NOTE: Please email the link to your public PAWS notebook to all three of the mentors. Our email addresses are available on the Outreachy project page, in the contact info for "Isaac Johnson", "Caroline Myrick", and "Pablo Aragón".

Message from the project mentors regarding submission

  • The project already has strong applicants and we will no longer provide feedback to any more applicants.
  • We are working on the feedback to applicants that shared their notebook with us by last Friday’s deadline.
  • If you have received feedback from the mentors, we recommend that you include a section in your final notebook briefly listing all the changes you made to address our comments.
  • Please remember that April 2, 2024 4pm UTC is the deadline for ALL applicants to record contributions and create a final application.

Hello @CMyrick-WMF
Can I ask questions about my notebook if I received feedback?

Hello @GonzaGertrude, you are always welcome to ask questions, even after getting feedback, either here or on Zulip.

Hello, I am not able to upload the link to my PAWS notebook. Can anyone help me?

How do we go about recording the contribution and submitting a final application?

When recording the contribution, we just have to add the link to our PAWS notebook. Can you please clarify where you are trying to upload it?

You can go to the project selection page on Outreachy --> find 'Wikimedia closed project #1' --> record your contribution and apply.
We need to share the link to our notebook as a contribution. The link will be similar to the one you shared for feedback.

Please correct me if I'm wrong.

@Mistura14 how many contributions did you make?

I don't think there is a set number of contributions to make. It's just the notebook (as mentioned in the task details).
As the mentors mentioned, they recommend including a section in your final notebook that briefly lists all the changes you made to address their feedback.

Please ask general setup questions in support forums instead as they are not directly related to fixing this very task. Thanks.

Message from the project mentors regarding generative AI

In the process of providing feedback to applicants we have found some notebooks with very similar responses to specific to-dos. Their style resembles that generated by AI tools such as ChatGPT. We believe that these tools can be useful in some specific cases (e.g., proofreading for non-native speakers of English). However, if AI tools are used, the final notebook must clearly indicate where and for what purpose.

Thanks for the feedback. I found my way around it.

Yes, sure

Hi, I just saw this message. I lost track of it among the many other messages and completely missed the deadline for the initial notebook submission. I didn't record my progress as I went; I didn't think it was required with the initial instructions:

When you feel you are happy with your notebook, you should include the public link in your final Outreachy project application as a recorded contribution. You may record contributions as you go as well to track progress.

Am I still able to submit a notebook before April 2? I've been working on my notebook since my first post on March 19 and it would be very unfortunate for my work to go to waste :(

@Kn138 Yes! Although we no longer have capacity to provide feedback on notebooks, candidates (whether or not they have received feedback) may submit their final notebook by April 2.

Where should the notebook be submitted?

Hey, it has to be submitted to the project mentors via mail.

Isn't it past the deadline for feedback? The person can register it as a contribution on the Outreachy website.

Yes, the feedback period is over now, but you can still create a contribution for it.

Thank you so much!

Can anyone tell me what date we need to put in the "Date contribution was accepted or merged" field of the recorded contribution section?

Message from the project mentors about the final application

In response to questions we've received about the final application related to the timeline question, the "date contribution was accepted or merged", and the "community-related questions":

  • For information related to the timeline question, please see my previous message.
  • Date contribution was accepted or merged: you may use the date you received initial feedback from us on your notebook, or the date you submitted your final notebook to us.
  • Community-specific questions: there are no community-specific questions for our project's application.

@Pablo @CMyrick-WMF @Isaac

Can we add a link to a document in the project timeline section of our final application, or do we have to describe everything in the provided space?

Thank you so much

Hey! Can someone help me with this? While fetching pageview count data, I'm getting an error with the yearly and weekly granularities. Can only the monthly pageview counts be retrieved?

Hey @Gungun_Singh, as of today, the only granularities supported by the API endpoint are daily and monthly.

image.png (1,335×114 px, 17 KB)

Refer to this link for more information : https://wikimedia.org/api/rest_v1/#/Pageviews%20data/get_metrics_pageviews_per_article__project___access___agent___article___granularity___start___end_
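Since the endpoint only returns daily or monthly counts, weekly and yearly totals have to be derived on the client side. One way, sketched below with the standard library, is to request daily data and sum it into ISO weeks or calendar years. aggregate_daily_views and the shape of its input (a dictionary from YYYYMMDD or YYYYMMDDHH timestamp strings to view counts) are assumptions made for illustration.

```python
from collections import defaultdict
from datetime import datetime


def aggregate_daily_views(daily_views, level):
    """Sum daily view counts into ISO weeks ('weekly') or calendar years ('yearly').

    daily_views maps YYYYMMDD or YYYYMMDDHH timestamp strings to view counts.
    """
    totals = defaultdict(int)
    for timestamp, views in daily_views.items():
        day = datetime.strptime(timestamp[:8], "%Y%m%d").date()
        if level == "weekly":
            # ISO weeks start on Monday and may belong to the neighbouring year.
            iso_year, iso_week, _weekday = day.isocalendar()
            key = f"{iso_year}-W{iso_week:02d}"
        elif level == "yearly":
            key = str(day.year)
        else:
            raise ValueError("level must be 'weekly' or 'yearly'")
        totals[key] += views
    return dict(totals)
```

Note that a week or year with missing days will produce a smaller total, so partial periods at the start and end of the range are best labelled as such or dropped.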

Okay, sure. Thank you!

I described everything in the provided area. It doesn't have a word limit.

Hello,
My boxplots for pred_qual look like this, and they don't seem right to me. I merged my tables correctly and created a month column from revision_timestamp. I am plotting the boxplot with the pred_qual column as x and the month column as y. Can anyone shed light on how to make my plot look better? By the way, the subplots are for different articles, monthly, for the year 2019.

Screenshot 2024-04-01 at 14.51.31.png (3,316×744 px, 173 KB)

Please use only the provided space. Keeping the timeline short is completely acceptable.

Boxplots at monthly granularity for a single year will look something like this. If you are plotting a single year, consider using other chart types.

Hello Mary, I think it's completely normal, since you are plotting box plots at monthly granularity. The dots or very small boxes for some months show that the data points for those months are concentrated at a single pred_qual value. Other months have a range of pred_qual scores, which gives rise to a full box plot.
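A quick way to check this explanation is to count how many revisions and how many distinct pred_qual values each month contains. The sketch below uses toy data; the month and pred_qual column names follow the question, and the values are made up for illustration.

```python
import pandas as pd

# Toy data standing in for one article's revisions in a single year.
df_toy = pd.DataFrame({
    "month": [1, 1, 1, 2, 2, 2, 3],
    "pred_qual": [0.50, 0.50, 0.50, 0.40, 0.55, 0.70, 0.62],
})

# Per-month revision count, number of distinct scores, and score range.
summary = df_toy.groupby("month")["pred_qual"].agg(["count", "nunique", "min", "max"])
print(summary)
```

Months where nunique is 1 (here months 1 and 3) have identical minimum, quartiles, and maximum, so their box collapses to a single line. Months with several distinct scores (month 2) produce a visible box.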

When I recorded my contribution, it shows "Contribution #1: started March 3, 2024. Not accepted or merged. Update accepted date by editing this contribution." Which date should I put instead, or does it not matter? @Pablo @CMyrick-WMF @Isaac

@Ansha_Pashine @Shruti799
Thank you so much. I first plotted a range of years and it came out well, so I decided to try monthly for one of the years. I was quite concerned when my plots started to look that way at monthly granularity. I will now focus on other charts. Thank you.

As communicated earlier by the mentors:

For information related to the timeline question, please see my previous message.
Date contribution was accepted or merged: you may use the date you received initial feedback from us on your notebook, or the date you submitted your final notebook to us.
Community-specific questions: there are no community-specific questions for our project's application.

Hey @CMyrick-WMF! Can you tell me whether we need to include pages other than "climate change", or should we work with the data from that one page only?

Working with the whole dataset is ideal, as it will best capture the patterns in the data, but working with subsets is acceptable as long as these limitations are noted. If you work with a single page/article (such as the "climate change" page or another page within the climate change dataset), you should compare it with other pages to see whether the patterns hold or vary by page.

When I make a graph across the days of a specific month, all the boxplots are the same. I checked manually that all days have the same median. Is that right, or am I doing something wrong here?

Screenshot (210).png (801×472 px, 35 KB)

I think there is an error somewhere; the data can't be exactly the same for every day of a month.

I think there is something wrong too. Have you been able to figure that out?

Hello @Pablo, @CMyrick-WMF, @Isaac,

Given that we can still edit the submitted notebook, would it be advisable to make further edits directly to it? Alternatively, if we intend to incorporate additional content beyond the original submission, should we create a new notebook for these purposes? Your guidance on the preferred approach would be appreciated.

Thank you.

Yes, all applicants who submit a final application will be able to continue recording and editing their microtask notebook through May 1, 2024 at 4pm UTC. However, please know that these edits might occur after we have finished assessing the notebooks. (After April 5th, mentors will have likely completed most of their reviews).

Thank you for this question.

@CMyrick-WMF All right, thank you.