Isaac (Isaac Johnson)
User

User Details

User Since
Oct 1 2018, 2:19 PM (33 w, 2 d)
Availability
Available
IRC Nick
isaacj
LDAP User
Isaac Johnson
MediaWiki User
Isaac (WMF) [ Global Accounts ]

Recent Activity

Yesterday

Isaac updated the task description for T224053: Submit proposals for Wikimania 2019.
Tue, May 21, 8:28 PM · Research
Isaac added a comment to T222744: Prepare translations of survey.

Request posted for translations: https://meta.wikimedia.org/wiki/Research_talk:Characterizing_Wikipedia_Reader_Behaviour/Demographics_and_Wikipedia_use_cases#Translations

Tue, May 21, 7:23 PM · Research
Isaac updated the task description for T223765: Wiki Content Translation Tool Research Project.
Tue, May 21, 7:21 PM · Outreachy (Round 18)
Isaac added a comment to T223765: Wiki Content Translation Tool Research Project.

Hey @Aklapper: yes, the project is accepted. We didn't require the Phabricator submission when Outreachy applications were due because some of the students were unable to create Phabricator accounts at the time. So instead we just had Doris create this when she was accepted, for tracking going forward. Thanks for checking!

Tue, May 21, 7:21 PM · Outreachy (Round 18)

Mon, May 20

Isaac updated the task description for T222744: Prepare translations of survey.
Mon, May 20, 7:48 PM · Research
Isaac updated the task description for T222175: Isaac Reviews / Outreach work for May '19.
Mon, May 20, 4:46 PM · Research

Fri, May 17

Isaac updated the task description for T222744: Prepare translations of survey.
Fri, May 17, 9:31 PM · Research
Isaac updated the task description for T222175: Isaac Reviews / Outreach work for May '19.
Fri, May 17, 6:49 PM · Research

Thu, May 16

Isaac updated the task description for T222175: Isaac Reviews / Outreach work for May '19.
Thu, May 16, 4:16 PM · Research

Tue, May 14

Isaac updated the task description for T222175: Isaac Reviews / Outreach work for May '19.
Tue, May 14, 9:52 PM · Research

Mon, May 13

Isaac updated the task description for T222175: Isaac Reviews / Outreach work for May '19.
Mon, May 13, 11:12 PM · Research

Wed, May 8

Isaac added a comment to T217699: Better understand impact of content translation tools.

Could I ask for any kind of feedback on my analysis? It would be very useful to know what I need to pay attention to next time.

Hey @Cherrywins -- yes, I can do that. I'll email you by the end of the week using the email you provided on your application.

Wed, May 8, 10:22 PM · Outreachy (Round 18), ContentTranslation, Research

Tue, May 7

Isaac created T222744: Prepare translations of survey.
Tue, May 7, 5:09 PM · Research

Mon, May 6

Isaac updated the task description for T222175: Isaac Reviews / Outreach work for May '19.
Mon, May 6, 3:14 PM · Research

Fri, May 3

Isaac added a comment to T219660: Figure out the topic of articles translated automatically by external translation service.

There seem to be fewer STEM-related language switches on wiki. My guess is that those articles are not available in the local languages.

Yeah, I'd agree and also expect that this is somewhat Google's bias in what signals they use to choose articles to translate.

Fri, May 3, 4:53 PM · ExternalGuidance, Product-Analytics

Thu, May 2

Isaac added a comment to T219660: Figure out the topic of articles translated automatically by external translation service.

This is awesome @chelsyx !

Thu, May 2, 2:49 PM · ExternalGuidance, Product-Analytics

Tue, Apr 30

Isaac moved T220454: Research website typos from Staged to Services on the Research board.
Tue, Apr 30, 1:12 PM · Research
Isaac moved T222175: Isaac Reviews / Outreach work for May '19 from Staged to Services on the Research board.
Tue, Apr 30, 1:12 PM · Research
Isaac created T222175: Isaac Reviews / Outreach work for May '19.
Tue, Apr 30, 1:12 PM · Research

Mon, Apr 29

Isaac updated subscribers of T222078: Analyze readers' engagement in countries affected by Singapore Data Center's switch.

This looks awesome @Miriam -- just adding more particulars to what I mentioned today:

Mon, Apr 29, 9:32 PM · Research-consulting, Research
Isaac updated the task description for T219903: Keep research.wikipedia.org landing page updated.
Mon, Apr 29, 3:29 PM · Research

Apr 19 2019

Isaac added a comment to T201707: Output 3.3: Baseline statistics on contributor diversity.

Current status:

  • Privacy policy: currently working w/ Privacy now that we have a more concrete plan for the surveys
  • QuickSurveys: added functionality complete but waiting on status of two possibly related bugs (T218243 and T220627)
  • Potential for some preliminary insights: the reader demographics surveys (T203042) should provide some early insight into editor gender
Apr 19 2019, 3:57 PM · Epic, address-knowledge-gaps

Apr 16 2019

Isaac closed T215670: Pilot survey in one language as Resolved.
Apr 16 2019, 3:10 PM · Research
Isaac closed T215670: Pilot survey in one language, a subtask of T203042: Output 2.2: Characterizing readership by demographics, as Resolved.
Apr 16 2019, 3:10 PM · Epic, address-knowledge-gaps
Isaac added a comment to T215670: Pilot survey in one language.

While we are still waiting on findings related to QuickSurveys (T218243 and T220627), we have decided to move forward w/ the survey in other languages. A few notes:

  • Found supporting evidence that men do indeed read Wikipedia more frequently than women in the United States: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2617021
    • Based on survey of 1000 AMT workers from US: "Second, men use Wikipedia more often — they are twice as likely than women to use Wikipedia daily"
  • While younger respondents were consistently more likely to read Wikipedia frequently, mixed evidence from Global Insights phone surveys on gender:
    • India: women more likely to be frequent readers of Wikipedia
    • Mexico: men more likely to be frequent readers of Wikipedia
    • Nigeria: men slightly more likely to be frequent readers of Wikipedia
    • Iraq: ~equal likelihood by gender of being frequent readers of Wikipedia
  • Though we're missing EventLogging for ~10% of our responses, it actually is more likely to be missing from our younger, male readers (T220627#5113109), so if there are issues with QuickSurveys loading, this would suggest that if anything it would lead to greater skew in the demographics results
Apr 16 2019, 3:09 PM · Research

Apr 15 2019

Isaac added a comment to T220627: QuickSurveys EventLogging missing ~10% of interactions.

My other current theory is the missing 10% is possibly browsers that don't support sendBeacon

That seems unlikely. Rather, it makes sense that if you have a loading issue (per @phuedx's comment above) that is causing events not to be sent (because the EL module is not loaded), that issue will be more prevalent in older browsers, which parse and load JavaScript much more slowly than new ones.

Apr 15 2019, 8:39 PM · Readers-Web-Backlog (Tracking), Analytics, Analytics-EventLogging, QuickSurveys
Isaac updated the task description for T219903: Keep research.wikipedia.org landing page updated.
Apr 15 2019, 4:26 PM · Research
Isaac closed T212441: Finalize Survey Questions, a subtask of T203042: Output 2.2: Characterizing readership by demographics, as Resolved.
Apr 15 2019, 1:51 PM · Epic, address-knowledge-gaps
Isaac closed T212441: Finalize Survey Questions as Resolved.

From the pilot survey, the only question-related feedback we received was that the language dropdown was not displaying correctly for some, so I'll be adding the English names of each language as a fallback -- e.g., "Чӑвашла (Chuvash)"

Apr 15 2019, 1:51 PM · Research

Apr 12 2019

Isaac added a comment to T220627: QuickSurveys EventLogging missing ~10% of interactions.

i.e. it's unlikely but not impossible that QuickSurveys could be loaded and executed before EventLogging.

Apr 12 2019, 9:23 PM · Readers-Web-Backlog (Tracking), Analytics, Analytics-EventLogging, QuickSurveys
Isaac added a comment to T220627: QuickSurveys EventLogging missing ~10% of interactions.

Is there any documentation I can read on the flow of the surveys? Does the user click on a link on-wiki, that opens a Google/Qualtrics form?

Apr 12 2019, 4:29 PM · Readers-Web-Backlog (Tracking), Analytics, Analytics-EventLogging, QuickSurveys

Apr 10 2019

Isaac added a comment to T220627: QuickSurveys EventLogging missing ~10% of interactions.

Is it possible that the link to the survey is being shared outside a QuickSurvey (e.g. social media)?

Apr 10 2019, 6:33 PM · Readers-Web-Backlog (Tracking), Analytics, Analytics-EventLogging, QuickSurveys
Isaac created T220627: QuickSurveys EventLogging missing ~10% of interactions.
Apr 10 2019, 4:20 PM · Readers-Web-Backlog (Tracking), Analytics, Analytics-EventLogging, QuickSurveys

Apr 9 2019

Isaac added a comment to T220454: Research website typos.

Excellent - thanks @srodlund ! I will try to make sure we don't have to create any more of these tasks in the future too :)

Apr 9 2019, 1:48 PM · Research
Isaac updated the task description for T219903: Keep research.wikipedia.org landing page updated.
Apr 9 2019, 1:47 PM · Research

Apr 8 2019

Isaac closed T218304: Allow quicksurveys to target based on registration date as Resolved.

Looks good to me! Thanks team!

Apr 8 2019, 5:56 PM · QuickSurveys, Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.24; 2019-04-02), Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Patch-For-Review, Surveys
Isaac closed T218304: Allow quicksurveys to target based on registration date, a subtask of T216495: [EPIC] Diversity quicksurveys, as Resolved.
Apr 8 2019, 5:56 PM · QuickSurveys, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Surveys
Isaac updated the task description for T218304: Allow quicksurveys to target based on registration date.
Apr 8 2019, 5:55 PM · QuickSurveys, Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.24; 2019-04-02), Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Patch-For-Review, Surveys
Isaac updated the task description for T219903: Keep research.wikipedia.org landing page updated.
Apr 8 2019, 4:06 PM · Research

Apr 4 2019

Isaac updated subscribers of T218243: QuickSurveys will not show on mobile if the beta optin experiment is not set.
Apr 4 2019, 10:23 PM · Mobile, QuickSurveys, Readers-Web-Backlog
Isaac added a comment to T218243: QuickSurveys will not show on mobile if the beta optin experiment is not set.

Not sure if this is the same issue as in this thread or should be separated into a new task, but...

Apr 4 2019, 10:23 PM · Mobile, QuickSurveys, Readers-Web-Backlog
Isaac updated the task description for T218917: Understand Research presence on the web.
Apr 4 2019, 2:57 PM · Research
Isaac updated the task description for T218917: Understand Research presence on the web.
Apr 4 2019, 2:54 PM · Research

Apr 2 2019

Isaac moved T219903: Keep research.wikipedia.org landing page updated from Staged to Services on the Research board.
Apr 2 2019, 5:23 PM · Research
Isaac updated the task description for T219903: Keep research.wikipedia.org landing page updated.
Apr 2 2019, 5:23 PM · Research
Isaac created T219903: Keep research.wikipedia.org landing page updated.
Apr 2 2019, 5:21 PM · Research

Apr 1 2019

Isaac added a comment to T218003: Qualitative Exploration of Content Translation Tools.

Am I correct in understanding that the talk page basically records the changes made to the original article in English?

Apr 1 2019, 1:08 AM · Outreachy (Round 18), ContentTranslation, Research

Mar 30 2019

puja_jaji awarded T218003: Qualitative Exploration of Content Translation Tools a Like token.
Mar 30 2019, 6:48 PM · Outreachy (Round 18), ContentTranslation, Research
puja_jaji awarded T217699: Better understand impact of content translation tools a 100 token.
Mar 30 2019, 6:47 PM · Outreachy (Round 18), ContentTranslation, Research
puja_jaji awarded T217699: Better understand impact of content translation tools a 100 token.
Mar 30 2019, 5:36 PM · Outreachy (Round 18), ContentTranslation, Research

Mar 29 2019

Isaac added a comment to T218304: Allow quicksurveys to target based on registration date.

Are registeredBefore and registeredAfter stored in ISO 8601 format OK for you? Today's date (Fri Mar 29 2019) would be stored as 2019-03-29.

Yes, that'd be perfect. Thanks!
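
As an illustration of the format agreed on above, here is a minimal Python sketch (not the QuickSurveys implementation) of checking a registration date against ISO 8601 bounds. Only the registeredAfter / registeredBefore names and the 2019-03-29 format come from the thread; the function name and the inclusive treatment of both bounds are assumptions made for the example.

```python
from datetime import date
from typing import Optional


def registered_in_window(
    registered: date,
    registered_after: Optional[str] = None,
    registered_before: Optional[str] = None,
) -> bool:
    """Return True if `registered` falls inside the optional ISO 8601 bounds.

    Both bounds are treated as inclusive here; that is an assumption of
    this sketch, not something stated in the task.
    """
    if registered_after is not None:
        if registered < date.fromisoformat(registered_after):
            return False
    if registered_before is not None:
        if registered > date.fromisoformat(registered_before):
            return False
    return True


# The date from the comment serializes to the format discussed.
today = date(2019, 3, 29)
print(today.isoformat())
print(registered_in_window(today, registered_after="2019-03-01"))
```

Plain YYYY-MM-DD strings also sort lexicographically in date order, which is one practical reason the format is convenient for this kind of configuration.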

Mar 29 2019, 6:59 PM · QuickSurveys, Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.24; 2019-04-02), Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Patch-For-Review, Surveys

Mar 28 2019

Isaac added a comment to T218004: Quantitative Exploration of Content Translation Tools.

@NuKira that is entirely up to you whether you feel you can complete the application. Glad to hear you are still interested.

Mar 28 2019, 6:59 PM · Outreachy (Round 18), ContentTranslation, Research

Mar 27 2019

Isaac committed rRLPb9e7922cae37: Updates to team and additional papers, including arxiv links. (authored by Isaac).
Mar 27 2019, 7:13 PM
Isaac added a comment to T218004: Quantitative Exploration of Content Translation Tools.

Yes, regarding public links for PAWS notebooks: in general if you want to check what public notebooks exist for you, you can go to this URL (with your username substituted in) to see the list:
https://paws-public.wmflabs.org/paws-public/User:<username>/

Mar 27 2019, 3:07 PM · Outreachy (Round 18), ContentTranslation, Research

Mar 26 2019

Isaac added a comment to T218004: Quantitative Exploration of Content Translation Tools.

@Cherrywins : categories are not a straightforward concept on Wikipedia. I believe you can get the categories that are listed for a page (https://www.mediawiki.org/wiki/API:Categories), but this is far from a perfect solution. I would not worry about getting this perfect on a submission - if you find an approach that works, great, but I'd say more important is to discuss how you might approach this given more time.
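
For anyone trying the API route mentioned above, here is a hedged Python sketch of building the query URL. The action=query and prop=categories parameters belong to the MediaWiki Action API; the helper name, the default language, and the choice of cllimit=max are assumptions for illustration. The sketch only builds the URL and does not fetch it.

```python
from urllib.parse import urlencode


def categories_query_url(title: str, lang: str = "en") -> str:
    """Build an Action API URL listing the categories of one page.

    `categories_query_url` is a hypothetical helper name; the query
    parameters themselves are standard MediaWiki Action API parameters.
    """
    params = {
        "action": "query",
        "prop": "categories",
        "titles": title,
        "cllimit": "max",  # ask for as many categories per request as allowed
        "format": "json",
    }
    return f"https://{lang}.wikipedia.org/w/api.php?{urlencode(params)}"


print(categories_query_url("Albert Einstein"))
```

The returned list includes maintenance and hidden categories alongside topical ones, which is part of why this is far from a perfect signal of what an article is about.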

Mar 26 2019, 3:59 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac added a comment to T217699: Better understand impact of content translation tools.

@Supida_h the latter is sufficient but do your best to keep it to an amount of work that you could reasonably complete during the program.

Mar 26 2019, 3:54 PM · Outreachy (Round 18), ContentTranslation, Research

Mar 25 2019

Isaac added a comment to T217699: Better understand impact of content translation tools.

Always worth saying: thanks all for answering each other's questions and being supportive.

Mar 25 2019, 7:56 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac added a comment to T218004: Quantitative Exploration of Content Translation Tools.

@Trishla08 if you have specific questions, I or others can try to provide some assistance. General feedback is not feasible at this stage though. I am not great at troubleshooting IRC but Phabricator has been the more effective channel for discussion on this project.

Mar 25 2019, 5:38 AM · Outreachy (Round 18), ContentTranslation, Research

Mar 22 2019

Isaac updated subscribers of T217699: Better understand impact of content translation tools.

An issue raised by @Muraran : even with the removal of duplicate commas in the .text.json.gz file, there can be a trailing comma at the very end that interferes with proper loading. Here's how you can figure out what's going on when you get these errors and how to fix it:
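
The details of the original comment are not reproduced in this feed. As a stand-in, here is one possible approach in Python, offered as an assumption rather than the method actually posted: catch the JSONDecodeError, print the text around the failing character, then retry after dropping duplicate and trailing commas. The function name is hypothetical, and the regex cleanup would also alter matching comma sequences inside string values, so treat it as a rough sketch.

```python
import json
import re


def load_lenient(text: str):
    """Parse JSON, retrying once after removing stray commas."""
    try:
        return json.loads(text)
    except json.JSONDecodeError as err:
        # err.pos is the character offset reported in the error message,
        # so printing the surrounding text shows what the parser choked on.
        start = max(err.pos - 40, 0)
        print(f"Parse failed at char {err.pos}: {text[start:err.pos + 40]!r}")
        # Collapse runs of commas (",," -> ",").
        cleaned = re.sub(r",(\s*,)+", ",", text)
        # Drop a comma that sits directly before a closing bracket or brace.
        cleaned = re.sub(r",\s*([\]}])", r"\1", cleaned)
        return json.loads(cleaned)


records = load_lenient('[{"a": 1},,{"b": 2},]')
print(records)
```

For the real dump, the text would first be read from the .text.json.gz file (for example with gzip.open in text mode) before being passed to a function like this.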

Mar 22 2019, 2:05 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac merged T218972: JSONDecodeError: Expecting value: line 1 column 356418517 (char 356418516) into T217899: Duplicate commas in JSON Content Translation Dumps.
Mar 22 2019, 1:52 PM · Language-Team (Language-2019-April-June), MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Unplanned-Sprint-Work, ContentTranslation, Dumps-Generation
Isaac merged task T218972: JSONDecodeError: Expecting value: line 1 column 356418517 (char 356418516) into T217899: Duplicate commas in JSON Content Translation Dumps.
Mar 22 2019, 1:52 PM · Outreachy
Isaac added a comment to T218972: JSONDecodeError: Expecting value: line 1 column 356418517 (char 356418516).

Hey @Muraran: glad you're being proactive but in the future, reach out to me first or raise questions/bugs on the thread (T217699) or its related subtasks. I'm going to close this task and move the discussion over there.

Mar 22 2019, 1:51 PM · Outreachy

Mar 21 2019

Isaac committed rRLPa35ae2803678: Updates to team and additional papers. (authored by Isaac).
Mar 21 2019, 8:59 PM
Isaac updated the task description for T218917: Understand Research presence on the web.
Mar 21 2019, 6:07 PM · Research
Isaac closed T213847: Should be possible to sample by country as Resolved.
Mar 21 2019, 4:21 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Audiences-QA (RW-Test-Cases), Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q3), QuickSurveys
Isaac closed T213847: Should be possible to sample by country, a subtask of T203042: Output 2.2: Characterizing readership by demographics, as Resolved.
Mar 21 2019, 4:21 PM · Epic, address-knowledge-gaps
Isaac added a comment to T213847: Should be possible to sample by country.

This looks good to me. A few notes:

Mar 21 2019, 4:19 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Audiences-QA (RW-Test-Cases), Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q3), QuickSurveys
Isaac closed T139317: Allow quicksurvey to target based on edit count as Resolved.
Mar 21 2019, 1:42 PM · Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.21; 2019-03-12), Patch-For-Review, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q3), QuickSurveys
Isaac closed T139317: Allow quicksurvey to target based on edit count, a subtask of T166395: [EPIC] Make QuickSurveys more useful to survey writers, as Resolved.
Mar 21 2019, 1:42 PM · Readers-Web-Backlog (Tracking)
Isaac added a comment to T139317: Allow quicksurvey to target based on edit count.

Looks good to me. Documentation of each potential criterion:

  • Target anonymous users (wgEditCount === null) and logged-in users without edits (wgEditCount === 0)
    • minEdits undefined and maxEdits set to 0
  • Target a non-editor (wgEditCount === 0)
    • minEdits and maxEdits set to 0. Alternatively, setting maxEdits to 0 and anons to false (T186737) would also lead to just targeting logged-in users without edits.
  • Target an editor
    • minEdits set to 1, which would sample all users with at least one edit. This is based on the definition used in this task that a non-editor has zero edits and therefore an editor has at least one edit.
  • Target a user with an edit count that falls into a given range -- e.g., 5-20 edits
    • For this example, minEdits set to 5 and maxEdits set to 20. Notably, the configuration allows for flexible ranges to be set (as opposed to limiting the edit ranges to pre-defined buckets as recorded in the EventLogging schema)
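
The four cases above can be sketched as a small Python predicate. This is a reading of the documented behaviour, not the QuickSurveys code: the function name is hypothetical, None stands in for wgEditCount === null, and the rule that an anonymous user matches only when no minEdits is set is an assumption inferred from the first two cases.

```python
from typing import Optional


def matches_edit_count(
    edit_count: Optional[int],
    min_edits: Optional[int] = None,
    max_edits: Optional[int] = None,
) -> bool:
    """Return True if a user with `edit_count` is targeted.

    edit_count=None models an anonymous user (wgEditCount === null).
    """
    if edit_count is None:
        # Assumption: anonymous users pass only when no lower bound is set.
        return min_edits is None
    if min_edits is not None and edit_count < min_edits:
        return False
    if max_edits is not None and edit_count > max_edits:
        return False
    return True


# Anonymous users plus logged-in users without edits
print(matches_edit_count(None, max_edits=0), matches_edit_count(0, max_edits=0))
# Non-editors only (logged in, zero edits)
print(matches_edit_count(None, min_edits=0, max_edits=0))
# Editors in the 5-20 range
print(matches_edit_count(12, min_edits=5, max_edits=20))
```

Both bounds are inclusive in this sketch, matching the "5-20 edits" example.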
Mar 21 2019, 1:33 PM · Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.21; 2019-03-12), Patch-For-Review, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q3), QuickSurveys

Mar 20 2019

Isaac closed T186737: Let me choose whether to present a survey to logged-in or logged-out editors as Resolved.
Mar 20 2019, 8:35 PM · Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q3), Research, QuickSurveys, Surveys
Isaac closed T186737: Let me choose whether to present a survey to logged-in or logged-out editors, a subtask of T89970: Enable microsurveys for long-term tracking of editing experience, as Resolved.
Mar 20 2019, 8:35 PM · QuickSurveys (Surveys), Surveys, MediaWiki-Page-editing
Isaac added a comment to T186737: Let me choose whether to present a survey to logged-in or logged-out editors.

Thanks @Jdlrobson for making that update. All looks good to me (I'll update the task description and resolve)! Documentation for each acceptance criterion:

Mar 20 2019, 8:34 PM · Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q3), Research, QuickSurveys, Surveys
Isaac added a comment to T186737: Let me choose whether to present a survey to logged-in or logged-out editors.

@Jdlrobson : before I sign off on this, I think the Developer Notes in the task description are the opposite of what the functionality actually does. Before I update them, I wanted to make sure I wasn't misinterpreting:

Mar 20 2019, 7:37 PM · Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q3), Research, QuickSurveys, Surveys

Mar 19 2019

Isaac updated the task description for T217699: Better understand impact of content translation tools.
Mar 19 2019, 9:52 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac added a comment to T217699: Better understand impact of content translation tools.

Hey all - considering that PAWS was unreachable for a while and this project was posted later in the cycle, I am going to extend the deadline for working on this until April 2nd. That gives you another two weeks to explore the data and begin to generate questions / analyses that you could build on in a summer project. I'll update Outreachy's website as well.

Mar 19 2019, 9:37 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac added a comment to T218004: Quantitative Exploration of Content Translation Tools.

Is there a way to get the length of each article (i.e., number of bytes), or do I have to perform scraping to get that information?

Mar 19 2019, 9:35 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac updated the task description for T218304: Allow quicksurveys to target based on registration date.
Mar 19 2019, 2:44 PM · QuickSurveys, Audiences-QA (RW-Test-Cases), MW-1.33-notes (1.33.0-wmf.24; 2019-04-02), Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q4), Patch-For-Review, Surveys

Mar 18 2019

Isaac updated subscribers of T218168: Content Translation Parallel Corpus API and Dumps have different data.

The dumps exclude sections which have no user translation, because that is not useful information in a comparable corpus. It seems the API does not do this filtering.

Mar 18 2019, 10:52 PM · Dumps-Generation, ContentTranslation
Isaac added a comment to T218004: Quantitative Exploration of Content Translation Tools.

@Israashahin : thanks for letting me know that you needed an answer to that as the original comment has been deleted. The example notebook that I provided to you ( https://paws-public.wmflabs.org/paws-public/User:Isaac_(WMF)/Content%20Translation%20Example.ipynb ) has a link to examples of how to do that under the Quantitative Analyses section. If you have more specific questions, let me know.

Mar 18 2019, 3:23 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac added a comment to T218004: Quantitative Exploration of Content Translation Tools.

I am trying to access Page History for the translated page, but it doesn't work. It works as planned with the source page though.

Mar 18 2019, 2:02 PM · Outreachy (Round 18), ContentTranslation, Research

Mar 17 2019

Isaac added a comment to T217699: Better understand impact of content translation tools.

@Supida_h yes - while I'd prefer that you upload to PAWS and submit that link, if the service is not responding, a GitHub link that is open would be acceptable as well.

Mar 17 2019, 1:34 PM · Outreachy (Round 18), ContentTranslation, Research

Mar 16 2019

Isaac added a comment to T218004: Quantitative Exploration of Content Translation Tools.

Thanks @XinyueWang1 for offering assistance!

Mar 16 2019, 4:39 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac added a comment to T217699: Better understand impact of content translation tools.

@Supida_h and @NuKira thanks for alerting me to the JupyterHub issue. I'll continue to monitor and hopefully it clears up soon, but I'll take that into account.

Mar 16 2019, 4:38 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac added a comment to T218003: Qualitative Exploration of Content Translation Tools.

@Mansi29ag and @Supida_h : Glad you're looking into this -- I believe those statistics are for the initial translation (not what happens afterwards, which is one reason that this is an important research project) and indicate what proportion of content is translated over and whether it was created by humans or came from the machine translation. Because it's based on word count, if the translated article has more words than the source article, this would result in a number over 1. For example if the source article had 1000 words and the translated article had 1200 words, then this would result in 1.2 for any and if half of those 1200 words were suggested by the machine translation and half was added by the editor, then that would be 0.6 for mt and 0.6 for human.
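
The arithmetic in that example can be written out as a short Python sketch. The any / mt / human names come from the comment; the function name and its inputs are hypothetical, and the formula is simply the word-count ratio as described, not code taken from Content Translation.

```python
def translation_ratios(source_words: int, mt_words: int, human_words: int) -> dict:
    """Word-count ratios of the translated article relative to the source.

    mt_words: words in the translation suggested by machine translation.
    human_words: words in the translation added by the editor.
    """
    translated_words = mt_words + human_words
    return {
        "any": translated_words / source_words,
        "mt": mt_words / source_words,
        "human": human_words / source_words,
    }


# Source article of 1000 words, translation of 1200 words split evenly
# between machine-suggested and editor-added text.
print(translation_ratios(source_words=1000, mt_words=600, human_words=600))
```

Because every ratio is divided by the source word count, "any" exceeds 1 whenever the translation is longer than the source, and "mt" plus "human" sum to "any".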

Mar 16 2019, 4:24 PM · Outreachy (Round 18), ContentTranslation, Research

Mar 15 2019

Isaac added a comment to T218003: Qualitative Exploration of Content Translation Tools.

@NuKira at this point I understand that not everyone will have experience with qualitative methods and so do not worry if you're not certain of the right approach. What you should focus on is whether you can generate some questions or hypotheses around the content translation tool. This can be aimed at the types of content that are / are not translated or what happens after an article is translated. So go through some of the articles that have been translated and look for patterns. For example: do you see that overview content is translated but that more detailed specifics of an article are often left behind? If so, maybe give some examples of sections that correspond to each. Do you find that new content that is more culturally-specific is added to the translated article after it has been created?

Mar 15 2019, 5:26 PM · Outreachy (Round 18), ContentTranslation, Research
Capt_Swing awarded T218004: Quantitative Exploration of Content Translation Tools a Stroopwafel token.
Mar 15 2019, 3:34 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac added a comment to T217699: Better understand impact of content translation tools.

Welcome everyone who has joined in the past few days! As you may see from the others, feel free to ask questions and let me know if you're running into challenges with getting started on this research. It's an open-ended task so don't be discouraged!

Mar 15 2019, 2:15 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac added a comment to T218003: Qualitative Exploration of Content Translation Tools.

Hey @Mansi29ag : is this the notebook you're trying this out in: https://paws-public.wmflabs.org/paws-public/57510755/Untitled.ipynb

Mar 15 2019, 2:07 PM · Outreachy (Round 18), ContentTranslation, Research

Mar 14 2019

Isaac added a comment to T218243: QuickSurveys will not show on mobile if the beta optin experiment is not set.

@Jdlrobson Revising a bit from what I said on IRC: for our recent survey (March 4/5) on English Wikipedia, I looked at the distribution of webhost (en vs. en.m) and browser family (Chrome, Chrome Mobile etc.) for those who received the survey (QuickSurveyInitiation) vs all of the webrequests to en.wikipedia for that same time period. It seems less about mobile vs. desktop and more about specific browsers or OSes. See below (and I'm happy to talk more):

Mar 14 2019, 7:38 PM · Mobile, QuickSurveys, Readers-Web-Backlog
Isaac added a comment to T212444: Run demographics survey in one or more Wikipedia languages.

Update: see T215670#5024817 for current blocking issues with pilot that prevent full surveys from moving forward.

Mar 14 2019, 5:38 PM · Research
Isaac added a comment to T215670: Pilot survey in one language.

Current status and findings:

  • We are concerned about how much selection bias we may be seeing in the pilot results (i.e. is the survey reaching a representative sample of readers or not).
    • We are seeing a much higher proportion of younger users and users who identify as men than we expected. This could be because these users truly read Wikipedia more frequently (and so are more likely to be included in the survey) or it could be due to higher rates of self-selection into the survey. We will evaluate whether these trends are consistent by country and other demographics.
  • We are concerned about whether certain bugs are affecting the QuickSurvey sampling and logging and would like to address these (or at least better understand them) before moving on:
    • T218243 which would mean mobile is being undersampled. This matches what we see in our survey, which is that while e.g., ~20% of English Wikipedia readers use Chrome, 34% of the devices that saw the survey used Chrome
    • Approximately 12% of our survey responses cannot be matched to QuickSurveyInitiation EventLogging. The reason for this is unclear: almost all of the survey codes from these responses look normal and the timestamps are relatively evenly distributed across the survey deployment.
    • Approximately 18% of our survey responses cannot be matched to QuickSurveysResponses EventLogging: this higher percentage is possibly due to QuickSurveysResponses EL not being captured if the user explicitly right-clicks and opens the survey in a new tab (verified by me and see T131315#2311065 for related issue)
Mar 14 2019, 5:31 PM · Research
Isaac closed T212446: Work with Legal to develop a privacy statement for the new round of surveys as Resolved.

https://foundation.wikimedia.org/wiki/2019_Wikipedia_Demographics_Survey_Privacy_Statement

Mar 14 2019, 5:15 PM · Research
Isaac closed T212446: Work with Legal to develop a privacy statement for the new round of surveys, a subtask of T203042: Output 2.2: Characterizing readership by demographics, as Resolved.
Mar 14 2019, 5:15 PM · Epic, address-knowledge-gaps
Isaac added a comment to T218003: Qualitative Exploration of Content Translation Tools.

The first one: why don't I have a translation for the parallel translation? Is it because the Arabic translation is only for the general description of the articles I chose (not the whole article), or does the download of the dump file have a problem?

Mar 14 2019, 5:11 PM · Outreachy (Round 18), ContentTranslation, Research

Mar 13 2019

Isaac added a comment to T210813: Analysis of short term impact of Mexico campaign to site and landing page traffic.

Hey @atgo: can we close this task out or are there still questions around the short-term analysis that you're waiting on? Thanks!

Mar 13 2019, 6:07 PM · Product-Analytics, New-Readers
Isaac triaged T218003: Qualitative Exploration of Content Translation Tools as Normal priority.
Mar 13 2019, 6:06 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac triaged T218004: Quantitative Exploration of Content Translation Tools as Normal priority.
Mar 13 2019, 6:06 PM · Outreachy (Round 18), ContentTranslation, Research
Isaac added a comment to T212442: Finalize choice of Wikipedia languages for running the demographics survey.

Volunteers / discussion being tracked here: https://meta.wikimedia.org/wiki/Research_talk:Characterizing_Wikipedia_Reader_Behaviour/Demographics_and_Wikipedia_use_cases

Mar 13 2019, 6:05 PM · Research
Isaac triaged T217699: Better understand impact of content translation tools as Normal priority.
Mar 13 2019, 6:03 PM · Outreachy (Round 18), ContentTranslation, Research