
Apr 2 2019

AggNisha added a comment to T217699: Better understand impact of content translation tools.
Apr 2 2019, 4:12 PM · Outreachy (Round 18), ContentTranslation, Research
AggNisha added a comment to T217699: Better understand impact of content translation tools.

Can anyone help me with the application submission? Do we submit the final application only on the Outreachy website, or do we also have to write a separate proposal for it?

Apr 2 2019, 11:41 AM · Outreachy (Round 18), ContentTranslation, Research

Mar 27 2019

AggNisha added a comment to T218004: Quantitative Exploration of Content Translation Tools.

@Isaac I encountered a few problems, could you please help me with them?

  • Can the sections in the dump file differ from those returned by contenttranslationcorpora? For some translation IDs, I found that the dump file contains fewer sections than contenttranslationcorpora, and I could not work out why.
  • How can we get the type of edit that was made? In prop=revisions, the 'rvprop' parameter can be set to return tags and comments, but what about revisions for which no tags or comment were given?
  • Is the 'any' attribute calculated over only the sections translated, or over the whole article?
  • I found a few sections in contenttranslationcorpora for which 'content' exists for 'source' but not for 'mt' or 'user'. If such a section was not included in the translated version at all, why is it listed among the translated content?

Thanks
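
For reference, the second and fourth questions can be made concrete with a small Python sketch. It is only an illustration under stated assumptions: `rvprop` values `tags` and `comment` are real options of `prop=revisions`, but the helper names are hypothetical, and the shape assumed for the contenttranslationcorpora sections (a dict of section id to `source`/`mt`/`user` entries, each with a `content` field) is inferred from the description above rather than confirmed.

```python
from typing import Any, Dict, List


def revision_query_params(title: str, limit: int = 10) -> Dict[str, Any]:
    """Build action=query parameters asking for tags and comments per revision."""
    return {
        "action": "query",
        "prop": "revisions",
        "titles": title,
        "rvprop": "ids|timestamp|tags|comment",
        "rvlimit": limit,
        "format": "json",
        "formatversion": 2,
    }


def describe_revision(rev: Dict[str, Any]) -> Dict[str, Any]:
    """Normalise one revision so missing tags/comment become empty values."""
    return {
        "revid": rev.get("revid"),
        "tags": rev.get("tags") or [],
        "comment": rev.get("comment") or "",
    }


def sections_without_translation(sections: Dict[str, Dict[str, Any]]) -> List[str]:
    """Return ids of sections that have source content but no 'mt' or 'user' content.

    ASSUMPTION: each section looks like
    {"source": {"content": ...}, "mt": {...}, "user": {...}}.
    """
    missing = []
    for section_id, parts in sections.items():
        has_source = bool((parts.get("source") or {}).get("content"))
        has_target = any(
            (parts.get(kind) or {}).get("content") for kind in ("mt", "user")
        )
        if has_source and not has_target:
            missing.append(section_id)
    return missing
```

A revision with no tags and no edit summary then simply yields an empty list and an empty string, so the edit type would have to be inferred some other way (for example from a diff), which is outside this sketch.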

Mar 27 2019, 4:41 PM · Outreachy (Round 18), ContentTranslation, Research

Mar 25 2019

AggNisha added a comment to T217699: Better understand impact of content translation tools.

Hey @Batoulkh12, have you tried just removing the 'to' parameter? It works for me, but it returns at most 500 articles per request, as cxpublishedtranslations always does. Sometimes it returns an empty list because the 'offset' parameter is too high; just try decreasing its value. For all articles translated from Hindi, it works when the offset is set to 900.
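
A minimal Python sketch of that paging idea follows. It assumes the `from`, `to`, `limit` and `offset` parameters behave as described in this thread; the function names and the `fetch` callable (which would wrap the actual HTTP request and return the list of translations for one page) are hypothetical.

```python
from typing import Any, Callable, Dict, Iterator, List, Optional


def published_translation_params(
    source: str, target: Optional[str] = None, offset: int = 0, limit: int = 500
) -> Dict[str, Any]:
    """Build parameters for list=cxpublishedtranslations; 'to' is left out if target is None."""
    params: Dict[str, Any] = {
        "action": "query",
        "list": "cxpublishedtranslations",
        "from": source,
        "limit": limit,
        "offset": offset,
        "format": "json",
    }
    if target is not None:
        params["to"] = target
    return params


def iter_published_translations(
    fetch: Callable[[Dict[str, Any]], List[Dict[str, Any]]],
    source: str,
    target: Optional[str] = None,
    limit: int = 500,
) -> Iterator[Dict[str, Any]]:
    """Yield translations page by page, stopping at the first empty page.

    An empty page is taken to mean the offset has passed the last result.
    """
    offset = 0
    while True:
        batch = fetch(published_translation_params(source, target, offset, limit))
        if not batch:
            return
        yield from batch
        offset += len(batch)
```

Starting at offset 0 and advancing by the size of each returned page avoids guessing an offset that is already too high.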

Mar 25 2019, 2:28 AM · Outreachy (Round 18), ContentTranslation, Research

Mar 24 2019

AggNisha added a comment to T218004: Quantitative Exploration of Content Translation Tools.

@Isaac I was trying to get a particular page's information on Hindi Wikipedia when all I have is its translation ID (for a translation from English to Hindi), which I got from the dump file. I thought I could add some parameter to the mwapi request (action=query), but the only ways I could find to identify a page are pageids and titles. I tried connecting it to the cxpublishedtranslations API, but it returns at most 500 results, and the given translation ID might not be among them. Am I missing something, or is there a way around this?
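
One possible workaround, sketched in Python under assumptions: scan the published-translation records for the wanted ID, then query the target wiki by title. The record field names `translationId` and `targetTitle` are assumed here and should be checked against an actual API response; both helper names are hypothetical.

```python
from typing import Any, Dict, Iterable, Optional


def find_translation(
    records: Iterable[Dict[str, Any]], translation_id: Any
) -> Optional[Dict[str, Any]]:
    """Return the first record whose (assumed) 'translationId' matches, else None.

    IDs are compared as strings because a dump and an API may disagree on int vs str.
    """
    wanted = str(translation_id)
    for record in records:
        if str(record.get("translationId")) == wanted:
            return record
    return None


def page_info_params(target_title: str) -> Dict[str, Any]:
    """Build an action=query request for basic page info, keyed by title."""
    return {
        "action": "query",
        "prop": "info",
        "titles": target_title,
        "format": "json",
    }


# Usage with made-up records standing in for API results.
records = [
    {"translationId": "101", "targetTitle": "Example A"},
    {"translationId": "202", "targetTitle": "Example B"},
]
match = find_translation(records, 202)
params = page_info_params(match["targetTitle"]) if match else None
```

Because `records` can be any iterable, it can be fed from every page of results rather than only the first 500.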

Mar 24 2019, 5:41 PM · Outreachy (Round 18), ContentTranslation, Research