- Based on 1A, the volume of visits coming from integrated translated results in Google search is similar to the volume of visits explicitly requesting a translation (e.g., by going to Google Translate or clicking the "translate this page" option in the search results).
Correct. Note that this graph only shows traffic where the translation target language is Indonesian. You may also notice that from January to February, visits from integrated translated results were much higher, and that they have dropped since March. I asked @dr0ptp4kt, and he thinks Google probably changed their algorithm.
Mar 27 2019
Mar 25 2019
Mar 21 2019
In the screenshot from @Peter, the proxying now seems fairly clear. Obviously the visual nudge is to actually turn on the feature, but the terminology seems pretty clear. In this regard, this makes it more on par with something like Opera.
Mar 20 2019
Thank you, @Krinkle.
I don't have my Android device handy, but this is not applying Google Web Light, correct? Would it be possible for someone to post screenshots of the resultant treatment for our pages?
Mar 11 2019
Thanks @ovasileva.
Yes, @Legoktm, thanks!
Mar 10 2019
Thanks @Legoktm. Yes, please, Joe Walsh should be added, too.
Mar 8 2019
Thanks. Looking forward to confirmation.
^ Well, I intended for that to be on email. But it stands: I think Olga put this in terms that I could understand - and as I've said in other places, I think the implementation is non-trivial even if the consequences can be studied sufficiently to be well understood. That said, what is this "exploring sharing entire articles or portions of articles" part about?
Thanks. That's cool, although what should we do to reinstate my privileges in the short run? The specific request right now is to get Natalia H granted group membership, although I want to ensure that both Joe W and I will have owner privs more generally in the apps repos.
Great framing, nice job! One question, though, what's this part about and how does that tie into the conversation?
Mar 7 2019
@Krenair created task here. Previously I had sufficient permissions as I recall, but it seems like there's been an update (TBH I may have missed a note somewhere).
Thanks @Krenair
Hi there - re-opening, although please let me know if I should open a fresh task.
Mar 5 2019
Mar 4 2019
@chelsyx As it is, the "Access the translated funnel" line makes the other parts of the funnel look compressed in the "Number of events when target language is Indonesian, by action type" graph. It's a true representation of the magnitude, of course, but I was wondering if you had an approach that might aid visual interpretation of the data (e.g., two y-axes, non-constant scale, percentile fluctuation, etc.).
Mar 1 2019
Thanks @Pginer-WMF. I've put a HOLD on the calendar for March 6 to get the Varnish patch up a little bit ahead of this, although I will adjust as needed for any change in the activation for the extension (or the schedule of @BBlack and myself the day prior to the activation of the extension).
Feb 28 2019
Thanks, @santhosh. When you say "context detection code", I take that to mean inclusive of this init code we're referring to for the "Desktop" footer link removal.
Feb 26 2019
Thanks, @BBlack, will give the heads up once the date is set.
Feb 25 2019
Would it be possible to clarify the wording on "There is no existing FLOSS software that provides the same functionality"? I believe the intent here is about surveying the FLOSS ecosystem for well crafted, well maintained, architecturally compatible FLOSS software that provides comparable functionality before specifying and building new non-trivial standalone services.
Feb 22 2019
I don't see this as an urgent priority, although planning it as a small piece of work for a future quarter would be fine. We could then share this with the mailing lists and contacts we have at places where people are employing these sorts of algorithms in their own code.
This sort of algorithm is in use in several prominent high-scale media properties, but people are recreating the work in their specific cases, as opposed to having one easy-to-call API that reflects this line of thinking. The idea was to expose something that, given a title, produces the correct revision. I strongly agree that it should also take into consideration whether that last correct revision is reportedly non-damaging (and scrub backwards further if not), as sometimes humans can't keep up with the backlog.
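To make the "scrub backwards" idea concrete, here is a minimal sketch, not an existing API. The `Revision` type, the `damaging_score` field, and the 0.5 threshold are all hypothetical stand-ins; a real service would need to fetch the history for a title and get a damage signal from somewhere.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Revision:
    rev_id: int            # assumption: a higher id means a more recent revision
    damaging_score: float  # hypothetical 0..1 score from some damage classifier


def latest_non_damaging(
    history: Sequence[Revision], threshold: float = 0.5
) -> Optional[Revision]:
    """Walk from newest to oldest and return the first revision below the threshold."""
    for rev in sorted(history, key=lambda r: r.rev_id, reverse=True):
        if rev.damaging_score < threshold:
            return rev
    return None  # nothing in the history looks safe to serve


# Made-up history for one title: the two newest revisions look damaging.
history = [
    Revision(rev_id=101, damaging_score=0.05),
    Revision(rev_id=102, damaging_score=0.10),
    Revision(rev_id=103, damaging_score=0.92),
    Revision(rev_id=104, damaging_score=0.88),
]
chosen = latest_non_damaging(history)
```

With this made-up history the function skips revisions 104 and 103 and settles on 102. Returning `None` when nothing qualifies leaves the fallback policy to the caller.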
Feb 21 2019
@BBlack in https://gerrit.wikimedia.org/r/490120 I checked in with @Pginer-WMF today. Pau said deploying this the day or two prior to ExternalGuidance being activated for the source wiki of enwiki (for Indonesian) would be ideal.
Feb 20 2019
Thanks @TheDJ.
Thanks. I'm not sure if something changed in a Scribunto module or somewhere in extension land, but it doesn't seem like it's really turning up on enwiki source, at least - there are some Village Pump discussions on this.
Feb 14 2019
Hi all - I was aware of this task but hadn't been following it. But it was brought to my attention as having some momentum, so here I am! I have some information I can dredge up that I think may help shed some light on some paths forward. I also want to check in with some product and design people about any sense on forthcoming product interventions in the area of interactive or, for that matter, materialized graphs.
Feb 13 2019
For those following along, I ran a query to get a sense of global usage of the "Desktop" link by Google Translate users. On 11 February 2019 there were only 89 such requests globally, about 2/3 of which had enwiki as the source wiki. This figure is not a perfect predictor of desktop user behavior, because the mobile treatment will be new for desktop users with enwiki as the source wiki. But it probably suggests that, in addition to the rationale @Pginer-WMF provides about no longer showing broken stuff, the mobile read view is okay for consumptive purposes in general.
Thanks, @santhosh !
Feb 12 2019
@BBlack ^ would you please review the enwiki VCL patch? We'll only want to merge it after ExternalGuidance has been tested with simplewiki and @Pginer-WMF has given the greenlight, but I figured it best if we go through review ahead of that.
@santhosh ^ would you please review and verify it has the intended effect? I need to reset my Vagrant stuff, but figured this was simple looking enough to post a patch (we'll see if I'm right!).
Feb 11 2019
Heads up @chelsyx: for simplewiki access via the Google Translate proxy, the traffic pattern is now mobile web based even for desktop UAs. The same will happen with enwiki when we make that change later. I thought I should make this clear for any intervention analysis.
@santhosh and @Gilles, the footer list containing the "Desktop" link and other list items places the dot character between elements using an li::after pseudo-element. Do you think we should just use JS to remove the "Desktop" <li> instead of using a CSS rule? Setting the opacity to 0 like the other hidden elements would leave the dot character for any preceding bullets in place, which looks unusual because it leaves a dot at the end of the list. If we use JS, is there a preferred segment of the JS code to do so to avoid any performance issues?
Feb 10 2019
I should note that, concerning the edge case regression, it has historically been the case that, even after a mobile Google Translate user taps/clicks the "Desktop" link and lands in a desktop presentation, subsequent taps/clicks on article links wind them back into a mobile presentation. This is because cookies cannot be enforced in this translation context the way they are when users access the site directly. So addressing the regression for the mobile user is only a modest improvement. But for the genuine desktop user, it may not be perceived as so modest, because the desktop user may have been accustomed to the desktop treatment (even if we think the mobile treatment is more reader friendly). I'll need to check with @Pginer-WMF about the mental model and nearer term (next six to twelve months) user journey here and determine how to handle this, in a sustainable way, if at all. Given that users can use inbuilt translation in user agents, we may just want to notify the user to do that upon the "Desktop" click/tap, by way of ExternalGuidance JavaScript, for example.
Feb 8 2019
Hi team.
Jan 18 2019
Okay, @BBlack, now it's ready for review.
@BBlack hold that thought, one more condition to add.
@BBlack patch posted for your review ^. Would you please review and let me know on patch for any additions?
Jan 9 2019
Hi @BBlack , any suggestion here?
Jan 8 2019
Jan 7 2019
Paraphrasing a dialogue with @BBlack: immediate edge-side HTTP redirects based on header/regex might be feasible without fragmenting caches/backends.
Dec 5 2018
Approved.
Nov 28 2018
Nov 15 2018
Nov 11 2018
Nov 9 2018
Nov 7 2018
@Nuria thanks. You understood the question well. Okay, so my read of sessionInSample and randomTokenMatch is that the populationSize values between different schemas would need to have a common base value so that they divide cleanly in order to guarantee intersection, as it's a divisor in a modulo calculation. Do I have that right?
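A small sketch of the divisibility point, assuming the sampling check reduces to "numeric token modulo populationSize equals zero". The `in_sample` helper below is a hypothetical stand-in for illustration, not the actual sessionInSample or randomTokenMatch code.

```python
def in_sample(token_value: int, population_size: int) -> bool:
    """Hypothetical modulo-based check: keep 1 in population_size tokens."""
    return token_value % population_size == 0


def sampled(tokens, population_size):
    """Return the set of tokens that fall in the sample for this rate."""
    return {t for t in tokens if in_sample(t, population_size)}


tokens = range(1, 601)  # stand-in for numeric session tokens

one_in_10 = sampled(tokens, 10)
one_in_100 = sampled(tokens, 100)
one_in_15 = sampled(tokens, 15)

# 10 divides 100, so every session in the 1:100 sample is also in the 1:10 sample.
nested = one_in_100 <= one_in_10

# 10 does not divide 15, so the 1:15 sample is not contained in the 1:10 sample;
# the two only overlap on multiples of their least common multiple (30).
not_nested = one_in_15 <= one_in_10
overlap = one_in_10 & one_in_15
```

Under that assumption, one schema's sessions are guaranteed to be a subset of another's only when one populationSize is a multiple of the other.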
Nov 6 2018
The question of whether you can sample events per session with stickiness is a different one, and the answer to that is yes: as of today you can do that deterministically and decide that event1 and event2 are always going to be sampled for session "25". Session here means "identifier assigned to your browser until you close it down". This identifier is sent in eventlogging events but it is not sent in general requests. It will be reset when you restart your browser.
- IE11+ (6.8%)
- Safari 5.1-11.2 (1.7%)
- Opera 15-26
- iOS Safari 8-11.3
- After a brief peek at pageviews_daily in Turnilo, this looks like ~0.7%
Yes, we discussed collision avoidance as part of T201124 and increased the length of mw.user.sessionId() to a value that should be safe for all foreseeable scenarios (see in particular T201124#4521002). I'm not quite sure what salting and hashing have to do with that though.
- For unique device:
- (an example that can include both scenarios you mentioned above) Any kind of experiment or data collection that requires asking the same unique device multiple questions across a period of time. For example, when we want to learn about how users "learn" on Wikipedia, we need to be able to interfere with their experience on Wikipedia in multiple stages of their interaction and ask them questions. Not being able to say which unique device has answered the first batch of questions is a blocker for this line of research.
Thanks for the review. The User-Agent field is that of the end user's device.
Nov 5 2018
Nov 2 2018
Oct 31 2018
Follow up here: Kosta and I spoke, and we don't need the token, as logging should take place on a per-user basis, not just on a per-session basis. So the key will be constructed by hashing two non-sensitive items. This is an okay approach in my view given the requirements.
@Bawolff I added a question in the patchset about getToken(). Basically, although the cost of computing a rainbow table to reverse engineer the hashed values of getToken() in case of someone spilling Redis keynames is moderately high, I wanted to check whether there's even a risk if an attacker does so. If there is, I'm thinking we should instead take just a portion of the token (I'm working from the assumption this is tied to something fixed between the client and the server - a cookie issued post login) and the user's numerical ID, concatenate those, and then hash that concatenated value for setting the keyname - that would still be basically collision free for keynaming purposes.
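For illustration only, the "portion of the token plus numeric user ID, then hash" idea might look like the sketch below. The function name, the 16-character slice, the `example` namespace, and the choice of SHA-256 are assumptions made for this sketch, not what the patchset does.

```python
import hashlib


def make_key_name(
    user_id: int, token: str, token_chars: int = 16, namespace: str = "example"
) -> str:
    """Build a key name from a hash of the user ID and part of the token."""
    # Use only a portion of the token, so the full value is never an input
    # to anything that ends up stored as a key name.
    token_part = token[:token_chars]
    # The ":" separator keeps (12, "3abc") and (123, "abc") from colliding.
    material = f"{user_id}:{token_part}"
    digest = hashlib.sha256(material.encode("utf-8")).hexdigest()
    return f"{namespace}:{digest}"


# Made-up inputs: the same user and token always map to the same key name.
key_a = make_key_name(42, "0123456789abcdef0123456789abcdef")
key_b = make_key_name(43, "0123456789abcdef0123456789abcdef")
```

Because the hash is deterministic, the same user and token always yield the same key, while different users get different keys even with an identical token portion.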
Oct 30 2018
@leila to clarify, which of the following do you desire?
Oct 23 2018
Thanks, @Ottomata.
Oct 21 2018
@Tbayer @Neil_P._Quinn_WMF @chelsyx @mpopov @nettrom_WMF curious about your thinking here for session overlap between events that are sent at the global (perhaps per-project, if we need that) default and those that are oversampled for the sessions.
Oct 18 2018
@phuedx do you think it might be sensible to simply make sendBeacon a pre-requisite at this point for client side event logging?
Oct 12 2018
I'm interested in filling out the TODOs here.
Oct 11 2018
The understanding from @Tbayer is correct about this task being separate from the question on retention beyond 90 days.
Oct 10 2018
Sep 26 2018
Sep 14 2018
@JKatzWMF adding you in here. I believe reading-platform may have been your dashboard based on [T102036#1524744]. Okay to delete?
Aug 29 2018
This appears to be related to fully qualified links in the page. Links to [[articles]] are automatically rewritten for mdot if on mdot already IIRC.
Aug 2 2018
Ah, interesting point @Nemo_bis . @Varnent @srishakatux who's the audience for people clicking through from https://wikimediafoundation.org/technology/ ?
That's fine. I have some suggestions on the content on https://wikimediafoundation.org/technology/ and https://www.mediawiki.org/wiki/New_Developers#Choose_a_software_project , but I'll take that to an email thread.
Jul 30 2018
Thanks, @Rfarrand for closing. Further summary posted at https://www.mediawiki.org/w/index.php?title=Wikimedia_Developer_Summit%2F2018%2FKnowledge_as_a_Service&type=revision&diff=2840141&oldid=2699409
Jul 27 2018
Update to the ticket: webmaster console access has been provided to Ian for https://it.m.wikipedia.org/ and https://it.wikipedia.org/ for investigation/remediation.
Jul 26 2018
What's needed?
Jul 20 2018
Oh, interesting: https://developer.apple.com/videos/play/wwdc2018/204/
Jun 25 2018
Marking this as resolved. Thanks @mforns for the review.