User Details
- User Since
- May 19 2025, 3:26 PM (38 w, 4 d)
- Availability
- Available
- LDAP User
- Guilherme Gonçalves
- MediaWiki User
- GGoncalves-WMF [ Global Accounts ]
Yesterday
A quick note about the trending indicator: in T409601, DPE onboarded a data pipeline for WME that classifies articles as trending according to (I think) this methodology. You can see a sample query here. Naturally, your definition of trending doesn't have to agree with that, but this could be a starting point.
Tue, Feb 10
Mon, Feb 9
In general, I think this is good to consider, though I'm not seeing a very strong priority signal just yet. Can we estimate how incorrect our current completeness estimate is, e.g. with a one-time comparison to wmf.mediawiki_history?
This is great, thanks for the detailed proposal @JMonton-WMF !
Fri, Feb 6
Wed, Feb 4
Having spoken to @Ladsgroup this morning (thanks!), here's my notes on this task.
Thu, Jan 29
Update (from Slack): cuc_agent and cu_changes are no longer bring read in MediaWiki, and writing will tentatively stop on the week of Feb 9.
Dec 22 2025
Dec 9 2025
Sorry I'm late to this, but I basically second Andrew's comment. I think there are two things at play here:
Dec 8 2025
@fkaelin and I just chatted a little more about this, quoting here:
Nov 25 2025
Thanks for checking! What annotation do you have in mind? Something like, "Here's when we enabled temp accounts, and they are included under Anonymous edits"?
Looking at the attached sheet, I see the following must-have actions listed as "complicated":
Nov 24 2025
From a quick discussion at the DE team meeting:
This looks like a very reasonable change, and Q3 sounds feasible; I'm not entirely sure how complex this is, but it looks more like "weeks" of effort than "quarters" or "days".
Nov 20 2025
Sorry I'm a bit late to this - I just caught up a bit on the incident this morning.
Oct 29 2025
I've used the link to set up a donation. I get to the "thank you" page normally on my browser.
Oct 21 2025
@calbon mentioned today that we want to make sure to capture the referrals from x.com/twitter.com . We do look for Twitter in the Referer header, but not X.
Oct 14 2025
Unfortunately it looks like we won't be able to prioritize this for the next couple of months in Data Engineering, but it's still something we want to get to. We'll revisit this in our next planning period towards the end of the year.
Oct 9 2025
@JAllemandou and @OSefu-WMF just had a chat about this one and how we can address it, now and in the future.
Sep 26 2025
Hi, just a quick update after my chat with Sukhbir. We should do this, not only for the value of the dataset itself, but also because it will be an excellent opportunity to make this kind of release a more documented and repeatable process.
Sep 25 2025
I took a look at the upstream issue tracker and found this issue, which looks related. But that's from 2021 and the associated patch, which allows users to view draft dashboards, apparently went into the the 1.2.0 release according to the GitHub tags on the right. We seem to be running Superset v4.2.0 at the moment.
Sep 19 2025
Thanks Valentín, I'm thinking probenet can be a useful signal, but our current focus is to experiment with specific known signals (e.g. presence of DOM properties). We'd like a bit more flexibility to deploy custom logic, and also control over what % of traffic we collect those signals from. More fundamentally, we believe we'll need the X-Request-Id correlation to even be able to evaluate with confidence whether client-side signals, include probenet, are helpful or not.
Sep 15 2025
Sep 10 2025
That is a good question, I was just quoting @BTullis 's wisdom in the original ticket (T383175#10440220).
Sep 8 2025
Nice, this makes a lot of sense! I think Jupyter users are generally used to going to either #data-platform-sre, #talk-to-data-engineering or (less frequently) #working-with-data for support.
Sep 2 2025
Excellent, thanks for coming up with this! It's very cool to see the performance of the real pipeline. My thoughts the initial question:
Aug 27 2025
Thanks KC, and agreed: if we need to make this into a dashboard for ongoing tracking, we should look into importing those logs into the Data Lake.
Thanks Halley, a couple of comments and follow-up questions please:
Aug 26 2025
You're welcome, happy to help!
Aug 20 2025
We had to ask around a little, and unfortunately we don't have a precise answer as we don't really instrument device orientation directly. However, we do instrument device viewport width (aggregated to buckets) for mobile web page views and keep it for 90 days, and I think that can help give you an approximate answer.
Aug 14 2025
Aug 11 2025
Aug 6 2025
I met with @mfossati to discuss this. The main takeaways are:
Jul 8 2025
Jul 3 2025
Jun 19 2025
Excellent discussion, thank you both.
Jun 17 2025
Hey Debra, thank you and welcome as well :) Sorry for the delay here, I wanted to get Virginia's input again to make sure we go in a sensible direction with this.
Jun 10 2025
Yep, I was able to run kinit, set my password and run it again. Thank you!
I was testing these credentials in the past couple of days. I can use Superset (so I'm in analytics-privatedata-users ) and log in to bastion and stat machines (so SSH is also fine).