Page MenuHomePhabricator

sguebo_WMF (Samuel Guebo)
Sr. Privacy Engineer (Security Team)

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Sunday

  • Clear sailing ahead.

User Details

User Since
Aug 10 2018, 4:17 PM (242 w, 5 h)
Availability
Available
LDAP User
Unknown
MediaWiki User
Samuel (WMF) [ Global Accounts ]

Recent Activity

Yesterday

sguebo_WMF added a comment to T322751: Design: Design how to present all PII information collected from participants to organizers (on desktop and mobile).

My recommendation from a data privacy perspective is to show aggregated data only and keep the PII in the back end for 90 days, during which time participants can update their answers, after which time we anonymize the PII data and keep only the aggregated data.
As far as aggregated data, I recommend reporting out when we have more than x persons in a [sub]category; below that we could either not report out or report, for example a compilation "other <x"

Thu, Mar 30, 7:15 PM · Campaign-Tools, CampaignEvents, Campaign-Registration

Tue, Mar 28

sguebo_WMF added a comment to T322751: Design: Design how to present all PII information collected from participants to organizers (on desktop and mobile).

Usually, with PII data on persons we set a minimum for calculating averages so the data cannot be disambiguated and persons identified.
Essentially, such data should not be disaggregated at small numbers. There may be a standard at 10, that is usually 20 though for power to detect differences. So, for example, if you have <10 persons in a [sub] category then you don't report out on that [sub] category.
Security and GDI teams may be able to provide additional insights and feedback.

Hey @Iflorez, how will that work if event organizers are able to view data at an individual level anyway (F35861130)? Will the detailed view of the Participants tab be unavailable once data is aggregated, after the 90-day window?

Tue, Mar 28, 12:33 PM · Campaign-Tools, CampaignEvents, Campaign-Registration

Tue, Mar 7

sguebo_WMF added a comment to T296847: Third-party resources policy.

@sguebo_WMF given that the TPR policy is being proposed to be incorporated by reference into the terms of use, i think there is a desire for there to at least be a public draft, if not the final policy, prior to the comment period for the terms of use ammendments closing.

Tue, Mar 7, 12:23 PM · SecTeam-Processed, Privacy Engineering, tech-decision-forum
sguebo_WMF added a comment to T273220: Deploy StopForumSpam extension to production.

Thanks for the ping @sbassett. We could borrow some ideas from the generic message currently displayed when logged in users visit external links, and a privacy notice(T65598#6914486) which was provided by WMF-Legal. Privacy best practices encourage both brevity and clarity of notices. So, a more privacy-conscious message could be something along these lines:

Tue, Mar 7, 10:44 AM · Privacy Engineering, MW-1.40-notes (1.40.0-wmf.17; 2023-01-02), Security-Team, user-sbassett, User-notice, Wikimedia-Extension-setup, MediaWiki-extensions-StopForumSpam
sguebo_WMF moved T273220: Deploy StopForumSpam extension to production from Incoming to Completed on the Privacy Engineering board.
Tue, Mar 7, 10:44 AM · Privacy Engineering, MW-1.40-notes (1.40.0-wmf.17; 2023-01-02), Security-Team, user-sbassett, User-notice, Wikimedia-Extension-setup, MediaWiki-extensions-StopForumSpam

Mon, Mar 6

sguebo_WMF added a comment to T296847: Third-party resources policy.

Hey everyone, I agree that having the security team review every single Gadgets and User script would not be scalable or even realistic.

Mon, Mar 6, 11:43 AM · SecTeam-Processed, Privacy Engineering, tech-decision-forum
sguebo_WMF moved T324989: Application Security Review Request : OurWorldInData from Incoming to In Progress on the Privacy Engineering board.
Mon, Mar 6, 10:01 AM · Privacy Engineering, secscrum, Security, Application Security Reviews
sguebo_WMF added a comment to T324989: Application Security Review Request : OurWorldInData.

Tagging Privacy Engineering for an opinion/risk rating about the following. I'm not certain there's precedent for this on Wikimedia production or that wmcs would completely satisfy any privacy concerns for proposed, embedded content like this.

Mon, Mar 6, 9:45 AM · Privacy Engineering, secscrum, Security, Application Security Reviews

Fri, Mar 3

sguebo_WMF added a comment to T324989: Application Security Review Request : OurWorldInData.

Hello @Skizzerz, is there a publicly accessible repository for the source of https://owidm.wmcloud.org?

Fri, Mar 3, 1:22 PM · Privacy Engineering, secscrum, Security, Application Security Reviews

Feb 10 2023

sguebo_WMF updated sguebo_WMF.
Feb 10 2023, 12:07 PM

Nov 28 2022

sguebo_WMF added a comment to T312898: NDA tool should treat `~` as a symbol, not a non-Latin character.

@jrbs was added as a maintainer to the NDA bot, see (https://toolsadmin.wikimedia.org/tools/id/tsbot). Also, the code was moved the Wikimedia's Gitlab instance: https://gitlab.wikimedia.org/repos/security/tsbot-nda

Nov 28 2022, 12:30 PM · Trust-and-Safety
sguebo_WMF closed T312903: Move github.com/samuelguebo/tsbot-nda to Wikimedia code hosting as Resolved.
Nov 28 2022, 12:27 PM · Trust and Safety Tools Team Backlog
sguebo_WMF added a comment to T312903: Move github.com/samuelguebo/tsbot-nda to Wikimedia code hosting.

The repository was imported to Wikimedia's Gitlab instance: https://gitlab.wikimedia.org/repos/security/tsbot-nda

Nov 28 2022, 12:26 PM · Trust and Safety Tools Team Backlog

Nov 24 2022

sguebo_WMF moved T65598: Privacy issues with Gadget-GoogleTrans.js (calls out to google APIs) from Waiting to Watching on the Privacy Engineering board.
Nov 24 2022, 8:47 AM · Security, Privacy Engineering, WMF-General-or-Unknown, Privacy
sguebo_WMF moved T259421: WordPress blogs load (unused) Twemoji.js which uses third-party service from Waiting to Watching on the Privacy Engineering board.
Nov 24 2022, 8:47 AM · Diff-blog, Privacy, Privacy Engineering, Technical Blog, wikimediafoundation.org
sguebo_WMF moved T65598: Privacy issues with Gadget-GoogleTrans.js (calls out to google APIs) from In Progress to Waiting on the Privacy Engineering board.
Nov 24 2022, 8:46 AM · Security, Privacy Engineering, WMF-General-or-Unknown, Privacy
sguebo_WMF moved T322218: Create API to allow retrieving detailed information about Thanks log items for my own user account from Incoming to Completed on the Privacy Engineering board.
Nov 24 2022, 6:52 AM · Growth-Team, Privacy Engineering, Thanks, Growth-Positive-Reinforcement, GrowthExperiments-ImpactModule
sguebo_WMF added a comment to T322218: Create API to allow retrieving detailed information about Thanks log items for my own user account.

I examined the proposed API through common privacy risk categories:

Nov 24 2022, 6:48 AM · Growth-Team, Privacy Engineering, Thanks, Growth-Positive-Reinforcement, GrowthExperiments-ImpactModule

Nov 8 2022

sguebo_WMF added a comment to T320641: Security/Privacy Review For P&E Dashboard Integration with Campaign Registration System.

Hey @ldelench_wmf, I have no objections to closing this one, thanks.

Nov 8 2022, 4:49 PM · Campaign-Tools, Campaign-Registration, Privacy Engineering

Nov 1 2022

sguebo_WMF added a comment to T296847: Third-party resources policy.

Hello, some quick updates.

Nov 1 2022, 1:29 PM · SecTeam-Processed, Privacy Engineering, tech-decision-forum

Oct 25 2022

sguebo_WMF added a comment to T320641: Security/Privacy Review For P&E Dashboard Integration with Campaign Registration System.

@ifried, the Security-Team hasn't gotten the chance to discuss the mitigating options surfaced in the Google Docs conversation. Meanwhile, I would like to keep the ticket open and update it once we've made some progress.

Oct 25 2022, 6:49 PM · Campaign-Tools, Campaign-Registration, Privacy Engineering

Oct 20 2022

sguebo_WMF moved T320641: Security/Privacy Review For P&E Dashboard Integration with Campaign Registration System from Backlog to Waiting on the Privacy Engineering board.
Oct 20 2022, 4:35 PM · Campaign-Tools, Campaign-Registration, Privacy Engineering
sguebo_WMF added a comment to T320641: Security/Privacy Review For P&E Dashboard Integration with Campaign Registration System.

Hey @ifried, the Privacy Engineering review is complete. Could you take a look at our conclusions and address any potential misunderstanding there? https://docs.google.com/document/d/1lFeq7jtUCmXdwoKwIfqgO-74ccTU0kBtX7zkJkeMByw/edit#?

Oct 20 2022, 4:07 PM · Campaign-Tools, Campaign-Registration, Privacy Engineering

Oct 14 2022

sguebo_WMF added a comment to T320641: Security/Privacy Review For P&E Dashboard Integration with Campaign Registration System.

Hello @ifried, Privacy Engineering will start looking into this as part of our current sprint. On a side note, I am aware that the previous features have been looked at by WMF-Legal. For this additional feature, are you having any conversation with Legal in parallel?

Oct 14 2022, 7:54 PM · Campaign-Tools, Campaign-Registration, Privacy Engineering

Oct 12 2022

sguebo_WMF moved T320641: Security/Privacy Review For P&E Dashboard Integration with Campaign Registration System from Incoming to Backlog on the Privacy Engineering board.
Oct 12 2022, 4:00 PM · Campaign-Tools, Campaign-Registration, Privacy Engineering

Aug 22 2022

sguebo_WMF awarded Blog Post: Creating a pentesting process a Like token.
Aug 22 2022, 12:42 PM · Security Team AppSec, Security

Jul 12 2022

sguebo_WMF added a comment to T312903: Move github.com/samuelguebo/tsbot-nda to Wikimedia code hosting.

@Aklapper, sure thing. It used to be a private repo which I was the sole maintainer when I was still in Trust-and-Safety. Moving it to GitLab makes sense but I am not sure which project would be suited for it. I don't currently see anything related to Trust & Safety. Any suggestion?

Jul 12 2022, 11:47 PM · Trust and Safety Tools Team Backlog
sguebo_WMF updated subscribers of T312903: Move github.com/samuelguebo/tsbot-nda to Wikimedia code hosting.
Jul 12 2022, 11:33 PM · Trust and Safety Tools Team Backlog
sguebo_WMF added a comment to T312898: NDA tool should treat `~` as a symbol, not a non-Latin character.

Merged and deployed lemme know if that fixed it - sorry didn't get the chance to test it live as I no longer have a TS account. Let me know if you have a tool account and I add you as a deployer. Once the PR is merged deploying is usually just ssh'ing and running a shell script that does all the git and kubernetes steps.

Jul 12 2022, 11:27 PM · Trust-and-Safety
sguebo_WMF added a comment to T312896: NDA tool does not respect the new "formerly" parameter.

Hey @jrbs, I think the issue might linger somewhere between these lines: https://github.com/samuelguebo/tsbot-nda/blob/6c05f844039d9713f3328d1e59c34aa29c80fa3b/routes/nda.py#L233-L284

Jul 12 2022, 9:09 PM · Trust-and-Safety

Mar 8 2022

sguebo_WMF changed the visibility for T65598: Privacy issues with Gadget-GoogleTrans.js (calls out to google APIs).
Mar 8 2022, 2:28 PM · Security, Privacy Engineering, WMF-General-or-Unknown, Privacy
sguebo_WMF added a comment to T65598: Privacy issues with Gadget-GoogleTrans.js (calls out to google APIs).

Thank you both.

Mar 8 2022, 2:27 PM · Security, Privacy Engineering, WMF-General-or-Unknown, Privacy

Mar 7 2022

sguebo_WMF added a project to T296847: Third-party resources policy: Security-Team.
Mar 7 2022, 12:54 PM · SecTeam-Processed, Privacy Engineering, tech-decision-forum
sguebo_WMF added a comment to T65598: Privacy issues with Gadget-GoogleTrans.js (calls out to google APIs).

Hey @sbassett and @JFishback_WMF , do you have any strong objections to making this task public? Its content may inform the ongoing discussion around third-party resources in T296847.

Mar 7 2022, 12:01 PM · Security, Privacy Engineering, WMF-General-or-Unknown, Privacy

Mar 4 2022

sguebo_WMF added a comment to T296847: Third-party resources policy.

I'm confused by this problem statement. The Privacy Policy already forbids anything on Wikimedia projects that causes the UA to contact any third-party website, including Toolforge and WMCS, for any purpose (any HTTP header and the client IP are covered by the definition of "Personal information"). So regardless of whether it's executable code, an image, a webfont, JSON/JSONP data, etc. it is currently bright-line forbidden. What, then, would this "clear Wikimedia policy on the use of third-party resources" cover?

Mar 4 2022, 6:50 PM · SecTeam-Processed, Privacy Engineering, tech-decision-forum
sguebo_WMF added a comment to T296847: Third-party resources policy.

We had an RFC open about this for a couple of years, which has some analysis and discussion of legitimate use cases and UX for opt-in: T208188: RFC: Partial opt-out method for Content security policy

Mar 4 2022, 6:08 PM · SecTeam-Processed, Privacy Engineering, tech-decision-forum
sguebo_WMF added a comment to T296847: Third-party resources policy.

My current plan for rollout is as follows:

  • Informal feedback round (in progress)
  • Update policy based on feedback.
  • New more formal feedback round from WMF staff e.g. please respond by X date
  • Update policy based on feedback.
  • A formal round of feedback from the community.
  • We'll update the interface to provide a notice on pages where JS can be added that links to the policy:
<div class="mw-message-box mw-message-box-notice">All code written here is expected to <a href="#">adhere to the gadget policy</a>.</div>

Screen Shot 2022-02-16 at 11.07.18 AM.png (1×2 px, 461 KB)

Mar 4 2022, 6:03 PM · SecTeam-Processed, Privacy Engineering, tech-decision-forum

Feb 16 2022

sguebo_WMF added a comment to T296847: Third-party resources policy.

Following T262493#7584789 I've begun drafting a policy and collating feedback on the talk page:
https://www.mediawiki.org/wiki/User:Jdlrobson/Extension:Gadget/Policy

Perhaps we could combine efforts here?

Feb 16 2022, 3:19 PM · SecTeam-Processed, Privacy Engineering, tech-decision-forum
sguebo_WMF triaged T296847: Third-party resources policy as Medium priority.
Feb 16 2022, 1:05 PM · SecTeam-Processed, Privacy Engineering, tech-decision-forum
sguebo_WMF moved T296847: Third-party resources policy from Backlog to In Progress on the Privacy Engineering board.
Feb 16 2022, 1:05 PM · SecTeam-Processed, Privacy Engineering, tech-decision-forum

Feb 15 2022

sguebo_WMF moved T290493: Cross Origin Resource Sharing Misconfiguration | Lead to sensitive information. in "diff.wikimedia.org" from Waiting to Completed on the Privacy Engineering board.
Feb 15 2022, 6:32 PM · Privacy Engineering, Privacy, Diff-blog, SecTeam-Processed, Security
sguebo_WMF added a comment to T290493: Cross Origin Resource Sharing Misconfiguration | Lead to sensitive information. in "diff.wikimedia.org".

As noted above, WordPress-powered websites such as Diff are used by the Foundation for public-facing initiatives. For instance, blog posts published on Diff feature names of their authors, and in most cases their titles within the organization. Although, the REST API allows people to retrieve the list of user accounts of the website, it generates list of already-public users, in JSON format that'll need to be parsed/processed. Therefore, the API is not disclosing any information that was not already private, nor is it increasing the visibility of information that was already public by making it easier to retrieve.

Feb 15 2022, 6:32 PM · Privacy Engineering, Privacy, Diff-blog, SecTeam-Processed, Security

Dec 8 2021

sguebo_WMF moved T292594: It is possible to construct a list of all globally suppressed users from In Progress to Completed on the Privacy Engineering board.
Dec 8 2021, 5:56 PM · Security-Team, SecTeam-Processed, cloud-services-team (Kanban), Privacy, Privacy Engineering, Stewards-and-global-tools, Data-Services, Vuln-Infoleak, Security

Dec 6 2021

sguebo_WMF moved T296847: Third-party resources policy from Incoming to Backlog on the Privacy Engineering board.
Dec 6 2021, 1:04 PM · SecTeam-Processed, Privacy Engineering, tech-decision-forum

Dec 2 2021

sguebo_WMF updated the task description for T296847: Third-party resources policy.
Dec 2 2021, 10:09 AM · SecTeam-Processed, Privacy Engineering, tech-decision-forum

Dec 1 2021

sguebo_WMF created T296847: Third-party resources policy.
Dec 1 2021, 2:17 PM · SecTeam-Processed, Privacy Engineering, tech-decision-forum

Oct 29 2021

sguebo_WMF moved T293583: [mwcli] reporting on usage from In Progress to Completed on the Privacy Engineering board.
Oct 29 2021, 12:26 PM · Patch-For-Review, Privacy Engineering, Privacy, mwcli
sguebo_WMF added a comment to T293583: [mwcli] reporting on usage.

Hey @Addshore, on the behalf of Security-Team, I reviewed the privacy risks inherent to the proposed usage reporting feature for mwcli. I’ll share my conclusions below.

Oct 29 2021, 12:23 PM · Patch-For-Review, Privacy Engineering, Privacy, mwcli

Oct 26 2021

sguebo_WMF added a comment to T284944: Increased visibility in wiki-replicas for volunteers fighting vandals.

@sguebo_WMF Is this data visible on the wikis?

Oct 26 2021, 6:46 PM · cloud-services-team, Data-Engineering, Privacy Engineering, Data-Services
sguebo_WMF moved T293583: [mwcli] reporting on usage from Backlog to In Progress on the Privacy Engineering board.
Oct 26 2021, 3:15 PM · Patch-For-Review, Privacy Engineering, Privacy, mwcli
sguebo_WMF added a comment to T293583: [mwcli] reporting on usage.

For example after a user has gone through a whole setup and played around with the environment a bit we might get something like this (at the highest detail level planned) from the next time the mwcli attempts to send data back.

  • 4x docker mediawiki create
  • 2x docker mysql create
  • 2x docker mediawiki install --dbtype=mysql --dbname=default
  • 2x docker mediawiki install --dbtype=sqlite1 --dbname=CUSTOM
  • 22x docker mediawiki exec
  • 4x codesearch search --output=ack
  • 1x codesearch search --output=table
Oct 26 2021, 3:15 PM · Patch-For-Review, Privacy Engineering, Privacy, mwcli

Oct 25 2021

sguebo_WMF awarded Blog Post: How we deploy code a Orange Medal token.
Oct 25 2021, 4:55 PM
sguebo_WMF updated subscribers of T293811: Clarify whether CUs should share non-public information with external services.
Oct 25 2021, 4:37 PM · Privacy Engineering
sguebo_WMF triaged T293583: [mwcli] reporting on usage as Medium priority.
Oct 25 2021, 4:08 PM · Patch-For-Review, Privacy Engineering, Privacy, mwcli
sguebo_WMF added a comment to T293583: [mwcli] reporting on usage.

Hey @Addshore -- thanks for bringing that to the Privacy Engineering team's attention. My understanding is that the metric tool would work as below:

Oct 25 2021, 4:07 PM · Patch-For-Review, Privacy Engineering, Privacy, mwcli
sguebo_WMF moved T293583: [mwcli] reporting on usage from Incoming to Backlog on the Privacy Engineering board.
Oct 25 2021, 3:51 PM · Patch-For-Review, Privacy Engineering, Privacy, mwcli
sguebo_WMF moved T293811: Clarify whether CUs should share non-public information with external services from Incoming to Waiting on the Privacy Engineering board.
Oct 25 2021, 3:51 PM · Privacy Engineering

Oct 21 2021

sguebo_WMF moved T292594: It is possible to construct a list of all globally suppressed users from Backlog to In Progress on the Privacy Engineering board.
Oct 21 2021, 5:14 PM · Security-Team, SecTeam-Processed, cloud-services-team (Kanban), Privacy, Privacy Engineering, Stewards-and-global-tools, Data-Services, Vuln-Infoleak, Security
sguebo_WMF added a comment to T292594: It is possible to construct a list of all globally suppressed users.
localuser:
  source:
    - localuser
    - globaluser
  view: >
    select lu_wiki, lu_name, lu_attached_timestamp, lu_attached_method, lu_local_id, lu_global_id
  where: lu_global_id = gu_id AND gu_hidden=''
Oct 21 2021, 4:28 PM · Security-Team, SecTeam-Processed, cloud-services-team (Kanban), Privacy, Privacy Engineering, Stewards-and-global-tools, Data-Services, Vuln-Infoleak, Security

Oct 20 2021

sguebo_WMF added a comment to T293811: Clarify whether CUs should share non-public information with external services.

Noted, thanks for the additional context @GeneralNotability. The description was updated accordingly.

Oct 20 2021, 5:14 PM · Privacy Engineering
sguebo_WMF updated the task description for T293811: Clarify whether CUs should share non-public information with external services.
Oct 20 2021, 5:14 PM · Privacy Engineering

Oct 19 2021

sguebo_WMF updated subscribers of T292594: It is possible to construct a list of all globally suppressed users.
Oct 19 2021, 4:47 PM · Security-Team, SecTeam-Processed, cloud-services-team (Kanban), Privacy, Privacy Engineering, Stewards-and-global-tools, Data-Services, Vuln-Infoleak, Security
sguebo_WMF updated subscribers of T293811: Clarify whether CUs should share non-public information with external services.

Hey @GeneralNotability and @Urbanecm. As mentioned earlier, your question will be brought to WMF-Legal's attention. Feel free to rename it or adjust the description if I missed some aspects.

Oct 19 2021, 4:35 PM · Privacy Engineering
sguebo_WMF created T293811: Clarify whether CUs should share non-public information with external services.
Oct 19 2021, 4:32 PM · Privacy Engineering

Oct 18 2021

sguebo_WMF moved T292594: It is possible to construct a list of all globally suppressed users from Incoming to Backlog on the Privacy Engineering board.
Oct 18 2021, 3:43 PM · Security-Team, SecTeam-Processed, cloud-services-team (Kanban), Privacy, Privacy Engineering, Stewards-and-global-tools, Data-Services, Vuln-Infoleak, Security
sguebo_WMF moved T293379: [[:w:en:User:Firefly/checkuseragenthelper.js]] sends CU user-agents to a third party from Incoming to Backlog on the Privacy Engineering board.
Oct 18 2021, 3:43 PM · Security-Team, User-Urbanecm, SecTeam-Processed, Trust-and-Safety, Privacy, Privacy Engineering, Security

Oct 8 2021

sguebo_WMF added a comment to T65598: Privacy issues with Gadget-GoogleTrans.js (calls out to google APIs).

Something more robust would be needed if this is to become any sort of wikimedia, or WMF-wide standard. A longer-term fix would be to integrate such indications in to the software, but development on Gadgets 2.0 has been stalled for a LONG time.

That being said, I don't think "PRIVACY" is very useful there alone - from a UX perspective that seems confusing, did you notice there is already a lengthy hover text on the "E"xternal indicator? It looks like this:

image.png (61×639 px, 5 KB)

Also keep in mind, there are actually much larger risks then "privacy" when loading third party scripts, such as account hijacking - surreptitious action making, etc.

Oct 8 2021, 2:56 PM · Security, Privacy Engineering, WMF-General-or-Unknown, Privacy

Oct 1 2021

sguebo_WMF moved T290099: Create a "delete me" maintenance script for special user/data deletion requests from In Progress to Completed on the Privacy Engineering board.
Oct 1 2021, 4:10 PM · affects-Miraheze, MW-1.38-notes (1.38.0-wmf.2; 2021-09-28), Security-Team, Privacy Engineering, Privacy, Security

Sep 20 2021

sguebo_WMF added a comment to T291094: Add "Samuel (WMF)" account to Security Team group in gitlab.wikimedia.org.

Thanks for handling that, @thcipriani

Sep 20 2021, 3:33 PM · SecTeam-Processed, Privacy Engineering, Release-Engineering-Team (Doing), Security-Team, GitLab

Sep 17 2021

sguebo_WMF moved T289952: Request: expose database tables of the Translate extension to users in replicas on Toolforge (Wikidata, or all Wikis) from In Progress to Completed on the Privacy Engineering board.
Sep 17 2021, 8:17 PM · Language-Team (Language-2021-October-December), Privacy Engineering, SecTeam-Processed, cloud-services-team (Kanban), Data-Services
sguebo_WMF added a comment to T289952: Request: expose database tables of the Translate extension to users in replicas on Toolforge (Wikidata, or all Wikis).

On the behalf of Security-Team, I reviewed the privacy risks that exposing Translation extension’s tables in replicas may bring about. I’ll share my conclusions below.

Sep 17 2021, 8:17 PM · Language-Team (Language-2021-October-December), Privacy Engineering, SecTeam-Processed, cloud-services-team (Kanban), Data-Services
sguebo_WMF added a comment to T289952: Request: expose database tables of the Translate extension to users in replicas on Toolforge (Wikidata, or all Wikis).

Hey @Nikerabbit and thanks for providing some background on these tables.

You mentioned that translate_messageindex should not be exposed because it contains some internal tracking information. I pulled up an excerpt from that table but I think I would need some help in understanding where you see an issue.

wikiadmin@10.64.16.207(wikidatawiki)> select * from translate_messageindex limit 1\G

*************************** 1. row ***************************
tmi_key: <redacted integer>:help:About_data/1
tmi_value: page-Help:About data|agg-Help

Could you please explain to me why you think this index information is concerning?
That may help me better grasp any security or privacy risk inherent to this specific table.

Sep 17 2021, 8:14 PM · Language-Team (Language-2021-October-December), Privacy Engineering, SecTeam-Processed, cloud-services-team (Kanban), Data-Services

Sep 16 2021

sguebo_WMF moved T291094: Add "Samuel (WMF)" account to Security Team group in gitlab.wikimedia.org from Incoming to Waiting on the Privacy Engineering board.
Sep 16 2021, 3:01 PM · SecTeam-Processed, Privacy Engineering, Release-Engineering-Team (Doing), Security-Team, GitLab
sguebo_WMF added a project to T291094: Add "Samuel (WMF)" account to Security Team group in gitlab.wikimedia.org: Privacy Engineering.
Sep 16 2021, 3:01 PM · SecTeam-Processed, Privacy Engineering, Release-Engineering-Team (Doing), Security-Team, GitLab
sguebo_WMF added a comment to T289952: Request: expose database tables of the Translate extension to users in replicas on Toolforge (Wikidata, or all Wikis).

Hey @Nikerabbit and thanks for providing some background on these tables.

Sep 16 2021, 2:56 PM · Language-Team (Language-2021-October-December), Privacy Engineering, SecTeam-Processed, cloud-services-team (Kanban), Data-Services
sguebo_WMF moved T290099: Create a "delete me" maintenance script for special user/data deletion requests from Backlog to In Progress on the Privacy Engineering board.
Sep 16 2021, 2:18 PM · affects-Miraheze, MW-1.38-notes (1.38.0-wmf.2; 2021-09-28), Security-Team, Privacy Engineering, Privacy, Security
sguebo_WMF moved T289952: Request: expose database tables of the Translate extension to users in replicas on Toolforge (Wikidata, or all Wikis) from Backlog to In Progress on the Privacy Engineering board.
Sep 16 2021, 2:18 PM · Language-Team (Language-2021-October-December), Privacy Engineering, SecTeam-Processed, cloud-services-team (Kanban), Data-Services

Sep 15 2021

sguebo_WMF updated subscribers of T65598: Privacy issues with Gadget-GoogleTrans.js (calls out to google APIs).
Sep 15 2021, 7:20 PM · Security, Privacy Engineering, WMF-General-or-Unknown, Privacy
sguebo_WMF created T291094: Add "Samuel (WMF)" account to Security Team group in gitlab.wikimedia.org.
Sep 15 2021, 4:20 PM · SecTeam-Processed, Privacy Engineering, Release-Engineering-Team (Doing), Security-Team, GitLab

Sep 13 2021

sguebo_WMF awarded T290099: Create a "delete me" maintenance script for special user/data deletion requests a Love token.
Sep 13 2021, 5:06 PM · affects-Miraheze, MW-1.38-notes (1.38.0-wmf.2; 2021-09-28), Security-Team, Privacy Engineering, Privacy, Security

Sep 10 2021

sguebo_WMF moved T195578: Deploy access to performance_schema/sys for the administrative mediawiki account (mediawiki deployers) from Backlog to Completed on the Privacy Engineering board.
Sep 10 2021, 6:36 PM · WMF-Legal, Privacy Engineering, Security, SecTeam Discussion, Performance Issue, DBA

Sep 9 2021

sguebo_WMF added a comment to T195578: Deploy access to performance_schema/sys for the administrative mediawiki account (mediawiki deployers).

Thank you for your answer. I have now wrapped the privacy review and would like to share the conclusion. The analysis focused on the sys database of db2083 server, which as of writing, contains 88 tables, encompassing performance statistics.

Sep 9 2021, 4:30 PM · WMF-Legal, Privacy Engineering, Security, SecTeam Discussion, Performance Issue, DBA
sguebo_WMF added a comment to T195578: Deploy access to performance_schema/sys for the administrative mediawiki account (mediawiki deployers).

@sguebo_WMF thanks a lot for starting to take a look into this.
sys schema has lots of view (this is the documentation about them: https://dev.mysql.com/doc/refman/5.7/en/sys-schema-views.html) to be honest, I am not fully sure which ones could leak private information.

If this is helpful in trying to identify what can be potentially private, this is an output of one row per table, in case something rings a bell and you want to explore the table further:

Thanks for pasting the output of sys’ tables, @Marostegui. That’s really helpful. As I am wrapping up my analysis, I’d like to ask a quick question just to make sure that I understand things correctly. The host_summary table seems to contain IP addresses in the 10.192.** range. Judging by the range, my understanding is that these IPs are from Wikimedia load balancer and not from the end user. Is my understanding accurate?

Sep 9 2021, 1:40 PM · WMF-Legal, Privacy Engineering, Security, SecTeam Discussion, Performance Issue, DBA

Sep 3 2021

sguebo_WMF added a comment to T279952: event.WikipediaPortal referer modification.

@sguebo_WMF & @EYener, we discussed this task and will go ahead and implement this feature.
However, as it is something not trivial, we won't be able to do it this quarter.
We're aiming to tackle it next quarter.

Sep 3 2021, 9:09 PM · Data-Engineering, Privacy Engineering, FR-Tech-Analytics
sguebo_WMF added a comment to T195578: Deploy access to performance_schema/sys for the administrative mediawiki account (mediawiki deployers).

That would be great. There is no real rush, I've just been reviewing blocked tasks that have been silent for a while.

Sep 3 2021, 8:45 PM · WMF-Legal, Privacy Engineering, Security, SecTeam Discussion, Performance Issue, DBA
sguebo_WMF added a comment to T290099: Create a "delete me" maintenance script for special user/data deletion requests.

Note: Miraheze has the RemovePII mediawiki extension for compliance with RTBF. There might be bits you can steal.

Interesting - https://github.com/miraheze/RemovePII does indeed look like it could maybe get us part of the way there.

Sep 3 2021, 7:37 PM · affects-Miraheze, MW-1.38-notes (1.38.0-wmf.2; 2021-09-28), Security-Team, Privacy Engineering, Privacy, Security

Aug 30 2021

sguebo_WMF awarded T279952: event.WikipediaPortal referer modification a Like token.
Aug 30 2021, 3:02 PM · Data-Engineering, Privacy Engineering, FR-Tech-Analytics

Aug 27 2021

sguebo_WMF added a comment to T279952: event.WikipediaPortal referer modification.

Hey @mforns, that confusion is totally understandable so not worries at all. The good thing is that I have now updated my title in Phabricator.

Aug 27 2021, 9:22 PM · Data-Engineering, Privacy Engineering, FR-Tech-Analytics
sguebo_WMF updated sguebo_WMF.
Aug 27 2021, 9:19 PM
sguebo_WMF updated sguebo_WMF.
Aug 27 2021, 9:18 PM
sguebo_WMF updated sguebo_WMF.
Aug 27 2021, 9:17 PM

Aug 24 2021

sguebo_WMF moved T279952: event.WikipediaPortal referer modification from Backlog to Completed on the Privacy Engineering board.
Aug 24 2021, 6:56 PM · Data-Engineering, Privacy Engineering, FR-Tech-Analytics
sguebo_WMF added a comment to T279952: event.WikipediaPortal referer modification.

A couple comments.

  1. The other day, talking with the team, we thought we Analytics could take this task, as sanitizing a full URL by applying a mask, could be useful to other data sets. This is something we could do, I created a task for it: T281144.
  1. On the other hand, I thought that, even if we purge the URL and only leave the hostname, that could still hold privacy sensitive information, that could indicate user's preferences or interests or specific situation. I think we should ask the Security/Privacy team about this. Pinging @JFishback_WMF, what do you think?
Aug 24 2021, 6:50 PM · Data-Engineering, Privacy Engineering, FR-Tech-Analytics

Aug 16 2021

sguebo_WMF added a comment to T284941: [S] Add note explaining that EXIF geolocation metadata may be uploaded with Commons images.

Hi @sguebo_WMF! Do you have suggestions about what language we should use to inform users that information about them will be collected via the EXIF data and made public? Or do you suggest we reach out to legal?

Aug 16 2021, 2:53 PM · QTE-TestingOverview, MW-1.38-notes (1.38.0-wmf.9; 2021-11-16), Commons, Structured-Data-Backlog (Current Work), cloud-services-team (Kanban), Privacy Engineering

Aug 6 2021

sguebo_WMF moved T284944: Increased visibility in wiki-replicas for volunteers fighting vandals from In Progress to Completed on the Privacy Engineering board.
Aug 6 2021, 5:56 PM · cloud-services-team, Data-Engineering, Privacy Engineering, Data-Services
sguebo_WMF added a parent task for T284944: Increased visibility in wiki-replicas for volunteers fighting vandals: Unknown Object (Task).
Aug 6 2021, 5:54 PM · cloud-services-team, Data-Engineering, Privacy Engineering, Data-Services
sguebo_WMF moved T284948: Raw IPs of logged-out users disclosed in wiki-replicas from In Progress to Watching on the Privacy Engineering board.
Aug 6 2021, 5:53 PM · cloud-services-team, Data-Engineering, Privacy Engineering, Data-Services
sguebo_WMF added a parent task for T284943: User genders publicly disclosed in wiki-replicas global_preferences and user_properties tables: Unknown Object (Task).
Aug 6 2021, 5:50 PM · Privacy Engineering, Data-Services, cloud-services-team (Kanban)
sguebo_WMF added a parent task for T284941: [S] Add note explaining that EXIF geolocation metadata may be uploaded with Commons images: Restricted Task.
Aug 6 2021, 5:49 PM · QTE-TestingOverview, MW-1.38-notes (1.38.0-wmf.9; 2021-11-16), Commons, Structured-Data-Backlog (Current Work), cloud-services-team (Kanban), Privacy Engineering
sguebo_WMF added a parent task for T284944: Increased visibility in wiki-replicas for volunteers fighting vandals: Restricted Task.
Aug 6 2021, 5:49 PM · cloud-services-team, Data-Engineering, Privacy Engineering, Data-Services