Split from T306769:
Indexing GitLab data in Perceval does not index email addresses, probably because that's non-public PII which would require auth and specific permissions before being able to pull via the GitLab API.
So what we get in the DB is only a random username without any further info:
"1234567890abcdef1234567890abcdef12345678": { "enrollments": [], "identities": [ { "email": null, "id": "1234567890abcdef1234567890abcdef12345678", "name": "SomeName", "source": "gitlab", "username": "someusername", "uuid": "1234567890abcdef1234567890abcdef12345678" } ] }
That means in contrast to Gerrit there is nothing that would allow identifying staff or non-volunteer GitLab accounts, as we only have a random username.
This will mean that our affiliation stats will become more incorrect.