Split from T306769:
Indexing GitLab data in Perceval does not index email addresses, probably because that's non-public PII which would require auth and specific permissions before being able to pull via the GitLab API.
So what we get in the DB is only a random username without any further info:
"1234567890abcdef1234567890abcdef12345678": {
"enrollments": [],
"identities": [
{
"email": null,
"id": "1234567890abcdef1234567890abcdef12345678",
"name": "SomeName",
"source": "gitlab",
"username": "someusername",
"uuid": "1234567890abcdef1234567890abcdef12345678"
}
]
}That means in contrast to Gerrit there is nothing that would allow identifying staff or non-volunteer GitLab accounts, as we only have a random username.
This will mean that our affiliation stats will become more incorrect.