Figure out submodule updating in GitLab
Open, Needs TriagePublic
Actions

Assigned To

None

Authored By

	brennen
	Nov 19 2020, 9:46 PM

Description

Gerrit can update submodules in a containing project when the submodule's project changes. This is used, for example, by our workflow for deploying backports to MediaWiki release branches. It's also been mentioned as potentially useful for frontend tasks.

Does GitLab support anything similar, or will it need to be implemented as a job?

It seems to be at least supported as an action in the API.

Some relevant docs:

GitLab CI/CD: Using Git submodules with GitLab CI
API: Update existing submodule reference in repository
CI runners: GIT_SUBMODULE_STRATEGY: "The GIT_SUBMODULE_STRATEGY variable is used to control if / how Git submodules are included when fetching the code before a build. You can set them globally or per-job in the variables section."

Related Objects

Mentioned In: T292255: Where should design tokens live? Separate repository or monorepo with Design System?
Mentioned Here: T259832: mediawiki-vendor submodule doesn't get automatically bumped on release branches

Event Timeline

brennen created this task.Nov 19 2020, 9:46 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptNov 19 2020, 9:46 PM

brennen added a project: User-brennen.Nov 19 2020, 10:03 PM

brennen moved this task from INBOX to Development services on the Release-Engineering-Team board.

brennen edited projects, added Release-Engineering-Team (Development services); removed Release-Engineering-Team.

brennen edited projects, added Release-Engineering-Team-TODO (2020-10-01 to 2020-12-31 (Q2)); removed Release-Engineering-Team-TODO.Nov 19 2020, 10:05 PM

thcipriani edited projects, added Release-Engineering-Team-TODO; removed Release-Engineering-Team-TODO (2020-10-01 to 2020-12-31 (Q2)).Jan 23 2021, 12:19 AM

thcipriani moved this task from Should be empty (use Release-Engineering-Team) to Soon-ish on the Release-Engineering-Team-TODO board.

thcipriani removed a project: Release-Engineering-Team (Development services).Apr 20 2021, 1:22 AM

thcipriani edited projects, added Release-Engineering-Team (thcipriani-workboard-fiddling); removed Release-Engineering-Team-TODO.Apr 20 2021, 3:41 AM

thcipriani moved this task from thcipriani-workboard-fiddling to Seen (ARCHIVE) on the Release-Engineering-Team board.Apr 20 2021, 3:46 AM

thcipriani edited projects, added Release-Engineering-Team; removed Release-Engineering-Team (thcipriani-workboard-fiddling).

thcipriani edited projects, added Release-Engineering-Team (Seen); removed Release-Engineering-Team.Apr 20 2021, 3:23 PM

brennen moved this task from Inbox to Migration on the GitLab board.Jul 2 2021, 9:28 PM

The mediawiki/extensions and mediawiki/skins repositories with auto-updating submodules is great in absence of a real monorepo. And submodules in mediawiki/core work great for release branches too.

But honestly Gerrit's auto updating has been magical and hard to debug/understand when it doesn't work (e.g. T259832), so I hope whatever replaces it is more predictable and easier to debug. I remember back when Gerrit didn't support bumping mediawiki/extensions/VisualEditor because a VisualEditor/VisualEditor repo also existed, we had Jenkins do it as a post-merge job so there's some precedent in that.

In T268283#7265462, @Legoktm wrote:

The mediawiki/extensions and mediawiki/skins repositories with auto-updating submodules is great in absence of a real monorepo. And submodules in mediawiki/core work great for release branches too.

A real monorepo would be nice for this use-case and dramatically simplifies many other uses: Cut a new train? Make a new branch, delete non-prod extensions. New tarball release? Delete non-bundled extensions, zip it up, and ship it. Code search? → git grep.

A monorepo makes the code review ACL harder but a great many things more manageable. I'm not sure if I have the appetite for this can of worms, but: why did we go with the current model vs. a monorepo?

In T268283#7289743, @thcipriani wrote:

A monorepo makes the code review ACL harder but a great many things more manageable. I'm not sure if I have the appetite for this can of worms, but: why did we go with the current model vs. a monorepo?

We consciously chose against a mono-repo when moving into the world of git; the original code in CVS was a production mono-repo, and when we were on SVN we effectively had a mono-repo, though a lot of work was done to move code (e.g. the skins, or the Math or Cite extensions) into individual folders as logical (but not actual) repos. When 'we' (Chad/hashar/Roan) moved from SVN to git, we intentionally split out each repo individually for space/effectiveness reasons (devs should have to download multiple GiB of history to work on an extension, etc.).

We've since become a lot more strict about ACLs, not less, so any changes would have to be very carefully thought through, of course.

In addition to what James said, AIUI Git's support for very large monorepos was...not great back then, it's only recently that Microsoft/Facebook/Google have been pushing on it. But even then it still requires a significant amount of disk space, bandwidth, etc. if you just want to hack on one extension. Given that we have contributors who e.g. use Raspberry Pis, I think our split makes sense. We've optimized for new/casual contributors but for "power" users who want the convenience of a monorepo, we have the giant submodule repos. As much as I'd personally love a real monorepo, I don't think it's in our project/community's long-term interests. The submodule repos aren't perfect since you can't just make one big commit and fix the world but we could probably develop tooling to bridge the gap (it's on my list of free time things so we stop abusing LibUp for this...).

brennen moved this task from Migration to Backlog on the GitLab board.Sep 1 2021, 7:58 PM

brennen moved this task from Backlog to Inbox on the GitLab board.Sep 21 2021, 7:25 PM

brennen moved this task from Inbox to Project Migration on the GitLab board.Sep 21 2021, 7:57 PM

brennen edited projects, added GitLab (Project Migration); removed GitLab.

thcipriani edited projects, added Release-Engineering-Team (Done by Wed 06 Oct); removed Release-Engineering-Team (Seen).Sep 22 2021, 3:54 PM

Volker_E mentioned this in T292255: Where should design tokens live? Separate repository or monorepo with Design System?.Sep 30 2021, 8:59 PM

brennen moved this task from Done by Wed 06 Oct to Next on the Release-Engineering-Team board.Oct 19 2021, 7:27 PM

brennen edited projects, added Release-Engineering-Team (Next); removed Release-Engineering-Team (Done by Wed 06 Oct).

Addshore subscribed.Jan 15 2022, 2:30 PM

thcipriani edited projects, added Release-Engineering-Team (Priority Backlog 📥); removed Release-Engineering-Team (Next).Sep 7 2022, 3:46 PM

Addshore unsubscribed.Jun 27 2023, 12:35 PM

LSobanski subscribed.Nov 9 2023, 4:10 PM