User Details
- User Since
- Mar 6 2020, 9:03 PM (223 w, 6 d)
- Availability
- Available
- IRC Nick
- Raymond_Ndibe
- LDAP User
- Raymond Ndibe
- MediaWiki User
- Raymond Ndibe [ Global Accounts ]
Yesterday
Wed, Jun 19
Hello @taavi , thanks for helping reverse this. This was a oversight on my path. I thought the cli changes has already been deployed
Tue, Jun 18
yes docs. Will add that
Thu, Jun 6
Tue, Jun 4
Mon, Jun 3
Thu, May 30
also I am working on unifying the parameters
then we rename this task to move jobs load logic to the jobs-api? that way it will still be valid for component-api? because we need to resolve this or else we can't say we've conclusively resolved the parent task https://phabricator.wikimedia.org/T364204
Wed, May 29
Tue, May 28
Mon, May 27
made some attempt to define somethings and answer some important questions on the task description, based on our discussion @dcaro . Input and possible modifications are welcome
Thanks @bd808
Wed, May 22
this is done right @dcaro? we should mark it as resolved if so
May 21 2024
May 16 2024
I have a thing against reuse-from. It is not immediately clear what it means by just looking at it. depends-on is a more descriptive name if I understand the supposed meaning of reuse-from correctly. In contrast reuse-from sounds like we are somehow reusing the configuration of a particular component in another. Obviously this is not already set in stone but it's important to point it out
May 15 2024
May 13 2024
one of the resources we need to be aware of here https://wikitech.wikimedia.org/wiki/Portal:Toolforge/Admin/Kubernetes/Upgrading_Kubernetes
Can we link any resources we already have (automations, cookbooks, instructions, etc) on how we handle k8s upgrade here too? k8s upgrade is easy on paper but I assume it'll probably be more hairy for our particular implementation. Btw for the decision request I'm going with Option 2
Apr 16 2024
marking as resolved. We can open it again if anyone disagrees
Apr 10 2024
Apr 9 2024
I think we can mark this as resolved now @taavi
Apr 8 2024
Apr 4 2024
we now have a --health-check-script argument that allows you to provide a custom health script that kubernetes uses to decide when your workload becomes unhealthy so it can be restarted. This only works for continuous jobs
how will this affect the current 3 continuous jobs limit? does 2 replicas of a continuous job count as 1 or 2 when considering limits?
Apr 2 2024
Mar 21 2024
Mar 5 2024
@dcaro do you have any idea on how to reproduce this issue?
Mar 4 2024
Feb 29 2024
Feb 27 2024
I think I need help reproducing this. So far multiple runs on tools and toolsbeta failed to trigger this error. Is it likely that this is a combination of more than 1 factor, maybe a result of multiple users trying to upload at the same time?