Page MenuHomePhabricator

ACraze (accraze)
Sr. Software Engineer (Machine Learning)

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Friday

  • Clear sailing ahead.

User Details

User Since
Jun 17 2019, 4:51 PM (105 w, 1 d)
Availability
Available
IRC Nick
accraze
LDAP User
Unknown
MediaWiki User
ACraze (WMF) [ Global Accounts ]

Recent Activity

Mon, Jun 14

ACraze committed rMLISf7e7ee9ed81c: swap outlink transformer base image to wmf python3 (authored by ACraze).
swap outlink transformer base image to wmf python3
Mon, Jun 14, 7:39 PM

Fri, Jun 11

ACraze committed rMLIS6c1d71df8abe: swap outlink base image to wmf bullseye (authored by ACraze).
swap outlink base image to wmf bullseye
Fri, Jun 11, 4:10 PM

Wed, Jun 9

ACraze closed T284115: Move inference services to v1beta1 api as Resolved.

Alright, looks like all our current specs have been rewritten to reflect the v1beta1 api and have been confirmed to run well on KFServing 0.5.X.
All new services should use this as well.

Wed, Jun 9, 7:18 PM · Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze closed T284115: Move inference services to v1beta1 api, a subtask of T272917: Lift Wing proof of concept, as Resolved.
Wed, Jun 9, 7:18 PM · Lift-Wing, Machine-Learning-Team (Active Tasks)
ACraze closed T283526: Create generic revscoring inference service as Resolved.

Ok great work on @kevinbazira, closing this task. I have captured our next steps in the following tasks:

Wed, Jun 9, 7:15 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze closed T283526: Create generic revscoring inference service, a subtask of T272874: Prepare 4 ORES English models for Lift Wing, as Resolved.
Wed, Jun 9, 7:15 PM · Lift-Wing, Machine-Learning-Team (Active Tasks)
ACraze created T284689: Create migration plan for editquality models from ORES to Lift Wing.
Wed, Jun 9, 7:12 PM · artificial-intelligence, Machine-Learning-Team, editquality-modeling, revscoring, Lift-Wing
ACraze moved T284678: Create a KFServing model server for articlequality models from Non-Project Work to Project: ML Models on the Machine-Learning-Team (Active Tasks) board.
Wed, Jun 9, 5:08 PM · artificial-intelligence, Machine-Learning-Team (Active Tasks), articlequality-modeling, revscoring, Lift-Wing
ACraze edited projects for T284678: Create a KFServing model server for articlequality models, added: Machine-Learning-Team (Active Tasks); removed Machine-Learning-Team.
Wed, Jun 9, 5:00 PM · artificial-intelligence, Machine-Learning-Team (Active Tasks), articlequality-modeling, revscoring, Lift-Wing
ACraze created T284678: Create a KFServing model server for articlequality models.
Wed, Jun 9, 4:59 PM · artificial-intelligence, Machine-Learning-Team (Active Tasks), articlequality-modeling, revscoring, Lift-Wing
ACraze committed rMLISd7a218862462: migrate outlink-topic-model to v1beta1 (authored by ACraze).
migrate outlink-topic-model to v1beta1
Wed, Jun 9, 4:04 PM
ACraze committed rMLIS672f12713856: migrate enwiki-goodfaith to v1beta1 (authored by ACraze).
migrate enwiki-goodfaith to v1beta1
Wed, Jun 9, 4:04 PM
ACraze committed rMLIS9684ea99859e: migrate enwiki-damaging to v1beta1 (authored by ACraze).
migrate enwiki-damaging to v1beta1
Wed, Jun 9, 4:04 PM

Tue, Jun 8

ACraze added a comment to T283526: Create generic revscoring inference service.

@kevinbazira Just adding notes here after digging into all the different model classes (articlequality, drafttopic, etc..) today:

  1. As you mentioned earlier today, articlequality models only support up to Python3.7 and also use an older version of revscoring, which means we will need a separate model-server image to load those types of revscoring models.
  2. The editquality models (damaging/goodfaith/reverted/etc.) seem to run well with our current image, although we should think about loading them all into inference services to see if there are any issues like old revscoring dependencies.
  3. The drafttopic/articletopic models use additional word embeddings that we would either need to package inside a container, or inject via storage. Let's hold off on migrating these to KFServing for now, as there is talk about the language-agnostic Outlink topic model replacing these types of models.
Tue, Jun 8, 11:40 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze added a subtask for T272917: Lift Wing proof of concept: T284115: Move inference services to v1beta1 api.
Tue, Jun 8, 11:26 PM · Lift-Wing, Machine-Learning-Team (Active Tasks)
ACraze added a parent task for T284115: Move inference services to v1beta1 api: T272917: Lift Wing proof of concept.
Tue, Jun 8, 11:26 PM · Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze updated the task description for T284115: Move inference services to v1beta1 api.
Tue, Jun 8, 11:23 PM · Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze moved T284115: Move inference services to v1beta1 api from Non-Project Work to Project: Lift Wing on the Machine-Learning-Team (Active Tasks) board.
Tue, Jun 8, 5:50 PM · Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze claimed T284115: Move inference services to v1beta1 api.

After talking with @elukey, it seems the conversion webhook is not patched by the self-signed-ca.sh (see: T280661) script in KFServing v0.5.x. When the kfserving webhook automatically converts from v1alpha2 to v1beta1, the converted service is not patched with the custom CA. The issue is cleared up when the service is deployed using the v1beta1 and bypasses the conversion webhook.

Tue, Jun 8, 5:49 PM · Machine-Learning-Team (Active Tasks), Lift-Wing

Fri, Jun 4

ACraze committed rMLIS3f9a6604ec33: add enwiki damaging inference service config (authored by kevinbazira).
add enwiki damaging inference service config
Fri, Jun 4, 5:04 PM
ACraze committed rMLIS84376129b064: swap revscoring base image to wmf buster (authored by ACraze).
swap revscoring base image to wmf buster
Fri, Jun 4, 4:36 PM

Thu, Jun 3

ACraze added a comment to T279004: Production images for ORES/revscoring models.

I ran into some issues upgrading the revscoring inference service base image to bullseye (mostly since scipy & numpy have some issues with python3.9 still), so I went with the wmf buster image instead. Things seem to work well so far.

Thu, Jun 3, 6:49 PM · Patch-For-Review, Machine-Learning-Team (Active Tasks), Lift-Wing

Wed, Jun 2

ACraze added a comment to T283526: Create generic revscoring inference service.

@kevinbazira I've been thinking about how to structure our repo with the generic revscoring image and all the service config files. There are still some unknowns around deployment and following the SRE guidelines with respect to our inference services, so it is highly likely the codebase structure will change a bit in the near future.

Wed, Jun 2, 9:23 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze updated subscribers of T279004: Production images for ORES/revscoring models.

I was talking with @elukey today and he mentioned that we should begin using base images from the WMF docker registry where we can.
This means, the production version of our generic revscoring image should use the WMF Bullseye image instead of Ubuntu (if possible). I will do some testing today using the Bullseye image and will report back.

Wed, Jun 2, 9:04 PM · Patch-For-Review, Machine-Learning-Team (Active Tasks), Lift-Wing

Tue, Jun 1

ACraze triaged T284115: Move inference services to v1beta1 api as Low priority.
Tue, Jun 1, 11:20 PM · Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze created T284115: Move inference services to v1beta1 api.
Tue, Jun 1, 11:20 PM · Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze created T284091: Review egress rules for ml-serve cluster.
Tue, Jun 1, 4:44 PM · Lift-Wing, Machine-Learning-Team

Thu, May 27

ACraze added a comment to T283526: Create generic revscoring inference service.

I uploaded two more models to the public bucket for testing:

Thu, May 27, 10:59 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze added a comment to T283526: Create generic revscoring inference service.

I have been able to create two inference services for two models (enwiki.goodfaith and enwiki.damaging) using one generic container.

@kevinbazira this is very exciting! It seems like the generic revscoring image is working well. I have fixed the permissions issue on the gerrit repo and now you should be able to push your code up.

Thu, May 27, 5:21 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze committed rMLIS1a081233d5f7: move outlink model binary to external storage (authored by ACraze).
move outlink model binary to external storage
Thu, May 27, 4:05 PM
ACraze committed rMLIS598bd8264e19: create external transformer for outlink model (authored by ACraze).
create external transformer for outlink model
Thu, May 27, 4:05 PM

Wed, May 26

ACraze committed rMLIScf04e03aebb9: disable sidecar injection for outlink-topic-model (authored by ACraze).
disable sidecar injection for outlink-topic-model
Wed, May 26, 6:23 PM
ACraze added a comment to T283526: Create generic revscoring inference service.

@kevinbazira this is great news! glad to hear the generic container approach is working so far. I have uploaded the enwiki-damaging model to our public wmf-ml-models bucket, so you should be able to inject that model into a separate container now. Let me know if you run into any issues.

Wed, May 26, 6:13 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing

Tue, May 25

ACraze renamed T276862: Load outlinks topic model in to KFServing from Load outlinks topic model model in to KFServing to Load outlinks topic model in to KFServing.
Tue, May 25, 10:28 PM · Patch-For-Review, Machine-Learning-Team (Active Tasks), Lift-Wing

Mon, May 24

ACraze added a comment to T282802: Implement model storage for enwiki-goodfaith inference service.

Found a really helpful github issue today related to using STORAGE_URI with custom inference services:
https://github.com/kubeflow/kfserving/issues/1232

Mon, May 24, 10:15 PM · Machine-Learning-Team (Active Tasks), Patch-For-Review, artificial-intelligence, revscoring, Lift-Wing
ACraze added a comment to T272919: Install KFServing standalone.

It seems mostly related to model servers for various providers, but I have no idea if we need them now or not. Can you shed some light? :D

@elukey -- mostly echoing @Theofpa: I think all we need right now for the MVP is controller & agent from KFServing. The ORES models will be a custom image that we are still finishing and the other model we are working on is the Outlinks topic model, which is another custom image that runs a fastText model. We also might need to do the storage-init as well, but that depends on the outcome of T282802: Implement model storage for enwiki-goodfaith inference service

Mon, May 24, 9:59 PM · Patch-For-Review, Lift-Wing, Machine-Learning-Team (Active Tasks)
ACraze closed T279000: Load a revscoring model into KFServing as Resolved.

Going to mark this task as resolved, since we now have three members of the team running the enwiki-goodfaith model as a custom inference service.

Mon, May 24, 6:27 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze closed T279000: Load a revscoring model into KFServing, a subtask of T272874: Prepare 4 ORES English models for Lift Wing, as Resolved.
Mon, May 24, 6:27 PM · Lift-Wing, Machine-Learning-Team (Active Tasks)
ACraze added a subtask for T272874: Prepare 4 ORES English models for Lift Wing: T283526: Create generic revscoring inference service.
Mon, May 24, 6:25 PM · Lift-Wing, Machine-Learning-Team (Active Tasks)
ACraze added a parent task for T283526: Create generic revscoring inference service: T272874: Prepare 4 ORES English models for Lift Wing.
Mon, May 24, 6:25 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze created T283526: Create generic revscoring inference service.
Mon, May 24, 6:24 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing

May 21 2021

ACraze added a comment to T282802: Implement model storage for enwiki-goodfaith inference service.

Did some more digging into storage today. It seems that the V1alpha2CustomSpec does not support the storageUri field, like the other framework specs do
(i.e. V1alpha2SKLearnSpec)

May 21 2021, 11:09 PM · Machine-Learning-Team (Active Tasks), Patch-For-Review, artificial-intelligence, revscoring, Lift-Wing
ACraze updated subscribers of T279000: Load a revscoring model into KFServing.

Also confirming that @elukey was able to run a prediction with enwiki-goodfaith on his own minikube instance using some of our own images today (istio etc.).

May 21 2021, 7:19 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze added a comment to T279000: Load a revscoring model into KFServing.

@kevinbazira excellent work on this! Confirming I am able to use the updated infer.sh script to generate a new session cookie and retrieve a prediction. This is going to save us so much time while developing in the sandbox clusters. Thank you!!

May 21 2021, 7:16 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze edited P16152 infer-with-authservice.sh.
May 21 2021, 7:11 PM · Lift-Wing
ACraze edited P16152 infer-with-authservice.sh.
May 21 2021, 7:11 PM · Lift-Wing
ACraze created P16152 infer-with-authservice.sh.
May 21 2021, 7:10 PM · Lift-Wing
ACraze created P16151 authservice-session-generator.sh.
May 21 2021, 7:06 PM · Lift-Wing

May 20 2021

ACraze moved T282802: Implement model storage for enwiki-goodfaith inference service from Unorganized to Active Tasks on the Machine-Learning-Team board.
May 20 2021, 9:15 PM · Machine-Learning-Team (Active Tasks), Patch-For-Review, artificial-intelligence, revscoring, Lift-Wing
ACraze added a comment to T279000: Load a revscoring model into KFServing.

@kevinbazira awesome! i'm glad you were able to deploy the custom inference service on the sandbox cluster. In response to your thoughts:

May 20 2021, 6:27 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze created P16129 revscoring-input.json.
May 20 2021, 4:19 PM · revscoring, Lift-Wing
ACraze committed rMLISacbd45b83355: disable sidecar injection for enwiki-goodfaith (authored by ACraze).
disable sidecar injection for enwiki-goodfaith
May 20 2021, 4:00 PM

May 19 2021

ACraze added a comment to T276862: Load outlinks topic model in to KFServing.

Quick update: I've been doing some testing over the past couple of days and have noticed a timeout issue when testing high throughput loads (like 50-100 calls per second). I traced it down to when we are retrieving all the outlinks via mwapi.Session. After ~100 calls, the outlinks eventually get returned as None: https://github.com/wikimedia/machinelearning-liftwing-inference-services/blob/main/outlink-topic-model/model-server/model.py#L105

May 19 2021, 11:25 PM · Patch-For-Review, Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze updated the title for P16105 infer-enwiki-goodfaith.sh from enwiki-goodfaith prediction using curl to infer-enwiki-goodfaith.sh.
May 19 2021, 10:33 PM · Lift-Wing
ACraze added a comment to T279000: Load a revscoring model into KFServing.

@kevinbazira I reviewed your deployed inference service on the KFv1.1 sandbox. So far great progress :)

May 19 2021, 10:17 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze added a comment to P16105 infer-enwiki-goodfaith.sh.

This is how I am testing our custom KFServing inference services on the KFv1.1 sandbox cluster. The session cookie is required since we are using Dex/Istio authentication. Also you must disable Istio sidecar injection in the CRD yaml in order to reach the service.

May 19 2021, 7:33 PM · Lift-Wing
ACraze created P16105 infer-enwiki-goodfaith.sh.
May 19 2021, 7:32 PM · Lift-Wing
ACraze committed rMLIS7acca501e9d1: add threshold param to outlinks request (authored by ACraze).
add threshold param to outlinks request
May 19 2021, 4:34 PM
ACraze committed rMLIS4ed049381304: add unique name to enwiki-goodfaith load tests (authored by ACraze).
add unique name to enwiki-goodfaith load tests
May 19 2021, 4:34 PM
ACraze committed rMLIS33efc038aea9: add outlink load test job configs (authored by ACraze).
add outlink load test job configs
May 19 2021, 4:34 PM
ACraze committed rMLISbc616cdc4336: move name field to spec.predictor.custom.container (authored by ACraze).
move name field to spec.predictor.custom.container
May 19 2021, 4:34 PM

May 17 2021

ACraze closed T282797: Request membership in machinelearning group for ML team as Resolved.
May 17 2021, 10:36 PM · Gerrit-Privilege-Requests
ACraze added a comment to T282797: Request membership in machinelearning group for ML team.

@Legoktm yes I was able to fix this a bit earlier today, forgot to mark this as resolved. Also, can you point me towards some docs for setting up jenkins-bot CI??

May 17 2021, 10:07 PM · Gerrit-Privilege-Requests
ACraze committed rMLIS772ae96b3a76: Add license file (authored by ACraze).
Add license file
May 17 2021, 6:38 PM
ACraze committed rMLISaa1889e9a828: update predictor name for enwiki-goodfaith (authored by ACraze).
update predictor name for enwiki-goodfaith
May 17 2021, 6:38 PM
ACraze updated the task description for T282797: Request membership in machinelearning group for ML team.
May 17 2021, 4:54 PM · Gerrit-Privilege-Requests
ACraze added a comment to T282802: Implement model storage for enwiki-goodfaith inference service.

Awesome, thank you both! Next I'm going to wire up the storage_uri , but I'll need to setup a custom s3 endpoint. Do either of you know what the Thanos Swift url is? edit: nm I found it https://thanos-swift.discovery.wmnet

May 17 2021, 4:46 PM · Machine-Learning-Team (Active Tasks), Patch-For-Review, artificial-intelligence, revscoring, Lift-Wing
ACraze added a comment to T282752: Upgrade Istio & Knative on sandbox cluster.

Spun up a new VM running Kubeflow v1.3:

May 17 2021, 4:42 PM · Lift-Wing, Machine-Learning-Team

May 13 2021

ACraze edited P15960 model_upload.sh.
May 13 2021, 10:26 PM · Lift-Wing
ACraze edited P15960 model_upload.sh.
May 13 2021, 10:23 PM · Lift-Wing
ACraze updated subscribers of T282802: Implement model storage for enwiki-goodfaith inference service.

@elukey or @klausman: whenever you have time, can you please upload the enwiki-goodfaith model to Thanos Swift?
I think you two are the only ones with access to the credentials.

May 13 2021, 10:22 PM · Machine-Learning-Team (Active Tasks), Patch-For-Review, artificial-intelligence, revscoring, Lift-Wing
ACraze created P15960 model_upload.sh.
May 13 2021, 10:17 PM · Lift-Wing
ACraze updated the task description for T282802: Implement model storage for enwiki-goodfaith inference service.
May 13 2021, 9:05 PM · Machine-Learning-Team (Active Tasks), Patch-For-Review, artificial-intelligence, revscoring, Lift-Wing
ACraze updated the task description for T282802: Implement model storage for enwiki-goodfaith inference service.
May 13 2021, 5:50 PM · Machine-Learning-Team (Active Tasks), Patch-For-Review, artificial-intelligence, revscoring, Lift-Wing
ACraze added a subtask for T280025: Find a way to store models for Kubeflow: T282802: Implement model storage for enwiki-goodfaith inference service.
May 13 2021, 5:38 PM · Lift-Wing, Machine-Learning-Team (Active Tasks)
ACraze added a parent task for T282802: Implement model storage for enwiki-goodfaith inference service: T280025: Find a way to store models for Kubeflow.
May 13 2021, 5:38 PM · Machine-Learning-Team (Active Tasks), Patch-For-Review, artificial-intelligence, revscoring, Lift-Wing
ACraze created T282802: Implement model storage for enwiki-goodfaith inference service.
May 13 2021, 5:37 PM · Machine-Learning-Team (Active Tasks), Patch-For-Review, artificial-intelligence, revscoring, Lift-Wing
ACraze created T282797: Request membership in machinelearning group for ML team.
May 13 2021, 4:56 PM · Gerrit-Privilege-Requests

May 12 2021

ACraze edited projects for T276862: Load outlinks topic model in to KFServing, added: Machine-Learning-Team (Active Tasks); removed Machine-Learning-Team.
May 12 2021, 10:50 PM · Patch-For-Review, Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze edited projects for T279000: Load a revscoring model into KFServing, added: Machine-Learning-Team (Active Tasks); removed Machine-Learning-Team.
May 12 2021, 10:33 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing
ACraze renamed T276862: Load outlinks topic model in to KFServing from Load a fastText model in to KFServing to Load outlinks topic model model in to KFServing.
May 12 2021, 10:13 PM · Patch-For-Review, Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze added a comment to T276862: Load outlinks topic model in to KFServing.

Thanks @Isaac, I see that reflected in the code now, but didn't have threshold documented with the other params. I've added a patch for that in gerrit.

May 12 2021, 10:05 PM · Patch-For-Review, Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze created T282752: Upgrade Istio & Knative on sandbox cluster.
May 12 2021, 9:49 PM · Lift-Wing, Machine-Learning-Team

May 11 2021

ACraze created P15917 enwiki-goodfaith kfserving initial load test.
May 11 2021, 10:33 PM · revscoring, Lift-Wing
ACraze added a comment to T279000: Load a revscoring model into KFServing.

As mentioned in T279004, we have successfully deployed the enwiki-goodfaith model as a custom KFServing inference service based off of @kevinbazira's revscoring container image.
From initial testing it seems that we can use the revscoring image as a 'base' image and then inject the model binaries into individual inference services.

May 11 2021, 10:24 PM · Machine-Learning-Team (Active Tasks), artificial-intelligence, revscoring, Lift-Wing

May 10 2021

ACraze added a comment to T276862: Load outlinks topic model in to KFServing.

Confirming that the Outlinks topic model can indeed be loaded as a custom KFServing inference service to be used by Lift-Wing .
I was able to package and deploy the model inside our Kubeflow sandbox today.

May 10 2021, 10:37 PM · Patch-For-Review, Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze added a comment to T282429: Host Label Studio on Toolforge.

Excellent work @kevinbazira , thank you for the documentation! This will be very helpful in creating a tutorial for running community models too (T281317)

May 10 2021, 5:55 PM · Machine-Learning-Team (Active Tasks), Pilot-Flag
ACraze awarded T282422: Explore Label Studio as we prepare for Pilot Flag a Like token.
May 10 2021, 4:15 PM · Machine-Learning-Team (Active Tasks), Pilot-Flag

May 7 2021

ACraze added a comment to T279004: Production images for ORES/revscoring models.

@kevinbazira: good news! your revscoring container is now running on the Kubeflow sandbox! The enwiki-goodfaith model is running as a custom inferenceservice via KFServing using the Dockerfile you created.
I am able to hit the service and retrieve a prediction.

May 7 2021, 9:01 PM · Patch-For-Review, Machine-Learning-Team (Active Tasks), Lift-Wing
ACraze created P15863 KFServing enwiki-goodfaith server.
May 7 2021, 8:24 PM · revscoring, Lift-Wing
ACraze created P15862 Revscoring KFServing Dockerfile.
May 7 2021, 8:22 PM · revscoring, Lift-Wing
ACraze created P15860 KFServing + Dex Inference.
May 7 2021, 6:32 PM · Lift-Wing

Apr 27 2021

ACraze added a comment to T280467: Naming convention for the model storage structure.

@klausman I think most of our models should be ok with minute-granularity. The only case I can think of where it could be problematic is if we train two slightly different versions of the same model in parallel (and they finish within ~60 secs of each other), but I don't see that happening anytime soon. We could also just limit the pipelines to only running one at a time to make sure we avoid it.

Apr 27 2021, 10:50 PM · Lift-Wing, Machine-Learning-Team (Active Tasks)
ACraze updated the task description for T281317: Create a tutorial for deploying a model on toolforge.
Apr 27 2021, 10:02 PM · artificial-intelligence, Lift-Wing, Machine-Learning-Team (Active Tasks)
ACraze closed T265066: Proof of concept: dockerized model served over CloudVPS as Declined.

Talked about this with @calbon today and determined that toolforge is probably a better place to direct folks to for deploying community models at this time. This has been captured in T281317
Closing this ticket for now, feel free to reopen if we want to re-evaluate cloudvps.

Apr 27 2021, 9:39 PM · Machine-Learning-Team
ACraze updated the task description for T281317: Create a tutorial for deploying a model on toolforge.
Apr 27 2021, 9:38 PM · artificial-intelligence, Lift-Wing, Machine-Learning-Team (Active Tasks)
ACraze added projects to T281317: Create a tutorial for deploying a model on toolforge: Lift-Wing, artificial-intelligence.
Apr 27 2021, 9:33 PM · artificial-intelligence, Lift-Wing, Machine-Learning-Team (Active Tasks)
ACraze created T281317: Create a tutorial for deploying a model on toolforge.
Apr 27 2021, 9:29 PM · artificial-intelligence, Lift-Wing, Machine-Learning-Team (Active Tasks)

Apr 23 2021

ACraze added a comment to T280467: Naming convention for the model storage structure.

Thanks @Theofpa , this is really helpful right now as I'm working with a sandbox KF install to test some of our models with.

Apr 23 2021, 7:50 PM · Lift-Wing, Machine-Learning-Team (Active Tasks)

Apr 19 2021

ACraze added a comment to T280025: Find a way to store models for Kubeflow.

ok if I ask SRE to create an ML account on Thanos Swift then?

@elukey - yeah I think we should give this a try and see how it goes.

Apr 19 2021, 4:44 PM · Lift-Wing, Machine-Learning-Team (Active Tasks)