
Enable spark jobs on the dse-k8s cluster via the spark-operator
Open, In Progress, MediumPublic

Description

Update, October 2025

We have installed the spark-operator on the dse-k8s-eqiad cluster, and it can be used to execute an example job.
However, we now need to facilitate more meaningful testing by the Data-Engineering team, which means that Spark must be able to reach data sources and sinks, with appropriate authentication, authorization, and monitoring capabilities.
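
For illustration, a minimal SparkApplication manifest for the kubeflow spark-operator might look like the following sketch. The image, namespace, and service-account names here are placeholders, not the actual dse-k8s configuration:

```yaml
# Hypothetical minimal SparkApplication for the kubeflow spark-operator.
# Image, namespace, and serviceAccount values are illustrative only.
apiVersion: sparkoperator.k8s.io/v1beta2
kind: SparkApplication
metadata:
  name: spark-pi-test
  namespace: spark
spec:
  type: Scala
  mode: cluster
  image: docker-registry.example.org/spark:3.4.1
  mainClass: org.apache.spark.examples.SparkPi
  mainApplicationFile: local:///opt/spark/examples/jars/spark-examples.jar
  sparkVersion: "3.4.1"
  driver:
    cores: 1
    memory: 1g
    serviceAccount: spark-driver
  executor:
    instances: 2
    cores: 1
    memory: 1g
```

Once applied, the operator creates the driver pod, which in turn requests executor pods from the Kubernetes API.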

Our goal is for Airflow to be able to launch Spark jobs that run on the dse-k8s cluster, so we have shifted the focus away from regular users on stat servers for the time being.

The sparkctl binary has been deprecated and dropped from recent versions of the spark-operator.

The spark-operator project itself has been adopted by Kubeflow, so it is now hosted at: https://github.com/kubeflow/spark-operator

Original ticket description follows:

This ticket is closely aligned with T318535: Document ideas & investigation results from our spike with "Spark on k8s" [SPIKE - 1.5 Sprints] and forms part of an early, experimentation phase of T308317: Data Infrastructure as a Service MVP, in support of T302728: Analytics Platform Future State Planning.

We would like to be able to test the Spark on K8S operator on the DSE cluster: https://github.com/GoogleCloudPlatform/spark-on-k8s-operator

The intended outcome is to be able to execute a spark job as a normal user on a stat box using sparkctl create
https://github.com/GoogleCloudPlatform/spark-on-k8s-operator/blob/master/sparkctl/README.md#create.

The nature of the Spark job itself is not important at this stage; it could be stateless.
In future, we will need to investigate the capabilities of both the HDFS and Ceph storage back-ends.
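
For reference, pointing Spark at an S3-compatible store such as Ceph RGW would typically involve `hadoopConf`/`sparkConf` entries along these lines; the endpoint and credentials provider shown are placeholders, not a tested configuration for this cluster:

```yaml
# Illustrative only: s3a settings for an S3-compatible (e.g. Ceph RGW) back-end,
# as they might appear in a SparkApplication spec.
spec:
  hadoopConf:
    fs.s3a.endpoint: "https://rgw.example.org"
    fs.s3a.path.style.access: "true"
  sparkConf:
    spark.hadoop.fs.s3a.aws.credentials.provider: "org.apache.hadoop.fs.s3a.SimpleAWSCredentialsProvider"
```

An HDFS back-end would instead require the cluster's Hadoop configuration and Kerberos credentials to be available to the driver and executor pods.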

Goal:
Run the Spark K8s Operator on the DSE cluster

Task:

  • Make the spark-on-k8s operator packages/images available for use
  • Add the spark-on-k8s operator privileged components to the dse-k8s cluster
  • Add the sparkctl binary to the stat boxes
  • Submit a spark job to the dse-k8s cluster

Outcomes:

  • Can successfully launch a spark job on the dse-k8s cluster with sparkctl from a stat box and monitor/log its execution.


Event Timeline

EChetty removed the point value 10 for this task.Sep 29 2022, 12:57 PM
EChetty moved this task from Backlog to Investigate on the Foundational Technology Requests board.
EChetty changed the task status from Open to In Progress.Jan 18 2023, 11:56 AM

Removing inactive assignee (please do so as part of team offboarding!).

BTullis renamed this task from POC for Running Spark on DSE to Enable spark jobs on the dse-k8s cluster via the spark-operator.Jul 18 2023, 11:08 AM
BTullis triaged this task as Medium priority.
BTullis updated the task description. (Show Details)
BTullis removed a subscriber: EChetty.

Change #1110883 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] airflow: Allow specific task pods to access the kube-api

https://gerrit.wikimedia.org/r/1110883

Change #1110883 merged by jenkins-bot:

[operations/deployment-charts@master] airflow: Allow specific task pods to access the kube-api

https://gerrit.wikimedia.org/r/1110883

Change #1111206 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] airflow: Use the existing labels for kubernetes and spark operators

https://gerrit.wikimedia.org/r/1111206

Change #1111206 merged by jenkins-bot:

[operations/deployment-charts@master] airflow: Use the existing labels for kubernetes and spark operators

https://gerrit.wikimedia.org/r/1111206

Change #1111278 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] airflow: Add a separate networkpolicy for task-pods to access k8s API

https://gerrit.wikimedia.org/r/1111278

Change #1111278 merged by jenkins-bot:

[operations/deployment-charts@master] airflow: Add a separate networkpolicy for task-pods to access k8s API

https://gerrit.wikimedia.org/r/1111278

Aklapper changed the task status from In Progress to Open.Mar 22 2025, 7:24 AM

Resetting task status from "In Progress" to "Open" as this task has been "in progress" for more than two years.

BTullis changed the task status from Open to In Progress.Sep 17 2025, 5:33 PM
BTullis subscribed.

Resetting to "In Progress" as we are now planning to carry on working on this.

The target will change somewhat: we will be less concerned with regular users being able to launch Spark jobs from stat boxes, and more concerned with being able to launch Spark jobs from Airflow tasks.
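
As a sketch of what such an Airflow task might submit (the names, image, and paths below are hypothetical, and a real DAG would more likely use a provider operator such as SparkKubernetesOperator rather than building the resource by hand), the SparkApplication body can be constructed as a plain dict and posted to the Kubernetes API:

```python
# Sketch: build the SparkApplication custom-resource body that an Airflow
# task could submit to the Kubernetes API via the spark-operator.
# All names, images, and paths are placeholders.
def spark_application(name: str, image: str, main_file: str,
                      namespace: str = "spark", executors: int = 2) -> dict:
    """Return a SparkApplication resource body for the spark-operator."""
    return {
        "apiVersion": "sparkoperator.k8s.io/v1beta2",
        "kind": "SparkApplication",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            "type": "Python",
            "mode": "cluster",
            "image": image,
            "mainApplicationFile": main_file,
            "driver": {"cores": 1, "memory": "1g",
                       "serviceAccount": "spark-driver"},
            "executor": {"instances": executors, "cores": 1, "memory": "1g"},
        },
    }

# Example: the manifest an Airflow task would hand to the k8s custom-objects API.
app = spark_application("pi-test", "spark:3.4.1",
                        "local:///opt/spark/examples/src/main/python/pi.py")
```

The networkpolicy changes in the patches above are what allow such a task pod to reach the kube-api and create this resource.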

Change #1189279 had a related patch set uploaded (by Btullis; author: Btullis):

[operations/deployment-charts@master] Fix the webhook TLS configuration for the spark-operator

https://gerrit.wikimedia.org/r/1189279

Change #1189279 merged by jenkins-bot:

[operations/deployment-charts@master] Fix the webhook TLS configuration for the spark-operator

https://gerrit.wikimedia.org/r/1189279