Spark programs are executed (submitted) by using the spark-submit command
It is a command line program
It is characterized by a set of parameters
E.g., the name of the jar file containing all the classes of the application
The name of the Driver class
The parameters of the Spark application
etc.
spark-submit has also two parameters that are used to specify where the application is executed
--master option
Specifies which environment/scheduler is used to execute the Spark application we want to execute
spark://host:port  The Spark scheduler is used
mesos://host:port  The Mesos scheduler is used
yarn               The YARN scheduler (i.e., the one of Hadoop) is used
local              The application is executed exclusively on the local PC
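As a sketch, a typical spark-submit invocation bundles these parameters together; the jar name, Driver class, and application arguments below are hypothetical placeholders, not names from this document:

```shell
# Hypothetical example: submit a Spark application packaged in myapp.jar.
# com.example.MyDriver is the class containing main(); the trailing values
# are the application's own parameters (here, an input and an output path).
spark-submit \
  --class com.example.MyDriver \
  myapp.jar \
  hdfs://input_folder hdfs://output_folder
```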
--deploy-mode option
Specifies where the Driver is launched/executed
client   The driver is launched locally (on the local PC executing spark-submit)
cluster  The driver is launched on one node of the cluster
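Combining the two options, a sketch of submitting the same hypothetical application to different environments (class and jar names are placeholders):

```shell
# Run entirely on the local PC; local[4] asks Spark to use 4 local threads:
spark-submit --master local[4] \
  --class com.example.MyDriver myapp.jar

# Run on a YARN cluster, with the driver launched on the client machine:
spark-submit --master yarn --deploy-mode client \
  --class com.example.MyDriver myapp.jar
```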
2/26/2017
In cluster mode
The Spark driver runs in the ApplicationMaster on a cluster node
The cluster nodes are also used to store RDDs and execute transformations and actions on them
A single process in a YARN container is responsible for both driving the application and requesting resources from YARN
The resources (memory and CPU) of the client that launches the application are not used
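A minimal sketch of a cluster-mode submission (hypothetical class and jar names):

```shell
# The driver runs inside the ApplicationMaster on a cluster node;
# the client machine only submits the job and uses no resources for the driver.
spark-submit --master yarn --deploy-mode cluster \
  --class com.example.MyDriver myapp.jar
```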
In client mode
The Spark driver runs on the host where the job is submitted (i.e., the resources of the client are used to execute the Driver)
The cluster nodes are used to store RDDs and execute transformations and actions on them
The ApplicationMaster is responsible only for requesting executor containers from YARN
--executor-cores NUM
The number of cores per executor. Default value: NUM=1 core
--executor-memory MEM
Main memory per executor. Default value: MEM=1GB
The maximum values of these parameters are limited by the configuration of the cluster
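A sketch combining the resource options above with a cluster-mode submission (class and jar names are hypothetical; the values actually granted are capped by the cluster configuration):

```shell
# Request 2 cores and 4 GB of main memory per executor:
spark-submit --master yarn --deploy-mode cluster \
  --executor-cores 2 \
  --executor-memory 4G \
  --class com.example.MyDriver myapp.jar
```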