JobFlowInstancesConfig
A description of the Amazon EC2 instance on which the cluster (job flow) runs. A valid JobFlowInstancesConfig must contain either InstanceGroups or InstanceFleets. They cannot be used together. You may also have MasterInstanceType, SlaveInstanceType, and InstanceCount (all three must be present), but we don't recommend this configuration.
Types
Properties
A list of additional Amazon EC2 security group IDs for the master node.
A list of additional Amazon EC2 security group IDs for the core and task nodes.
The name of the Amazon EC2 key pair that can be used to connect to the master node using SSH as the user called "hadoop."
Applies to clusters that use the uniform instance group configuration. To launch the cluster in Amazon Virtual Private Cloud (Amazon VPC), set this parameter to the identifier of the Amazon VPC subnet where you want the cluster to launch. If you do not specify this value and your account supports EC2-Classic, the cluster launches in EC2-Classic.
Applies to clusters that use the instance fleet configuration. When multiple Amazon EC2 subnet IDs are specified, Amazon EMR evaluates them and launches instances in the optimal subnet.
The identifier of the Amazon EC2 security group for the master node. If you specify EmrManagedMasterSecurityGroup
, you must also specify EmrManagedSlaveSecurityGroup
.
The identifier of the Amazon EC2 security group for the core and task nodes. If you specify EmrManagedSlaveSecurityGroup
, you must also specify EmrManagedMasterSecurityGroup
.
Applies only to Amazon EMR release versions earlier than 4.0. The Hadoop version for the cluster. Valid inputs are "0.18" (no longer maintained), "0.20" (no longer maintained), "0.20.205" (no longer maintained), "1.0.3", "2.2.0", or "2.4.0". If you do not set this value, the default of 0.18 is used, unless the AmiVersion
parameter is set in the RunJobFlow call, in which case the default version of Hadoop for that AMI version is used.
The number of Amazon EC2 instances in the cluster.
The instance fleet configuration is available only in Amazon EMR releases 4.8.0 and later, excluding 5.0.x versions.
Configuration for the instance groups in a cluster.
Specifies whether the cluster should remain available after completing all steps. Defaults to false
. For more information about configuring cluster termination, see Control Cluster Termination in the EMR Management Guide.
The Amazon EC2 instance type of the master node.
The Availability Zone in which the cluster runs.
The identifier of the Amazon EC2 security group for the Amazon EMR service to access clusters in VPC private subnets.
The Amazon EC2 instance type of the core and task nodes.
Specifies whether to lock the cluster to prevent the Amazon EC2 instances from being terminated by API call, user intervention, or in the event of a job-flow error.
Indicates whether Amazon EMR should gracefully replace core nodes that have degraded within the cluster.