TabularJobConfig
The collection of settings used by an AutoML job V2 for the tabular problem type.
Types
Properties
The configuration information of how model candidates are generated.
How long a job is allowed to run, or how many candidates a job is allowed to generate.
A URL to the Amazon S3 data source containing selected features from the input data source to run an Autopilot job V2. You can input FeatureAttributeNames
(optional) in JSON format as shown below:
Generates possible candidates without training the models. A model candidate is a combination of data preprocessors, algorithms, and algorithm parameter settings.
The method that Autopilot uses to train the data. You can either specify the mode manually or let Autopilot choose for you based on the dataset size by selecting AUTO
. In AUTO
mode, Autopilot chooses ENSEMBLING
for datasets smaller than 100 MB, and HYPERPARAMETER_TUNING
for larger ones.
The type of supervised learning problem available for the model candidates of the AutoML job V2. For more information, see SageMaker Autopilot problem types.
If specified, this column name indicates which column of the dataset should be treated as sample weights for use by the objective metric during the training, evaluation, and the selection of the best model. This column is not considered as a predictive feature. For more information on Autopilot metrics, see Metrics and validation.
The name of the target variable in supervised learning, usually represented by 'y'.