AutoGluon-Tabular hyperparameters
The following table contains the subset of hyperparameters that are required or most
commonly used for the Amazon SageMaker AI AutoGluon-Tabular algorithm. Users set these parameters
to facilitate the estimation of model parameters from data. The SageMaker AI
AutoGluon-Tabular algorithm is an implementation of the open-source AutoGluon-Tabular
Note
The default hyperparameters are based on example datasets in the AutoGluon-Tabular sample notebooks.
By default, the SageMaker AI AutoGluon-Tabular algorithm automatically chooses an evaluation
metric based on the type of classification problem. The algorithm detects the type of
classification problem based on the number of labels in your data. For regression
problems, the evaluation metric is root mean squared error. For binary classification
problems, the evaluation metric is area under the receiver operating characteristic
curve (AUC). For multiclass classification problems, the evaluation metric is accuracy.
You can use the eval_metric
hyperparameter to change the default evaluation
metric. Refer to the following table for more information on AutoGluon-Tabular
hyperparameters, including descriptions, valid values, and default values.
Parameter Name | Description |
---|---|
eval_metric |
The evaluation metric for validation data. If
Valid values: string, refer to the AutoGluon documentation Default value: |
presets |
List of preset configurations for various arguments in
For more details, see AutoGluon Predictors Valid values: string, any of the following:
( Default value: |
auto_stack |
Whether AutoGluon should automatically utilize bagging and
multi-layer stack ensembling to boost predictive accuracy. Set
Valid values: string, Default value: |
num_bag_folds |
Number of folds used for bagging of models. When
Valid values: string, any integer between (and including)
Default value: |
num_bag_sets |
Number of repeats of kfold bagging to perform (values must be
greater than or equal to 1). The total number of models trained
during bagging is equal to Valid values: integer, range: [ Default value: |
num_stack_levels |
Number of stacking levels to use in stack ensemble. Roughly
increases model training time by factor of
Valid values: float, range: [ Default value: |
refit_full |
Whether or not to retrain all models on all of the data (training
and validation) after the normal training procedure. For more
details, see AutoGluon Predictors Valid values: string, Default value: |
set_best_to_refit_full |
Whether or not to change the default model that the predictor uses
for prediction. If Valid values: string, Default value: |
save_space |
Whether or note to reduce the memory and disk size of predictor by
deleting auxiliary model files that aren’t needed for prediction on
new data. This has no impact on inference accuracy. We recommend
setting Valid values: string, Default value: |
verbosity |
The verbosity of print messages. Valid values: integer, any of the following: ( Default value: |