Autotune Action Set

Provides actions to tune machine learning algorithm hyperparameters for individual or multiple model types

tuneLabelSpread Action

Automatically adjusts Label Spreading technique parameters to tune for best objective metric value.

CASL Syntax

autotune.tuneLabelSpread <result=results> <status=rc> /
required parameter trainOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>},
tunerOptions={
constraints={{
lowerBound=double,
required parameter metric="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "LOSSFUNCTION" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME",
upperBound=double
}, {...}},
enableLocalSearch=TRUE | FALSE,
evaluationHistory=TRUE | FALSE,
foldColumn="string",
historyTable={
appendLookupData=TRUE | FALSE
liveUpdate=TRUE | FALSE
required parameter table={
caslib="string"
compress=TRUE | FALSE
indexVars={"variable-name-1" <, "variable-name-2", ...>}
label="string"
lifetime=64-bit-integer
maxMemSize=64-bit-integer
memoryFormat="DVR" | "INHERIT" | "STANDARD"
name="table-name"
promote=TRUE | FALSE
replace=TRUE | FALSE
replication=integer
tableRedistUpPolicy="DEFER" | "NOREDIST" | "REBALANCE"
threadBlockSize=64-bit-integer
timeStamp="string"
where={"string-1" <, "string-2", ...>}
}
},
logLevel=integer,
lookupTable={
selectInitialPoint=TRUE | FALSE
table={
caslib="string"
computedOnDemand=TRUE | FALSE
computedVars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
dataSourceOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>}
importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter name="table-name"
singlePass=TRUE | FALSE
vars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
where="where-expression"
whereTable={
casLib="string"
dataSourceOptions={adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}
importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter name="table-name"
vars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
where="where-expression"
}
}
},
maxEvals=integer,
maxIters=integer,
maxTime=double,
maxTrainTime=64-bit-integer,
nCrossValFolds=integer,
nParallel=integer,
popSize=integer,
sampleSize=integer,
seed=integer,
shuffleGridPoints=TRUE | FALSE,
targetEvent="string",
userConfigurations={
caslib="string"
computedOnDemand=TRUE | FALSE
computedVars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
dataSourceOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>}
importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter name="table-name"
singlePass=TRUE | FALSE
vars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
where="where-expression"
whereTable={
casLib="string"
dataSourceOptions={adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}
importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter name="table-name"
vars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
where="where-expression"
}
},
userDefinedPartition=TRUE | FALSE,
},
tuningParameters={{
exclude=TRUE | FALSE,
initValue=integer | 64-bit-integer | double | TRUE | FALSE | "string",
lowerBound=double,
namePath="string",
upperBound=double,
valueList={any-list-or-data-type-1 <, any-list-or-data-type-2, ...>}
}, {...}},
useParameters="COMBINED" | "CUSTOM" | "STANDARD"
;
indicates a required parameter

Summary: Input and Output Tables

If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.

Parameters for Reading Input Tables

Parameter

Subparameter

Description

 tunerOptions

lookupTable (and nested parameter table), userConfigurations

specifies a list of parameters for adjusting how the tuner behaves.

Parameters for Creating Output Tables

Parameter

Subparameter

Description

 tunerOptions

historyTable (and nested parameter table)

specifies a list of parameters for adjusting how the tuner behaves.

Parameter Descriptions

* trainOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>}

specifies a list of parameters to be used by the labelSpread action in the graphSemiSupLearn action set; for more information, see the parameters for that action.

tunerOptions={tunerOptions}

specifies a list of parameters for adjusting how the tuner behaves.

The tunerOptions value can be one or more of the following:

constraints={{autotuneConstraintDefinition-1} <, {autotuneConstraintDefinition-2}, ...>}

specifies a list of constraints and their definitions.

The autotuneConstraintDefinition value can be one or more of the following:

lowerBound=double

specifies the lower bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias LB
* metric="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "LOSSFUNCTION" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies a metric to constrain during tuning.

ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

LOSSFUNCTION

specifies the Loss Function as the constraint metric.

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

upperBound=double

specifies the upper bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias UB
enableLocalSearch=TRUE | FALSE

when set to True, local search optimization is enabled for model tuning.

Default TRUE
evaluationHistory=TRUE | FALSE

when set to True, creates a table of every evaluation (every model configuration) and returns those tables to the client.

Default TRUE
foldColumn="string"

specifies a column in the data table in which the cross validation fold value is indicated.

historyTable={history_table}

specifies the CAS table that is created by the autotune action that contains all evaluation history data points.

The history_table value can be one or more of the following:

appendLookupData=TRUE | FALSE

when set to True, specifies that all unused lookup data points be appended to the table specified in the historyTable parameter.

Default FALSE
liveUpdate=TRUE | FALSE

when set to True, updates the table specified in the historyTable parameter after every evaluation. When the liveUpdate parameter is set to True, the table is promoted even if the promote parameter is not set to True, in order to allow other sessions to access the table.

Default FALSE
* table={casouttable}

For more information about specifying the table parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).

logLevel=integer

specifies the level of log messages to be written: no logs (0), initialization and completion logs (1), setup and iteration results (2), or detailed evaluation results (3).

Default 2
Range 0–3
lookupTable={lookup_table}

specifies the CAS table used by the autotune action for evaluation lookup.

The lookup_table value can be one or more of the following:

selectInitialPoint=TRUE | FALSE

when set to True, selects the best point from the lookup table and uses it as the initial (default) point.

Alias initPoint
Default FALSE
table={castable}

specifies the settings for an input table.

The castable value can be one or more of the following:

caslib="string"

specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.

computedOnDemand=TRUE | FALSE

when set to True, creates the computed variables when the table is loaded instead of when the action begins.

Alias compOnDemand
Default FALSE
computedVars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.

Alias compVars

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

computedVarsProgram="string"

specifies an expression for each computed variable that you include in the computedVars parameter.

Alias compPgm
dataSourceOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>}

specifies data source options.

Aliases options
dataSource
groupByMode="NOSORT" | "REDISTRIBUTE"

specifies how to create groups.

Default NOSORT
NOSORT

groups the data without sorting on each machine, and then groups the data again on the controller.

REDISTRIBUTE

transfers rows between nodes to guarantee ordering within groups. This method is slower.

importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the input table.

singlePass=TRUE | FALSE

when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.

Default FALSE
vars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the variables to use in the action.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the input data.

whereTable={groupbytable}

specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.

The groupbytable value can be one or more of the following:

casLib="string"

specifies the caslib for the filter table. By default, the active caslib is used.

dataSourceOptions={adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}

specifies data source options.

Aliases options
dataSource

For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).

importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the filter table.

vars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the variable names to use from the filter table.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the data from the filter table.

maxBayesianModelSize=integer

specifies the maximum number of points in the kriging model. This parameter is ignored unless the value "BAYESIAN" is specified in the searchMethod parameter.

Alias maxBayesianSize
Default 500
Minimum value 10
maxEvals=integer

specifies the maximum number of objective evaluations (model configurations) to be trained and scored.

Alias maxEvaluations
Default 0
Minimum value 0
maxIters=integer

specifies the maximum number of iterations of optimization for model tuning. At each iteration, the number of evaluated models is less than or equal to the value of the popSize parameter.

Alias maxIterations
Default 5
Minimum value 1
maxTime=double

specifies the maximum time allowed for tuning. The actual time might exceed this value by the amount of time required to train and score a model configuration.

Default 36000
Minimum value 1
maxTrainTime=64-bit-integer

specifies the maximum time allowed for a single model training. The model training is terminated if it exceeds this time, and the objective value is set to missing.

Minimum value 0
nConvergenceIterations=integer

specifies the number of convergence iterations to use for terminating the tuning.

Alias nConvIters
Default 4
Minimum value 1
nCrossValFolds=integer

specifies the number of folds to use for cross validation to assess model fit error as the tuning objective.

Alias nFolds
Default 5
Minimum value 2
nParallel=integer

specifies the number of parallel subsessions to be used for parallel evaluation of model configurations.

Default 0
nSubsessionWorkers=integer

specifies the number of workers to be used by each subsession for parallel evaluation of model configurations.

Alias nSubWorkers
Default 0
Minimum value 0
objective="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies the metric to use as the objective during tuning.

ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

popSize=integer

specifies the maximum number of model configurations to evaluate at each iteration.

Alias populationSize
Default 10
Minimum value 2
sampleSize=integer

specifies the number of model configurations to evaluate when the value of the searchMethod parameter is "LHS" or "RANDOM".

Default 50
Minimum value 2
searchMethod="BAYESIAN" | "GA" | "GRID" | "LHS" | "RANDOM"

specifies the search method to use during tuning.

Default GA
BAYESIAN

uses the Bayesian search method.

GA

uses the genetic algorithm search method.

GRID

uses the grid search method.

LHS

uses the Latin hypercube sample search method.

RANDOM

uses the random search method.

secondObjective="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "NONE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies the metric to use as the second objective during tuning.

Default NONE
ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

NONE

specifies no second objective to be used for tuning.

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

seed=integer

specifies the seed to use for selecting training data and validation data for a single validation partition or fold number for each observation for cross validation.

Default 0
shuffleGridPoints=TRUE | FALSE

when set to True, shuffles the points that are generated by the grid search method before execution.

Default TRUE
targetEvent="string"

specifies the name of the nominal target event to use for tuning.

trainPartitionFraction=double

specifies the size of a single data partition to use for model training during tuning.

Alias trainFraction
Range 0.01–0.99
userConfigurations={castable}

specifies the CAS table that contains user configurations to be used by the autotune action.

The castable value can be one or more of the following:

caslib="string"

specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.

computedOnDemand=TRUE | FALSE

when set to True, creates the computed variables when the table is loaded instead of when the action begins.

Alias compOnDemand
Default FALSE
computedVars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.

Alias compVars

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

computedVarsProgram="string"

specifies an expression for each computed variable that you include in the computedVars parameter.

Alias compPgm
dataSourceOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>}

specifies data source options.

Aliases options
dataSource
groupByMode="NOSORT" | "REDISTRIBUTE"

specifies how to create groups.

Default NOSORT
NOSORT

groups the data without sorting on each machine, and then groups the data again on the controller.

REDISTRIBUTE

transfers rows between nodes to guarantee ordering within groups. This method is slower.

importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the input table.

singlePass=TRUE | FALSE

when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.

Default FALSE
vars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the variables to use in the action.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the input data.

whereTable={groupbytable}

specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.

The groupbytable value can be one or more of the following:

casLib="string"

specifies the caslib for the filter table. By default, the active caslib is used.

dataSourceOptions={adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}

specifies data source options.

Aliases options
dataSource

For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).

importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the filter table.

vars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the variable names to use from the filter table.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the data from the filter table.

userDefinedPartition=TRUE | FALSE

when set to True, includes a user-defined partition for training and scoring.

Default FALSE
validationPartitionFraction=double

specifies the size of a single validation partition to use in order to assess model fit error as the tuning objective.

Alias validateFraction
Default 0.3
Range 0.01–0.99

tuningParameters={{autotuneTuningParmDefinition-1} <, {autotuneTuningParmDefinition-2}, ...>}

specifies a list of custom tuning parameters and their definitions.

The autotuneTuningParmDefinition value can be one or more of the following:

exclude=TRUE | FALSE

when set to True, excludes a tuning parameter from the tuning process.

Default FALSE
initValue=integer | 64-bit-integer | double | TRUE | FALSE | "string"

specifies the initial (default) value of a tuning parameter.

lowerBound=double

specifies the lower bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias LB
namePath="string"

specifies the name path of a tuning parameter. For a nested action parameter, this parameter specifies a dot-separated path that includes all its parent parameter names. For a top-level action parameter, this parameter is simply the name of the parameter.

Alias name
upperBound=double

specifies the upper bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias UB
valueList={any-list-or-data-type-1 <, any-list-or-data-type-2, ...>}

specifies the list of values to try for a tuning parameter. If this parameter is specified, the lower bound and upper bound values of the tuning parameter are ignored.

useParameters="COMBINED" | "CUSTOM" | "STANDARD"

specifies whether to use only standard tuning parameters, only custom tuning parameters, or a combination of both.

Default COMBINED

tuneLabelSpread Action

Automatically adjusts Label Spreading technique parameters to tune for best objective metric value.

Lua Syntax

results, info = s:autotune_tuneLabelSpread{
required parameter trainOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>},
tunerOptions={
constraints={{
lowerBound=double,
required parameter metric="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "LOSSFUNCTION" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME",
upperBound=double
}, {...}},
enableLocalSearch=true | false,
evaluationHistory=true | false,
foldColumn="string",
historyTable={
appendLookupData=true | false
liveUpdate=true | false
required parameter table={
caslib="string"
compress=true | false
indexVars={"variable-name-1" <, "variable-name-2", ...>}
label="string"
lifetime=64-bit-integer
maxMemSize=64-bit-integer
memoryFormat="DVR" | "INHERIT" | "STANDARD"
name="table-name"
promote=true | false
replace=true | false
replication=integer
tableRedistUpPolicy="DEFER" | "NOREDIST" | "REBALANCE"
threadBlockSize=64-bit-integer
timeStamp="string"
where={"string-1" <, "string-2", ...>}
}
},
logLevel=integer,
lookupTable={
selectInitialPoint=true | false
table={
caslib="string"
computedOnDemand=true | false
computedVars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
dataSourceOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>}
importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter name="table-name"
singlePass=true | false
vars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
where="where-expression"
whereTable={
casLib="string"
dataSourceOptions={adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}
importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter name="table-name"
vars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
where="where-expression"
}
}
},
maxEvals=integer,
maxIters=integer,
maxTime=double,
maxTrainTime=64-bit-integer,
nCrossValFolds=integer,
nParallel=integer,
popSize=integer,
sampleSize=integer,
seed=integer,
shuffleGridPoints=true | false,
targetEvent="string",
userConfigurations={
caslib="string"
computedOnDemand=true | false
computedVars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
dataSourceOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>}
importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter name="table-name"
singlePass=true | false
vars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
where="where-expression"
whereTable={
casLib="string"
dataSourceOptions={adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}
importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter name="table-name"
vars={{
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
}, {...}}
where="where-expression"
}
},
userDefinedPartition=true | false,
},
tuningParameters={{
exclude=true | false,
initValue=integer | 64-bit-integer | double | true | false | "string",
lowerBound=double,
namePath="string",
upperBound=double,
valueList={any-list-or-data-type-1 <, any-list-or-data-type-2, ...>}
}, {...}},
useParameters="COMBINED" | "CUSTOM" | "STANDARD"
}
indicates a required parameter

Summary: Input and Output Tables

If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.

Parameters for Reading Input Tables

Parameter

Subparameter

Description

 tunerOptions

lookupTable (and nested parameter table), userConfigurations

specifies a list of parameters for adjusting how the tuner behaves.

Parameters for Creating Output Tables

Parameter

Subparameter

Description

 tunerOptions

historyTable (and nested parameter table)

specifies a list of parameters for adjusting how the tuner behaves.

Parameter Descriptions

* trainOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>}

specifies a list of parameters to be used by the labelSpread action in the graphSemiSupLearn action set; for more information, see the parameters for that action.

tunerOptions={tunerOptions}

specifies a list of parameters for adjusting how the tuner behaves.

The tunerOptions value can be one or more of the following:

constraints={{autotuneConstraintDefinition-1} <, {autotuneConstraintDefinition-2}, ...>}

specifies a list of constraints and their definitions.

The autotuneConstraintDefinition value can be one or more of the following:

lowerBound=double

specifies the lower bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias LB
* metric="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "LOSSFUNCTION" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies a metric to constrain during tuning.

ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

LOSSFUNCTION

specifies the Loss Function as the constraint metric.

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

upperBound=double

specifies the upper bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias UB
enableLocalSearch=true | false

when set to True, local search optimization is enabled for model tuning.

Default true
evaluationHistory=true | false

when set to True, creates a table of every evaluation (every model configuration) and returns those tables to the client.

Default true
foldColumn="string"

specifies a column in the data table in which the cross validation fold value is indicated.

historyTable={history_table}

specifies the CAS table that is created by the autotune action that contains all evaluation history data points.

The history_table value can be one or more of the following:

appendLookupData=true | false

when set to True, specifies that all unused lookup data points be appended to the table specified in the historyTable parameter.

Default false
liveUpdate=true | false

when set to True, updates the table specified in the historyTable parameter after every evaluation. When the liveUpdate parameter is set to True, the table is promoted even if the promote parameter is not set to True, in order to allow other sessions to access the table.

Default false
* table={casouttable}

For more information about specifying the table parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).

logLevel=integer

specifies the level of log messages to be written: no logs (0), initialization and completion logs (1), setup and iteration results (2), or detailed evaluation results (3).

Default 2
Range 0–3
lookupTable={lookup_table}

specifies the CAS table used by the autotune action for evaluation lookup.

The lookup_table value can be one or more of the following:

selectInitialPoint=true | false

when set to True, selects the best point from the lookup table and uses it as the initial (default) point.

Alias initPoint
Default false
table={castable}

specifies the settings for an input table.

The castable value can be one or more of the following:

caslib="string"

specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.

computedOnDemand=true | false

when set to True, creates the computed variables when the table is loaded instead of when the action begins.

Alias compOnDemand
Default false
computedVars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.

Alias compVars

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

computedVarsProgram="string"

specifies an expression for each computed variable that you include in the computedVars parameter.

Alias compPgm
dataSourceOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>}

specifies data source options.

Aliases options
dataSource
groupByMode="NOSORT" | "REDISTRIBUTE"

specifies how to create groups.

Default NOSORT
NOSORT

groups the data without sorting on each machine, and then groups the data again on the controller.

REDISTRIBUTE

transfers rows between nodes to guarantee ordering within groups. This method is slower.

importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the input table.

singlePass=true | false

when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.

Default false
vars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the variables to use in the action.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the input data.

whereTable={groupbytable}

specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.

The groupbytable value can be one or more of the following:

casLib="string"

specifies the caslib for the filter table. By default, the active caslib is used.

dataSourceOptions={adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}

specifies data source options.

Aliases options
dataSource

For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).

importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the filter table.

vars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the variable names to use from the filter table.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the data from the filter table.

maxBayesianModelSize=integer

specifies the maximum number of points in the kriging model. This parameter is ignored unless the value "BAYESIAN" is specified in the searchMethod parameter.

Alias maxBayesianSize
Default 500
Minimum value 10
maxEvals=integer

specifies the maximum number of objective evaluations (model configurations) to be trained and scored.

Alias maxEvaluations
Default 0
Minimum value 0
maxIters=integer

specifies the maximum number of iterations of optimization for model tuning. At each iteration, the number of evaluated models is less than or equal to the value of the popSize parameter.

Alias maxIterations
Default 5
Minimum value 1
maxTime=double

specifies the maximum time allowed for tuning. The actual time might exceed this value by the amount of time required to train and score a model configuration.

Default 36000
Minimum value 1
maxTrainTime=64-bit-integer

specifies the maximum time allowed for a single model training. The model training is terminated if it exceeds this time, and the objective value is set to missing.

Minimum value 0
nConvergenceIterations=integer

specifies the number of convergence iterations to use for terminating the tuning.

Alias nConvIters
Default 4
Minimum value 1
nCrossValFolds=integer

specifies the number of folds to use for cross validation to assess model fit error as the tuning objective.

Alias nFolds
Default 5
Minimum value 2
nParallel=integer

specifies the number of parallel subsessions to be used for parallel evaluation of model configurations.

Default 0
nSubsessionWorkers=integer

specifies the number of workers to be used by each subsession for parallel evaluation of model configurations.

Alias nSubWorkers
Default 0
Minimum value 0
objective="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies the metric to use as the objective during tuning.

ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

popSize=integer

specifies the maximum number of model configurations to evaluate at each iteration.

Alias populationSize
Default 10
Minimum value 2
sampleSize=integer

specifies the number of model configurations to evaluate when the value of the searchMethod parameter is "LHS" or "RANDOM".

Default 50
Minimum value 2
searchMethod="BAYESIAN" | "GA" | "GRID" | "LHS" | "RANDOM"

specifies the search method to use during tuning.

Default GA
BAYESIAN

uses the Bayesian search method.

GA

uses the genetic algorithm search method.

GRID

uses the grid search method.

LHS

uses the Latin hypercube sample search method.

RANDOM

uses the random search method.

secondObjective="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "NONE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies the metric to use as the second objective during tuning.

Default NONE
ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

NONE

specifies no second objective to be used for tuning.

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

seed=integer

specifies the seed to use for selecting training data and validation data for a single validation partition or fold number for each observation for cross validation.

Default 0
shuffleGridPoints=true | false

when set to True, shuffles the points that are generated by the grid search method before execution.

Default true
targetEvent="string"

specifies the name of the nominal target event to use for tuning.

trainPartitionFraction=double

specifies the size of a single data partition to use for model training during tuning.

Alias trainFraction
Range 0.01–0.99
userConfigurations={castable}

specifies the CAS table that contains user configurations to be used by the autotune action.

The castable value can be one or more of the following:

caslib="string"

specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.

computedOnDemand=true | false

when set to True, creates the computed variables when the table is loaded instead of when the action begins.

Alias compOnDemand
Default false
computedVars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.

Alias compVars

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

computedVarsProgram="string"

specifies an expression for each computed variable that you include in the computedVars parameter.

Alias compPgm
dataSourceOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2, ...>}

specifies data source options.

Aliases options
dataSource
groupByMode="NOSORT" | "REDISTRIBUTE"

specifies how to create groups.

Default NOSORT
NOSORT

groups the data without sorting on each machine, and then groups the data again on the controller.

REDISTRIBUTE

transfers rows between nodes to guarantee ordering within groups. This method is slower.

importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the input table.

singlePass=true | false

when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.

Default false
vars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the variables to use in the action.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the input data.

whereTable={groupbytable}

specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.

The groupbytable value can be one or more of the following:

casLib="string"

specifies the caslib for the filter table. By default, the active caslib is used.

dataSourceOptions={adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}

specifies data source options.

Aliases options
dataSource

For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).

importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the filter table.

vars={{casinvardesc-1} <, {casinvardesc-2}, ...>}

specifies the variable names to use from the filter table.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the data from the filter table.

userDefinedPartition=true | false

when set to True, includes a user-defined partition for training and scoring.

Default false
validationPartitionFraction=double

specifies the size of a single validation partition to use in order to assess model fit error as the tuning objective.

Alias validateFraction
Default 0.3
Range 0.01–0.99

tuningParameters={{autotuneTuningParmDefinition-1} <, {autotuneTuningParmDefinition-2}, ...>}

specifies a list of custom tuning parameters and their definitions.

The autotuneTuningParmDefinition value can be one or more of the following:

exclude=true | false

when set to True, excludes a tuning parameter from the tuning process.

Default false
initValue=integer | 64-bit-integer | double | true | false | "string"

specifies the initial (default) value of a tuning parameter.

lowerBound=double

specifies the lower bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias LB
namePath="string"

specifies the name path of a tuning parameter. For a nested action parameter, this parameter specifies a dot-separated path that includes all its parent parameter names. For a top-level action parameter, this parameter is simply the name of the parameter.

Alias name
upperBound=double

specifies the upper bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias UB
valueList={any-list-or-data-type-1 <, any-list-or-data-type-2, ...>}

specifies the list of values to try for a tuning parameter. If this parameter is specified, the lower bound and upper bound values of the tuning parameter are ignored.

useParameters="COMBINED" | "CUSTOM" | "STANDARD"

specifies whether to use only standard tuning parameters, only custom tuning parameters, or a combination of both.

Default COMBINED

tuneLabelSpread Action

Automatically adjusts Label Spreading technique parameters to tune for best objective metric value.

Python Syntax

results=s.autotune.tuneLabelSpread(
required parameter trainOptions={"key-1":{any-list-or-data-type-1} <, "key-2":{any-list-or-data-type-2}, ...>},
tunerOptions={
"constraints":[{
"lowerBound":double,
required parameter "metric":"ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "LOSSFUNCTION" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME",
"upperBound":double
}<, {...}>],
"enableLocalSearch":True | False,
"evaluationHistory":True | False,
"foldColumn":"string",
"historyTable":{
"appendLookupData":True | False
"liveUpdate":True | False
required parameter "table":{
"caslib":"string"
"compress":True | False
"indexVars":["variable-name-1" <, "variable-name-2", ...>]
"label":"string"
"lifetime":64-bit-integer
"maxMemSize":64-bit-integer
"memoryFormat":"DVR" | "INHERIT" | "STANDARD"
"name":"table-name"
"promote":True | False
"replace":True | False
"replication":integer
"tableRedistUpPolicy":"DEFER" | "NOREDIST" | "REBALANCE"
"threadBlockSize":64-bit-integer
"timeStamp":"string"
"where":["string-1" <, "string-2", ...>]
}
},
"logLevel":integer,
"lookupTable":{
"selectInitialPoint":True | False
"table":{
"caslib":"string"
"computedOnDemand":True | False
"computedVars":[{
"format":"string",
"formattedLength":integer,
"label":"string",
required parameter "name":"variable-name",
"nfd":integer,
"nfl":integer
}<, {...}>]
"dataSourceOptions":{"key-1":{any-list-or-data-type-1} <, "key-2":{any-list-or-data-type-2}, ...>}
"importOptions":{"fileType":"ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter "name":"table-name"
"singlePass":True | False
"vars":[{
"format":"string",
"formattedLength":integer,
"label":"string",
required parameter "name":"variable-name",
"nfd":integer,
"nfl":integer
}<, {...}>]
"where":"where-expression"
"whereTable":{
"casLib":"string"
"dataSourceOptions":{adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}
"importOptions":{"fileType":"ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter "name":"table-name"
"vars":[{
"format":"string",
"formattedLength":integer,
"label":"string",
required parameter "name":"variable-name",
"nfd":integer,
"nfl":integer
}<, {...}>]
"where":"where-expression"
}
}
},
"maxEvals":integer,
"maxIters":integer,
"maxTime":double,
"maxTrainTime":64-bit-integer,
"nCrossValFolds":integer,
"nParallel":integer,
"nSubsessionWorkers":integer,
"popSize":integer,
"sampleSize":integer,
"seed":integer,
"shuffleGridPoints":True | False,
"targetEvent":"string",
"userConfigurations":{
"caslib":"string"
"computedOnDemand":True | False
"computedVars":[{
"format":"string",
"formattedLength":integer,
"label":"string",
required parameter "name":"variable-name",
"nfd":integer,
"nfl":integer
}<, {...}>]
"dataSourceOptions":{"key-1":{any-list-or-data-type-1} <, "key-2":{any-list-or-data-type-2}, ...>}
"importOptions":{"fileType":"ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter "name":"table-name"
"singlePass":True | False
"vars":[{
"format":"string",
"formattedLength":integer,
"label":"string",
required parameter "name":"variable-name",
"nfd":integer,
"nfl":integer
}<, {...}>]
"where":"where-expression"
"whereTable":{
"casLib":"string"
"dataSourceOptions":{adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}
"importOptions":{"fileType":"ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}
required parameter "name":"table-name"
"vars":[{
"format":"string",
"formattedLength":integer,
"label":"string",
required parameter "name":"variable-name",
"nfd":integer,
"nfl":integer
}<, {...}>]
"where":"where-expression"
}
},
"userDefinedPartition":True | False,
},
tuningParameters=[{
"exclude":True | False,
"initValue":integer | 64-bit-integer | double | True | False | "string",
"lowerBound":double,
"namePath":"string",
"upperBound":double,
"valueList":{any-list-or-data-type-1 <, any-list-or-data-type-2, ...>}
}<, {...}>],
useParameters="COMBINED" | "CUSTOM" | "STANDARD"
)
indicates a required parameter

Summary: Input and Output Tables

If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.

Parameters for Reading Input Tables

Parameter

Subparameter

Description

 tunerOptions

lookupTable (and nested parameter table), userConfigurations

specifies a list of parameters for adjusting how the tuner behaves.

Parameters for Creating Output Tables

Parameter

Subparameter

Description

 tunerOptions

historyTable (and nested parameter table)

specifies a list of parameters for adjusting how the tuner behaves.

Parameter Descriptions

* trainOptions={"key-1":{any-list-or-data-type-1} <, "key-2":{any-list-or-data-type-2}, ...>}

specifies a list of parameters to be used by the labelSpread action in the graphSemiSupLearn action set; for more information, see the parameters for that action.

tunerOptions={tunerOptions}

specifies a list of parameters for adjusting how the tuner behaves.

The tunerOptions value can be one or more of the following:

"constraints":[{autotuneConstraintDefinition-1} <, {autotuneConstraintDefinition-2}, ...>]

specifies a list of constraints and their definitions.

The autotuneConstraintDefinition value can be one or more of the following:

"lowerBound":double

specifies the lower bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias LB
* "metric":"ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "LOSSFUNCTION" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies a metric to constrain during tuning.

ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

LOSSFUNCTION

specifies the Loss Function as the constraint metric.

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

"upperBound":double

specifies the upper bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias UB
"enableLocalSearch":True | False

when set to True, local search optimization is enabled for model tuning.

Default True
"evaluationHistory":True | False

when set to True, creates a table of every evaluation (every model configuration) and returns those tables to the client.

Default True
"foldColumn":"string"

specifies a column in the data table in which the cross validation fold value is indicated.

"historyTable":{history_table}

specifies the CAS table that is created by the autotune action that contains all evaluation history data points.

The history_table value can be one or more of the following:

"appendLookupData":True | False

when set to True, specifies that all unused lookup data points be appended to the table specified in the historyTable parameter.

Default False
"liveUpdate":True | False

when set to True, updates the table specified in the historyTable parameter after every evaluation. When the liveUpdate parameter is set to True, the table is promoted even if the promote parameter is not set to True, in order to allow other sessions to access the table.

Default False
* "table":{casouttable}

For more information about specifying the table parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).

"logLevel":integer

specifies the level of log messages to be written: no logs (0), initialization and completion logs (1), setup and iteration results (2), or detailed evaluation results (3).

Default 2
Range 0–3
"lookupTable":{lookup_table}

specifies the CAS table used by the autotune action for evaluation lookup.

The lookup_table value can be one or more of the following:

"selectInitialPoint":True | False

when set to True, selects the best point from the lookup table and uses it as the initial (default) point.

Alias initPoint
Default False
"table":{castable}

specifies the settings for an input table.

The castable value can be one or more of the following:

"caslib":"string"

specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.

"computedOnDemand":True | False

when set to True, creates the computed variables when the table is loaded instead of when the action begins.

Alias compOnDemand
Default False
"computedVars":[{casinvardesc-1} <, {casinvardesc-2}, ...>]

specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.

Alias compVars

The casinvardesc value can be one or more of the following:

"format":"string"

specifies the format to apply to the variable.

"formattedLength":integer

specifies the length of the format field plus the length of the format precision.

"label":"string"

specifies the descriptive label for the variable.

* "name":"variable-name"

specifies the name for the variable.

"nfd":integer

specifies the length of the format precision.

"nfl":integer

specifies the length of the format field.

"computedVarsProgram":"string"

specifies an expression for each computed variable that you include in the computedVars parameter.

Alias compPgm
"dataSourceOptions":{"key-1":{any-list-or-data-type-1} <, "key-2":{any-list-or-data-type-2}, ...>}

specifies data source options.

Aliases options
dataSource
"groupByMode":"NOSORT" | "REDISTRIBUTE"

specifies how to create groups.

Default NOSORT
NOSORT

groups the data without sorting on each machine, and then groups the data again on the controller.

REDISTRIBUTE

transfers rows between nodes to guarantee ordering within groups. This method is slower.

"importOptions":{"fileType":"ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import_

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* "name":"table-name"

specifies the name of the input table.

"singlePass":True | False

when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.

Default False
"vars":[{casinvardesc-1} <, {casinvardesc-2}, ...>]

specifies the variables to use in the action.

The casinvardesc value can be one or more of the following:

"format":"string"

specifies the format to apply to the variable.

"formattedLength":integer

specifies the length of the format field plus the length of the format precision.

"label":"string"

specifies the descriptive label for the variable.

* "name":"variable-name"

specifies the name for the variable.

"nfd":integer

specifies the length of the format precision.

"nfl":integer

specifies the length of the format field.

"where":"where-expression"

specifies an expression for subsetting the input data.

"whereTable":{groupbytable}

specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.

The groupbytable value can be one or more of the following:

"casLib":"string"

specifies the caslib for the filter table. By default, the active caslib is used.

"dataSourceOptions":{adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}

specifies data source options.

Aliases options
dataSource

For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).

"importOptions":{"fileType":"ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import_

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* "name":"table-name"

specifies the name of the filter table.

"vars":[{casinvardesc-1} <, {casinvardesc-2}, ...>]

specifies the variable names to use from the filter table.

The casinvardesc value can be one or more of the following:

"format":"string"

specifies the format to apply to the variable.

"formattedLength":integer

specifies the length of the format field plus the length of the format precision.

"label":"string"

specifies the descriptive label for the variable.

* "name":"variable-name"

specifies the name for the variable.

"nfd":integer

specifies the length of the format precision.

"nfl":integer

specifies the length of the format field.

"where":"where-expression"

specifies an expression for subsetting the data from the filter table.

"maxBayesianModelSize":integer

specifies the maximum number of points in the kriging model. This parameter is ignored unless the value "BAYESIAN" is specified in the searchMethod parameter.

Alias maxBayesianSize
Default 500
Minimum value 10
"maxEvals":integer

specifies the maximum number of objective evaluations (model configurations) to be trained and scored.

Alias maxEvaluations
Default 0
Minimum value 0
"maxIters":integer

specifies the maximum number of iterations of optimization for model tuning. At each iteration, the number of evaluated models is less than or equal to the value of the popSize parameter.

Alias maxIterations
Default 5
Minimum value 1
"maxTime":double

specifies the maximum time allowed for tuning. The actual time might exceed this value by the amount of time required to train and score a model configuration.

Default 36000
Minimum value 1
"maxTrainTime":64-bit-integer

specifies the maximum time allowed for a single model training. The model training is terminated if it exceeds this time, and the objective value is set to missing.

Minimum value 0
"nConvergenceIterations":integer

specifies the number of convergence iterations to use for terminating the tuning.

Alias nConvIters
Default 4
Minimum value 1
"nCrossValFolds":integer

specifies the number of folds to use for cross validation to assess model fit error as the tuning objective.

Alias nFolds
Default 5
Minimum value 2
"nParallel":integer

specifies the number of parallel subsessions to be used for parallel evaluation of model configurations.

Default 0
"nSubsessionWorkers":integer

specifies the number of workers to be used by each subsession for parallel evaluation of model configurations.

Alias nSubWorkers
Default 0
Minimum value 0
"objective":"ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies the metric to use as the objective during tuning.

ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

"popSize":integer

specifies the maximum number of model configurations to evaluate at each iteration.

Alias populationSize
Default 10
Minimum value 2
"sampleSize":integer

specifies the number of model configurations to evaluate when the value of the searchMethod parameter is "LHS" or "RANDOM".

Default 50
Minimum value 2
"searchMethod":"BAYESIAN" | "GA" | "GRID" | "LHS" | "RANDOM"

specifies the search method to use during tuning.

Default GA
BAYESIAN

uses the Bayesian search method.

GA

uses the genetic algorithm search method.

GRID

uses the grid search method.

LHS

uses the Latin hypercube sample search method.

RANDOM

uses the random search method.

"secondObjective":"ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "NONE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies the metric to use as the second objective during tuning.

Default NONE
ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

NONE

specifies no second objective to be used for tuning.

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

"seed":integer

specifies the seed to use for selecting training data and validation data for a single validation partition or fold number for each observation for cross validation.

Default 0
"shuffleGridPoints":True | False

when set to True, shuffles the points that are generated by the grid search method before execution.

Default True
"targetEvent":"string"

specifies the name of the nominal target event to use for tuning.

"trainPartitionFraction":double

specifies the size of a single data partition to use for model training during tuning.

Alias trainFraction
Range 0.01–0.99
"userConfigurations":{castable}

specifies the CAS table that contains user configurations to be used by the autotune action.

The castable value can be one or more of the following:

"caslib":"string"

specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.

"computedOnDemand":True | False

when set to True, creates the computed variables when the table is loaded instead of when the action begins.

Alias compOnDemand
Default False
"computedVars":[{casinvardesc-1} <, {casinvardesc-2}, ...>]

specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.

Alias compVars

The casinvardesc value can be one or more of the following:

"format":"string"

specifies the format to apply to the variable.

"formattedLength":integer

specifies the length of the format field plus the length of the format precision.

"label":"string"

specifies the descriptive label for the variable.

* "name":"variable-name"

specifies the name for the variable.

"nfd":integer

specifies the length of the format precision.

"nfl":integer

specifies the length of the format field.

"computedVarsProgram":"string"

specifies an expression for each computed variable that you include in the computedVars parameter.

Alias compPgm
"dataSourceOptions":{"key-1":{any-list-or-data-type-1} <, "key-2":{any-list-or-data-type-2}, ...>}

specifies data source options.

Aliases options
dataSource
"groupByMode":"NOSORT" | "REDISTRIBUTE"

specifies how to create groups.

Default NOSORT
NOSORT

groups the data without sorting on each machine, and then groups the data again on the controller.

REDISTRIBUTE

transfers rows between nodes to guarantee ordering within groups. This method is slower.

"importOptions":{"fileType":"ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import_

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* "name":"table-name"

specifies the name of the input table.

"singlePass":True | False

when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.

Default False
"vars":[{casinvardesc-1} <, {casinvardesc-2}, ...>]

specifies the variables to use in the action.

The casinvardesc value can be one or more of the following:

"format":"string"

specifies the format to apply to the variable.

"formattedLength":integer

specifies the length of the format field plus the length of the format precision.

"label":"string"

specifies the descriptive label for the variable.

* "name":"variable-name"

specifies the name for the variable.

"nfd":integer

specifies the length of the format precision.

"nfl":integer

specifies the length of the format field.

"where":"where-expression"

specifies an expression for subsetting the input data.

"whereTable":{groupbytable}

specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.

The groupbytable value can be one or more of the following:

"casLib":"string"

specifies the caslib for the filter table. By default, the active caslib is used.

"dataSourceOptions":{adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}

specifies data source options.

Aliases options
dataSource

For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).

"importOptions":{"fileType":"ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}

specifies the settings for reading a table from a data source.

Alias import_

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* "name":"table-name"

specifies the name of the filter table.

"vars":[{casinvardesc-1} <, {casinvardesc-2}, ...>]

specifies the variable names to use from the filter table.

The casinvardesc value can be one or more of the following:

"format":"string"

specifies the format to apply to the variable.

"formattedLength":integer

specifies the length of the format field plus the length of the format precision.

"label":"string"

specifies the descriptive label for the variable.

* "name":"variable-name"

specifies the name for the variable.

"nfd":integer

specifies the length of the format precision.

"nfl":integer

specifies the length of the format field.

"where":"where-expression"

specifies an expression for subsetting the data from the filter table.

"userDefinedPartition":True | False

when set to True, includes a user-defined partition for training and scoring.

Default False
"validationPartitionFraction":double

specifies the size of a single validation partition to use in order to assess model fit error as the tuning objective.

Alias validateFraction
Default 0.3
Range 0.01–0.99

tuningParameters=[{autotuneTuningParmDefinition-1} <, {autotuneTuningParmDefinition-2}, ...>]

specifies a list of custom tuning parameters and their definitions.

The autotuneTuningParmDefinition value can be one or more of the following:

"exclude":True | False

when set to True, excludes a tuning parameter from the tuning process.

Default False
"initValue":integer | 64-bit-integer | double | True | False | "string"

specifies the initial (default) value of a tuning parameter.

"lowerBound":double

specifies the lower bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias LB
"namePath":"string"

specifies the name path of a tuning parameter. For a nested action parameter, this parameter specifies a dot-separated path that includes all its parent parameter names. For a top-level action parameter, this parameter is simply the name of the parameter.

Alias name
"upperBound":double

specifies the upper bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias UB
"valueList":{any-list-or-data-type-1 <, any-list-or-data-type-2, ...>}

specifies the list of values to try for a tuning parameter. If this parameter is specified, the lower bound and upper bound values of the tuning parameter are ignored.

useParameters="COMBINED" | "CUSTOM" | "STANDARD"

specifies whether to use only standard tuning parameters, only custom tuning parameters, or a combination of both.

Default COMBINED

tuneLabelSpread Action

Automatically adjusts Label Spreading technique parameters to tune for best objective metric value.

R Syntax

results <– cas.autotune.tuneLabelSpread(s,
required parameter trainOptions=list(key-1=list(any-list-or-data-type-1) <, key-2=list(any-list-or-data-type-2), ...>),
tunerOptions=list(
constraints=list( list(
lowerBound=double,
required parameter metric="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "LOSSFUNCTION" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME",
upperBound=double
) <, list(...)>),
enableLocalSearch=TRUE | FALSE,
evaluationHistory=TRUE | FALSE,
foldColumn="string",
historyTable=list(
appendLookupData=TRUE | FALSE
liveUpdate=TRUE | FALSE
required parameter table=list(
caslib="string"
compress=TRUE | FALSE
indexVars=list("variable-name-1" <, "variable-name-2", ...>)
label="string"
lifetime=64-bit-integer
maxMemSize=64-bit-integer
memoryFormat="DVR" | "INHERIT" | "STANDARD"
name="table-name"
promote=TRUE | FALSE
replace=TRUE | FALSE
replication=integer
tableRedistUpPolicy="DEFER" | "NOREDIST" | "REBALANCE"
threadBlockSize=64-bit-integer
timeStamp="string"
where=list("string-1" <, "string-2", ...>)
)
),
logLevel=integer,
lookupTable=list(
selectInitialPoint=TRUE | FALSE
table=list(
caslib="string"
computedOnDemand=TRUE | FALSE
computedVars=list( list(
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
) <, list(...)>)
dataSourceOptions=list(key-1=list(any-list-or-data-type-1) <, key-2=list(any-list-or-data-type-2), ...>)
importOptions=list(fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters)
required parameter name="table-name"
singlePass=TRUE | FALSE
vars=list( list(
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
) <, list(...)>)
where="where-expression"
whereTable=list(
casLib="string"
dataSourceOptions=list(adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters)
importOptions=list(fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters)
required parameter name="table-name"
vars=list( list(
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
) <, list(...)>)
where="where-expression"
)
)
),
maxEvals=integer,
maxIters=integer,
maxTime=double,
maxTrainTime=64-bit-integer,
nCrossValFolds=integer,
nParallel=integer,
popSize=integer,
sampleSize=integer,
seed=integer,
shuffleGridPoints=TRUE | FALSE,
targetEvent="string",
userConfigurations=list(
caslib="string"
computedOnDemand=TRUE | FALSE
computedVars=list( list(
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
) <, list(...)>)
dataSourceOptions=list(key-1=list(any-list-or-data-type-1) <, key-2=list(any-list-or-data-type-2), ...>)
importOptions=list(fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters)
required parameter name="table-name"
singlePass=TRUE | FALSE
vars=list( list(
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
) <, list(...)>)
where="where-expression"
whereTable=list(
casLib="string"
dataSourceOptions=list(adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters)
importOptions=list(fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters)
required parameter name="table-name"
vars=list( list(
format="string",
formattedLength=integer,
label="string",
required parameter name="variable-name",
nfd=integer,
nfl=integer
) <, list(...)>)
where="where-expression"
)
),
userDefinedPartition=TRUE | FALSE,
),
tuningParameters=list( list(
exclude=TRUE | FALSE,
initValue=integer | 64-bit-integer | double | TRUE | FALSE | "string",
lowerBound=double,
namePath="string",
upperBound=double,
valueList=list(any-list-or-data-type-1 <, any-list-or-data-type-2, ...>)
) <, list(...)>),
useParameters="COMBINED" | "CUSTOM" | "STANDARD"
)
indicates a required parameter

Summary: Input and Output Tables

If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.

Parameters for Reading Input Tables

Parameter

Subparameter

Description

 tunerOptions

lookupTable (and nested parameter table), userConfigurations

specifies a list of parameters for adjusting how the tuner behaves.

Parameters for Creating Output Tables

Parameter

Subparameter

Description

 tunerOptions

historyTable (and nested parameter table)

specifies a list of parameters for adjusting how the tuner behaves.

Parameter Descriptions

* trainOptions=list(key-1=list(any-list-or-data-type-1) <, key-2=list(any-list-or-data-type-2), ...>)

specifies a list of parameters to be used by the labelSpread action in the graphSemiSupLearn action set; for more information, see the parameters for that action.

tunerOptions=list(tunerOptions)

specifies a list of parameters for adjusting how the tuner behaves.

The tunerOptions value can be one or more of the following:

constraints=list( list(autotuneConstraintDefinition-1) <, list(autotuneConstraintDefinition-2), ...>)

specifies a list of constraints and their definitions.

The autotuneConstraintDefinition value can be one or more of the following:

lowerBound=double

specifies the lower bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias LB
* metric="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "LOSSFUNCTION" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies a metric to constrain during tuning.

ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

LOSSFUNCTION

specifies the Loss Function as the constraint metric.

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

upperBound=double

specifies the upper bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias UB
enableLocalSearch=TRUE | FALSE

when set to True, local search optimization is enabled for model tuning.

Default TRUE
evaluationHistory=TRUE | FALSE

when set to True, creates a table of every evaluation (every model configuration) and returns those tables to the client.

Default TRUE
foldColumn="string"

specifies a column in the data table in which the cross validation fold value is indicated.

historyTable=list(history_table)

specifies the CAS table that is created by the autotune action that contains all evaluation history data points.

The history_table value can be one or more of the following:

appendLookupData=TRUE | FALSE

when set to True, specifies that all unused lookup data points be appended to the table specified in the historyTable parameter.

Default FALSE
liveUpdate=TRUE | FALSE

when set to True, updates the table specified in the historyTable parameter after every evaluation. When the liveUpdate parameter is set to True, the table is promoted even if the promote parameter is not set to True, in order to allow other sessions to access the table.

Default FALSE
* table=list(casouttable)

For more information about specifying the table parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).

logLevel=integer

specifies the level of log messages to be written: no logs (0), initialization and completion logs (1), setup and iteration results (2), or detailed evaluation results (3).

Default 2
Range 0–3
lookupTable=list(lookup_table)

specifies the CAS table used by the autotune action for evaluation lookup.

The lookup_table value can be one or more of the following:

selectInitialPoint=TRUE | FALSE

when set to True, selects the best point from the lookup table and uses it as the initial (default) point.

Alias initPoint
Default FALSE
table=list(castable)

specifies the settings for an input table.

The castable value can be one or more of the following:

caslib="string"

specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.

computedOnDemand=TRUE | FALSE

when set to True, creates the computed variables when the table is loaded instead of when the action begins.

Alias compOnDemand
Default FALSE
computedVars=list( list(casinvardesc-1) <, list(casinvardesc-2), ...>)

specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.

Alias compVars

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

computedVarsProgram="string"

specifies an expression for each computed variable that you include in the computedVars parameter.

Alias compPgm
dataSourceOptions=list(key-1=list(any-list-or-data-type-1) <, key-2=list(any-list-or-data-type-2), ...>)

specifies data source options.

Aliases options
dataSource
groupByMode="NOSORT" | "REDISTRIBUTE"

specifies how to create groups.

Default NOSORT
NOSORT

groups the data without sorting on each machine, and then groups the data again on the controller.

REDISTRIBUTE

transfers rows between nodes to guarantee ordering within groups. This method is slower.

importOptions=list(fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters)

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the input table.

singlePass=TRUE | FALSE

when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.

Default FALSE
vars=list( list(casinvardesc-1) <, list(casinvardesc-2), ...>)

specifies the variables to use in the action.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the input data.

whereTable=list(groupbytable)

specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.

The groupbytable value can be one or more of the following:

casLib="string"

specifies the caslib for the filter table. By default, the active caslib is used.

dataSourceOptions=list(adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters)

specifies data source options.

Aliases options
dataSource

For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).

importOptions=list(fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters)

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the filter table.

vars=list( list(casinvardesc-1) <, list(casinvardesc-2), ...>)

specifies the variable names to use from the filter table.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the data from the filter table.

maxBayesianModelSize=integer

specifies the maximum number of points in the kriging model. This parameter is ignored unless the value "BAYESIAN" is specified in the searchMethod parameter.

Alias maxBayesianSize
Default 500
Minimum value 10
maxEvals=integer

specifies the maximum number of objective evaluations (model configurations) to be trained and scored.

Alias maxEvaluations
Default 0
Minimum value 0
maxIters=integer

specifies the maximum number of iterations of optimization for model tuning. At each iteration, the number of evaluated models is less than or equal to the value of the popSize parameter.

Alias maxIterations
Default 5
Minimum value 1
maxTime=double

specifies the maximum time allowed for tuning. The actual time might exceed this value by the amount of time required to train and score a model configuration.

Default 36000
Minimum value 1
maxTrainTime=64-bit-integer

specifies the maximum time allowed for a single model training. The model training is terminated if it exceeds this time, and the objective value is set to missing.

Minimum value 0
nConvergenceIterations=integer

specifies the number of convergence iterations to use for terminating the tuning.

Alias nConvIters
Default 4
Minimum value 1
nCrossValFolds=integer

specifies the number of folds to use for cross validation to assess model fit error as the tuning objective.

Alias nFolds
Default 5
Minimum value 2
nParallel=integer

specifies the number of parallel subsessions to be used for parallel evaluation of model configurations.

Default 0
nSubsessionWorkers=integer

specifies the number of workers to be used by each subsession for parallel evaluation of model configurations.

Alias nSubWorkers
Default 0
Minimum value 0
objective="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies the metric to use as the objective during tuning.

ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

popSize=integer

specifies the maximum number of model configurations to evaluate at each iteration.

Alias populationSize
Default 10
Minimum value 2
sampleSize=integer

specifies the number of model configurations to evaluate when the value of the searchMethod parameter is "LHS" or "RANDOM".

Default 50
Minimum value 2
searchMethod="BAYESIAN" | "GA" | "GRID" | "LHS" | "RANDOM"

specifies the search method to use during tuning.

Default GA
BAYESIAN

uses the Bayesian search method.

GA

uses the genetic algorithm search method.

GRID

uses the grid search method.

LHS

uses the Latin hypercube sample search method.

RANDOM

uses the random search method.

secondObjective="ASE" | "AUC" | "CUMULATIVEGAIN" | "F05" | "F1" | "FNR" | "FPR" | "GAMMA" | "GINI" | "HITRATE" | "KS" | "MAE" | "MCE" | "MCLL" | "MEANAVEPRECISION" | "MEANRECIPRANK" | "MISC" | "MSE" | "MSLE" | "NONE" | "RASE" | "RECALL" | "RMAE" | "RMSLE" | "SCORINGTIME" | "TAU" | "TNR" | "TPR" | "TRAININGTIME"

specifies the metric to use as the second objective during tuning.

Default NONE
ASE

specifies the average square error as the objective function or the constraint metric.

AUC

specifies area under the curve as the objective function or the constraint metric (nominal targets only).

CUMULATIVEGAIN

specifies weighted normalized discounted cumulative gain as the objective function or the constraint metric (recommender model types only).

F05

specifies the F0.5 coefficient as the objective function or the constraint metric (nominal targets only).

F1

specifies the F1 coefficient as the objective function or the constraint metric (nominal targets only).

FNR

specifies false negative rate as the objective function or the constraint metric (nominal targets only).

FPR

specifies false positive rate as the objective function or the constraint metric (nominal targets only).

GAMMA

specifies the gamma coefficient as the objective function or the constraint metric (nominal targets only).

GINI

specifies the Gini coefficient as the objective function or the constraint metric (nominal targets only).

HITRATE

specifies hit rate as the objective function or the constraint metric (recommender model types only).

KS

specifies the Kolmogorov-Smirnov coefficient as the objective function or the constraint metric (nominal targets only).

MAE

specifies the mean absolute error as the objective function or the constraint metric (interval targets only).

MCE

specifies the misclassification rate as the objective function or the constraint metric (nominal targets only).

MCLL

specifies the multiclass log loss as the objective function or the constraint metric (nominal targets only).

MEANAVEPRECISION

specifies mean average precision as the objective function or the constraint metric (recommender model types only).

MEANRECIPRANK

specifies mean reciprocal rank as the objective function or the constraint metric (recommender model types only).

MISC

specifies the misclassification error percentage as the objective function or the constraint metric (nominal targets only).

MSE

specifies the mean square error as the objective function or the constraint metric (interval targets only).

MSLE

specifies the mean square logarithmic error as the objective function or the constraint metric (interval targets only).

NONE

specifies no second objective to be used for tuning.

RASE

specifies the root average square error as the objective function or the constraint metric.

RECALL

specifies recall value as the objective function or the constraint metric (recommender model types only).

RMAE

specifies the root mean absolute error as the objective function or the constraint metric (interval targets only).

RMSLE

specifies the root mean square logarithmic error as the objective function or the constraint metric (interval targets only).

SCORINGTIME

specifies model scoring time as the objective for tuning.

TAU

specifies the tau coefficient as the objective function or the constraint metric (nominal targets only).

TNR

specifies true negative rate as the objective function or the constraint metric (nominal targets only).

TPR

specifies true positive rate as the objective function or the constraint metric (nominal targets only).

TRAININGTIME

specifies model training time as the objective for tuning.

seed=integer

specifies the seed to use for selecting training data and validation data for a single validation partition or fold number for each observation for cross validation.

Default 0
shuffleGridPoints=TRUE | FALSE

when set to True, shuffles the points that are generated by the grid search method before execution.

Default TRUE
targetEvent="string"

specifies the name of the nominal target event to use for tuning.

trainPartitionFraction=double

specifies the size of a single data partition to use for model training during tuning.

Alias trainFraction
Range 0.01–0.99
userConfigurations=list(castable)

specifies the CAS table that contains user configurations to be used by the autotune action.

The castable value can be one or more of the following:

caslib="string"

specifies the caslib for the input table that you want to use with the action. By default, the active caslib is used. Specify a value only if you need to access a table from a different caslib.

computedOnDemand=TRUE | FALSE

when set to True, creates the computed variables when the table is loaded instead of when the action begins.

Alias compOnDemand
Default FALSE
computedVars=list( list(casinvardesc-1) <, list(casinvardesc-2), ...>)

specifies the names of the computed variables to create. Specify an expression for each variable in the computedVarsProgram parameter. If you do not specify this parameter, then all variables from computedVarsProgram are automatically included.

Alias compVars

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

computedVarsProgram="string"

specifies an expression for each computed variable that you include in the computedVars parameter.

Alias compPgm
dataSourceOptions=list(key-1=list(any-list-or-data-type-1) <, key-2=list(any-list-or-data-type-2), ...>)

specifies data source options.

Aliases options
dataSource
groupByMode="NOSORT" | "REDISTRIBUTE"

specifies how to create groups.

Default NOSORT
NOSORT

groups the data without sorting on each machine, and then groups the data again on the controller.

REDISTRIBUTE

transfers rows between nodes to guarantee ordering within groups. This method is slower.

importOptions=list(fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters)

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the input table.

singlePass=TRUE | FALSE

when set to True, does not create a transient table on the server. Setting this parameter to True can be efficient, but the data might not have stable ordering upon repeated runs.

Default FALSE
vars=list( list(casinvardesc-1) <, list(casinvardesc-2), ...>)

specifies the variables to use in the action.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the input data.

whereTable=list(groupbytable)

specifies an input table that contains rows to use as a WHERE filter. If the vars parameter is not specified, then all the variable names that are common to the input table and the filtering table are used to find matching rows. If the where parameter for the input table and this parameter are specified, then this filtering table is applied first.

The groupbytable value can be one or more of the following:

casLib="string"

specifies the caslib for the filter table. By default, the active caslib is used.

dataSourceOptions=list(adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters)

specifies data source options.

Aliases options
dataSource

For more information about specifying the dataSourceOptions parameter, see the common dataSourceOptions parameter (Appendix A: Common Parameters).

importOptions=list(fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters)

specifies the settings for reading a table from a data source.

Alias import

For more information about specifying the importOptions parameter, see the common importOptions parameter (Appendix A: Common Parameters).

* name="table-name"

specifies the name of the filter table.

vars=list( list(casinvardesc-1) <, list(casinvardesc-2), ...>)

specifies the variable names to use from the filter table.

The casinvardesc value can be one or more of the following:

format="string"

specifies the format to apply to the variable.

formattedLength=integer

specifies the length of the format field plus the length of the format precision.

label="string"

specifies the descriptive label for the variable.

* name="variable-name"

specifies the name for the variable.

nfd=integer

specifies the length of the format precision.

nfl=integer

specifies the length of the format field.

where="where-expression"

specifies an expression for subsetting the data from the filter table.

userDefinedPartition=TRUE | FALSE

when set to True, includes a user-defined partition for training and scoring.

Default FALSE
validationPartitionFraction=double

specifies the size of a single validation partition to use in order to assess model fit error as the tuning objective.

Alias validateFraction
Default 0.3
Range 0.01–0.99

tuningParameters=list( list(autotuneTuningParmDefinition-1) <, list(autotuneTuningParmDefinition-2), ...>)

specifies a list of custom tuning parameters and their definitions.

The autotuneTuningParmDefinition value can be one or more of the following:

exclude=TRUE | FALSE

when set to True, excludes a tuning parameter from the tuning process.

Default FALSE
initValue=integer | 64-bit-integer | double | TRUE | FALSE | "string"

specifies the initial (default) value of a tuning parameter.

lowerBound=double

specifies the lower bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias LB
namePath="string"

specifies the name path of a tuning parameter. For a nested action parameter, this parameter specifies a dot-separated path that includes all its parent parameter names. For a top-level action parameter, this parameter is simply the name of the parameter.

Alias name
upperBound=double

specifies the upper bound of the range for a tuning parameter. This parameter can be used only for numeric tuning parameters, and is ignored if a value list is specified for the tuning parameter.

Alias UB
valueList=list(any-list-or-data-type-1 <, any-list-or-data-type-2, ...>)

specifies the list of values to try for a tuning parameter. If this parameter is specified, the lower bound and upper bound values of the tuning parameter are ignored.

useParameters="COMBINED" | "CUSTOM" | "STANDARD"

specifies whether to use only standard tuning parameters, only custom tuning parameters, or a combination of both.

Default COMBINED
Last updated: November 23, 2025