Provides actions for performing principal component analysis
Extracts principal components by using the eigenvalue decomposition method.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
required parametertable |
— |
specifies the settings for an input table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
casOut |
writes SAS DATA step code for computing predicted values of the fitted model. |
|
|
required parametercasOut |
specifies the output table to be created to contain various statistics, including means, standard deviations, eigenvalues, and eigenvectors. |
|
|
required parametercasOut |
specifies the output table to be created to contain observationwise statistics. If you do not specify any statistic, then only the principal component scores are included. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
— |
specifies the output table to be created to save model fit information that you can use for scoring. |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Aliases | attribute |
|---|---|
| attr |
writes SAS DATA step code for computing predicted values of the fitted model.
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
when set to True, computes the principal components from the covariance matrix.
| Alias | covariance |
|---|---|
| Default | FALSE |
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies a numeric variable that contains the frequency of occurrence of each observation.
specifies the settings for graphics processing units (GPUs). If you specify the groupBy subparameter in the table parameter, the gpu parameter is ignored. If no GPU is available, the CPU is used to perform model computation.
when set to True, enables the action to use GPUs to perform model computation on GPU-equipped machines (one GPU is used on each machine).
| Default | FALSE |
|---|
suppresses the analysis if the number of BY groups exceeds the specified value.
| Minimum value | 1 |
|---|
specifies the variables to be analyzed. For the NIPALS, ITERGS, and EIG methods, you can analyze only numeric variables. If you omit the parameter, all numeric variables that are not specified in other parameters are analyzed. You must specify either INPUTS or MODEL for the RANDOMPCA method, but you can use both numeric and categorical variables.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Aliases | input |
|---|---|
| vars | |
| var |
specifies the number of principal components to be computed. If the value is 0, all principal components are computed.
| Default | 0 |
|---|---|
| Minimum value | 0 |
when set to True, omits the intercept from the model.
| Default | FALSE |
|---|
specifies the output table to be created to contain observationwise statistics. If you do not specify any statistic, then only the principal component scores are included.
The eigOutput value can be one or more of the following:
specifies the output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
copies one or more variables from the input table to the output table.
| Alias | copyVar |
|---|
requests residuals for each analysis variable. The residuals are included only if you also specify the partial parameter. If set to an empty string, the prefix R is used for naming the output variables.
| Aliases | r |
|---|---|
| resid |
specifies principal component scores for each principal component. If the value is an empty string, the prefix Score is used to name the output variables.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies the output table to be created to contain various statistics, including means, standard deviations, eigenvalues, and eigenvectors.
The eigOutstat value can be one or more of the following:
specifies the output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a prefix for naming the residual variables.
| Aliases | parPrefix |
|---|---|
| pPrefix | |
| Default | "R" |
specifies numerical variables to be partialed out if you want to analyze a partial correlation or covariance matrix.
specifies a prefix for naming the principal components.
| Default | "Prin" |
|---|
specifies the singularity criterion.
| Alias | sing |
|---|---|
| Default | 1E-08 |
| Range | 0–1 |
when set to True, standardizes to unit variance the principal component scores in the output table that is specified in the output parameter.
| Alias | standard |
|---|---|
| Default | FALSE |
specifies the output table to be created to save model fit information that you can use for scoring.
For more information about specifying the store parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | saveState |
|---|
specifies the settings for an input table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the divisor to use in calculating variances and standard deviations.
| Default | DF |
|---|
specifies a numeric variable that is used as a weight to perform a weighted analysis of the data.
Extracts principal components by using the eigenvalue decomposition method.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
required parametertable |
— |
specifies the settings for an input table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
casOut |
writes SAS DATA step code for computing predicted values of the fitted model. |
|
|
required parametercasOut |
specifies the output table to be created to contain various statistics, including means, standard deviations, eigenvalues, and eigenvectors. |
|
|
required parametercasOut |
specifies the output table to be created to contain observationwise statistics. If you do not specify any statistic, then only the principal component scores are included. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
— |
specifies the output table to be created to save model fit information that you can use for scoring. |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Aliases | attribute |
|---|---|
| attr |
writes SAS DATA step code for computing predicted values of the fitted model.
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
when set to True, computes the principal components from the covariance matrix.
| Alias | covariance |
|---|---|
| Default | false |
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies a numeric variable that contains the frequency of occurrence of each observation.
specifies the settings for graphics processing units (GPUs). If you specify the groupBy subparameter in the table parameter, the gpu parameter is ignored. If no GPU is available, the CPU is used to perform model computation.
when set to True, enables the action to use GPUs to perform model computation on GPU-equipped machines (one GPU is used on each machine).
| Default | false |
|---|
suppresses the analysis if the number of BY groups exceeds the specified value.
| Minimum value | 1 |
|---|
specifies the variables to be analyzed. For the NIPALS, ITERGS, and EIG methods, you can analyze only numeric variables. If you omit the parameter, all numeric variables that are not specified in other parameters are analyzed. You must specify either INPUTS or MODEL for the RANDOMPCA method, but you can use both numeric and categorical variables.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Aliases | input |
|---|---|
| vars | |
| var |
specifies the number of principal components to be computed. If the value is 0, all principal components are computed.
| Default | 0 |
|---|---|
| Minimum value | 0 |
when set to True, omits the intercept from the model.
| Default | false |
|---|
specifies the output table to be created to contain observationwise statistics. If you do not specify any statistic, then only the principal component scores are included.
The eigOutput value can be one or more of the following:
specifies the output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
copies one or more variables from the input table to the output table.
| Alias | copyVar |
|---|
requests residuals for each analysis variable. The residuals are included only if you also specify the partial parameter. If set to an empty string, the prefix R is used for naming the output variables.
| Aliases | r |
|---|---|
| resid |
specifies principal component scores for each principal component. If the value is an empty string, the prefix Score is used to name the output variables.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies the output table to be created to contain various statistics, including means, standard deviations, eigenvalues, and eigenvectors.
The eigOutstat value can be one or more of the following:
specifies the output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a prefix for naming the residual variables.
| Aliases | parPrefix |
|---|---|
| pPrefix | |
| Default | "R" |
specifies numerical variables to be partialed out if you want to analyze a partial correlation or covariance matrix.
specifies a prefix for naming the principal components.
| Default | "Prin" |
|---|
specifies the singularity criterion.
| Alias | sing |
|---|---|
| Default | 1E-08 |
| Range | 0–1 |
when set to True, standardizes to unit variance the principal component scores in the output table that is specified in the output parameter.
| Alias | standard |
|---|---|
| Default | false |
specifies the output table to be created to save model fit information that you can use for scoring.
For more information about specifying the store parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | saveState |
|---|
specifies the settings for an input table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the divisor to use in calculating variances and standard deviations.
| Default | DF |
|---|
specifies a numeric variable that is used as a weight to perform a weighted analysis of the data.
Extracts principal components by using the eigenvalue decomposition method.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
required parametertable |
— |
specifies the settings for an input table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
casOut |
writes SAS DATA step code for computing predicted values of the fitted model. |
|
|
required parametercasOut |
specifies the output table to be created to contain various statistics, including means, standard deviations, eigenvalues, and eigenvectors. |
|
|
required parametercasOut |
specifies the output table to be created to contain observationwise statistics. If you do not specify any statistic, then only the principal component scores are included. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
— |
specifies the output table to be created to save model fit information that you can use for scoring. |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Aliases | attribute |
|---|---|
| attr |
writes SAS DATA step code for computing predicted values of the fitted model.
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
when set to True, computes the principal components from the covariance matrix.
| Alias | covariance |
|---|---|
| Default | False |
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies a numeric variable that contains the frequency of occurrence of each observation.
specifies the settings for graphics processing units (GPUs). If you specify the groupBy subparameter in the table parameter, the gpu parameter is ignored. If no GPU is available, the CPU is used to perform model computation.
when set to True, enables the action to use GPUs to perform model computation on GPU-equipped machines (one GPU is used on each machine).
| Default | False |
|---|
suppresses the analysis if the number of BY groups exceeds the specified value.
| Minimum value | 1 |
|---|
specifies the variables to be analyzed. For the NIPALS, ITERGS, and EIG methods, you can analyze only numeric variables. If you omit the parameter, all numeric variables that are not specified in other parameters are analyzed. You must specify either INPUTS or MODEL for the RANDOMPCA method, but you can use both numeric and categorical variables.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Aliases | input |
|---|---|
| vars | |
| var |
specifies the number of principal components to be computed. If the value is 0, all principal components are computed.
| Default | 0 |
|---|---|
| Minimum value | 0 |
when set to True, omits the intercept from the model.
| Default | False |
|---|
specifies the output table to be created to contain observationwise statistics. If you do not specify any statistic, then only the principal component scores are included.
The eigOutput value can be one or more of the following:
specifies the output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
copies one or more variables from the input table to the output table.
| Alias | copyVar |
|---|
requests residuals for each analysis variable. The residuals are included only if you also specify the partial parameter. If set to an empty string, the prefix R is used for naming the output variables.
| Aliases | r |
|---|---|
| resid |
specifies principal component scores for each principal component. If the value is an empty string, the prefix Score is used to name the output variables.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies the output table to be created to contain various statistics, including means, standard deviations, eigenvalues, and eigenvectors.
The eigOutstat value can be one or more of the following:
specifies the output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a prefix for naming the residual variables.
| Aliases | parPrefix |
|---|---|
| pPrefix | |
| Default | "R" |
specifies numerical variables to be partialed out if you want to analyze a partial correlation or covariance matrix.
specifies a prefix for naming the principal components.
| Default | "Prin" |
|---|
specifies the singularity criterion.
| Alias | sing |
|---|---|
| Default | 1E-08 |
| Range | 0–1 |
when set to True, standardizes to unit variance the principal component scores in the output table that is specified in the output parameter.
| Alias | standard |
|---|---|
| Default | False |
specifies the output table to be created to save model fit information that you can use for scoring.
For more information about specifying the store parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | saveState |
|---|
specifies the settings for an input table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the divisor to use in calculating variances and standard deviations.
| Default | DF |
|---|
specifies a numeric variable that is used as a weight to perform a weighted analysis of the data.
Extracts principal components by using the eigenvalue decomposition method.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
required parametertable |
— |
specifies the settings for an input table. |
|
Parameter |
Subparameter |
Description |
|---|---|---|
|
casOut |
writes SAS DATA step code for computing predicted values of the fitted model. |
|
|
required parametercasOut |
specifies the output table to be created to contain various statistics, including means, standard deviations, eigenvalues, and eigenvectors. |
|
|
required parametercasOut |
specifies the output table to be created to contain observationwise statistics. If you do not specify any statistic, then only the principal component scores are included. |
|
|
names |
lists the names of results tables to save as CAS tables on the server. |
|
|
— |
specifies the output table to be created to save model fit information that you can use for scoring. |
changes the attributes of variables used in this action. Currently, attributes specified on the inputs and nominals parameter are ignored.
For more information about specifying the attributes parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Aliases | attribute |
|---|---|
| attr |
writes SAS DATA step code for computing predicted values of the fitted model.
For more information about specifying the code parameter, see the common aircodegen parameter (Appendix A: Common Parameters).
when set to True, computes the principal components from the covariance matrix.
| Alias | covariance |
|---|---|
| Default | FALSE |
specifies a list of results tables to send to the client for display.
For more information about specifying the display parameter, see the common displayTables parameter (Appendix A: Common Parameters).
specifies a numeric variable that contains the frequency of occurrence of each observation.
specifies the settings for graphics processing units (GPUs). If you specify the groupBy subparameter in the table parameter, the gpu parameter is ignored. If no GPU is available, the CPU is used to perform model computation.
when set to True, enables the action to use GPUs to perform model computation on GPU-equipped machines (one GPU is used on each machine).
| Default | FALSE |
|---|
suppresses the analysis if the number of BY groups exceeds the specified value.
| Minimum value | 1 |
|---|
specifies the variables to be analyzed. For the NIPALS, ITERGS, and EIG methods, you can analyze only numeric variables. If you omit the parameter, all numeric variables that are not specified in other parameters are analyzed. You must specify either INPUTS or MODEL for the RANDOMPCA method, but you can use both numeric and categorical variables.
For more information about specifying the inputs parameter, see the common casinvardesc parameter (Appendix A: Common Parameters).
| Aliases | input |
|---|---|
| vars | |
| var |
specifies the number of principal components to be computed. If the value is 0, all principal components are computed.
| Default | 0 |
|---|---|
| Minimum value | 0 |
when set to True, omits the intercept from the model.
| Default | FALSE |
|---|
specifies the output table to be created to contain observationwise statistics. If you do not specify any statistic, then only the principal component scores are included.
The eigOutput value can be one or more of the following:
specifies the output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
copies one or more variables from the input table to the output table.
| Alias | copyVar |
|---|
requests residuals for each analysis variable. The residuals are included only if you also specify the partial parameter. If set to an empty string, the prefix R is used for naming the output variables.
| Aliases | r |
|---|---|
| resid |
specifies principal component scores for each principal component. If the value is an empty string, the prefix Score is used to name the output variables.
lists the names of results tables to save as CAS tables on the server.
For more information about specifying the outputTables parameter, see the common outputTables parameter (Appendix A: Common Parameters).
| Alias | displayOut |
|---|
specifies the output table to be created to contain various statistics, including means, standard deviations, eigenvalues, and eigenvectors.
The eigOutstat value can be one or more of the following:
specifies the output table.
For more information about specifying the casOut parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
specifies a prefix for naming the residual variables.
| Aliases | parPrefix |
|---|---|
| pPrefix | |
| Default | "R" |
specifies numerical variables to be partialed out if you want to analyze a partial correlation or covariance matrix.
specifies a prefix for naming the principal components.
| Default | "Prin" |
|---|
specifies the singularity criterion.
| Alias | sing |
|---|---|
| Default | 1E-08 |
| Range | 0–1 |
when set to True, standardizes to unit variance the principal component scores in the output table that is specified in the output parameter.
| Alias | standard |
|---|---|
| Default | FALSE |
specifies the output table to be created to save model fit information that you can use for scoring.
For more information about specifying the store parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | saveState |
|---|
specifies the settings for an input table.
For more information about specifying the table parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the divisor to use in calculating variances and standard deviations.
| Default | DF |
|---|
specifies a numeric variable that is used as a weight to perform a weighted analysis of the data.