Provides actions for mining textual data.
Generates the parent table and SVD projections from new textual data based on the results from previous training data. This action requires a SAS Visual Text Analytics license or a SAS Visual Data Mining and Machine Learning license.
If a row includes a subparameter, you can specify the name, caslib, and so on in the subparameter. Otherwise, you can specify the name, caslib, and so on in the parameter.
| Parameter | Subparameter | Description |
|---|---|---|
| documents (required) | — | names the input CAS table of documents to be parsed. You must include a text variable specified with textVar and a document ID variable specified with docIdVar. |
| liti | — | specifies the input CAS table that contains the LITI binary, which contains the predefined or custom concept definitions. The tmMine action can reference a concepts model that is compiled in the compileConcept action. For more information on how to do this, see the example, Referencing a Concepts Model in the tmMine Action. This parameter requires a SAS Visual Text Analytics license. |
| multiterm | — | specifies the name of the CAS table that contains a list of multi-word terms and their part-of-speech tags. Each multi-word term is parsed as a single token. This parameter requires a SAS Visual Text Analytics license. |
| parseConfig (required) | — | specifies the name of the input CAS table that contains parsing configuration information. |
| termTopics | — | specifies the input textTopic table that is used for assigning learned topics to new data. |
| terms (required) | — | specifies the name of the input table that contains information about the terms in the document collection. The table is used to determine which terms to use in the topic calculation. |
| topics | — | specifies the input topic table that is used for assigning learned topics to new data. |
| u | — | specifies the input U matrix, which contains the left singular vectors. The matrix is used to form the document projections. |

| Parameter | Subparameter | Description |
|---|---|---|
| docPro | — | specifies the table to contain the SVD projections of the documents. The matrix used to make the projections is the input table U. |
| parent | — | specifies the name of the output CAS table to contain a compressed representation of the sparse term-by-document matrix. |
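
As a rough illustration of how the table-valued parameters fit together, the sketch below assembles them as plain Python dictionaries with `name` and `caslib` keys. The parameter names and the three required parameters come from this section; the table names, the caslib, and the helpers `table_ref` and `build_score_params` are hypothetical. How the resulting dictionary is handed to the action depends on your client and is not shown.

```python
# Parameter names as documented in this section.
REQUIRED = ("documents", "parseConfig", "terms")
INPUT_TABLES = ("documents", "liti", "multiterm", "parseConfig",
                "termTopics", "terms", "topics", "u")
OUTPUT_TABLES = ("docPro", "parent")


def table_ref(name, caslib=None):
    """Describe a CAS table by name and, optionally, caslib (hypothetical helper)."""
    ref = {"name": name}
    if caslib is not None:
        ref["caslib"] = caslib
    return ref


def build_score_params(**tables):
    """Check table parameters against the documented names and return them as a dict."""
    unknown = sorted(set(tables) - set(INPUT_TABLES) - set(OUTPUT_TABLES))
    if unknown:
        raise ValueError(f"unknown table parameters: {unknown}")
    missing = [name for name in REQUIRED if name not in tables]
    if missing:
        raise ValueError(f"missing required parameters: {missing}")
    return dict(tables)


# All table names and the caslib below are made up for illustration.
params = build_score_params(
    documents=table_ref("new_reviews", caslib="casuser"),
    parseConfig=table_ref("train_config"),
    terms=table_ref("train_terms"),
    u=table_ref("train_u"),
    docPro=table_ref("score_docpro"),
    parent=table_ref("score_parent"),
)
```

Failing early on a missing required table keeps the error close to the call site instead of surfacing later as a server-side message.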
specifies the location of the language binary files. The location must be valid. If it is not specified, a default installation location is used.
specifies how the elements in the term-by-document matrix (the parent output table) are weighted.
| Alias | cellWgt |
|---|---|
| Default | LOG |
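
The exact formula behind the LOG setting is not stated here. A minimal sketch, assuming the common text-mining definition in which a raw term frequency f is replaced by log2(f + 1), might look like this (the function name `log_cell_weight` is hypothetical):

```python
import math


def log_cell_weight(frequency):
    """Dampen a raw term frequency using the assumed formula log2(f + 1)."""
    if frequency < 0:
        raise ValueError("frequency must be non-negative")
    return math.log2(frequency + 1)


# A term that appears 7 times gets weight 3.0 instead of 7,
# so very frequent terms dominate a document less.
weights = {count: log_cell_weight(count) for count in (0, 1, 3, 7)}
```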
specifies the character or numeric variable on the documents table that contains the ID of each document.
| Default | "DOC_ID" |
|---|---|
specifies the table to contain the SVD projections of the documents. The matrix used to make the projections is the input table U.
For more information about specifying the docPro parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
names the input CAS table of documents to be parsed. You must include a text variable specified with textVar and a document ID variable specified with docIdVar.
For more information about specifying the documents parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies whether to use the legacy variable names on tables. This parameter requires a SAS Visual Text Analytics license or a SAS Visual Data Mining and Machine Learning license.
| Default | FALSE |
|---|---|
specifies the input CAS table that contains the LITI binary, which contains the predefined or custom concept definitions. The tmMine action can reference a concepts model that is compiled in the compileConcept action. For more information on how to do this, see the example, Referencing a Concepts Model in the tmMine Action. This parameter requires a SAS Visual Text Analytics license.
For more information about specifying the liti parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the name of the CAS table that contains a list of multi-word terms and their part-of-speech tags. Each multi-word term is parsed as a single token. This parameter requires a SAS Visual Text Analytics license.
For more information about specifying the multiterm parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the number of threads to be used per node. The value must be an integer. When the value is 0, the number of threads equals the number of CPUs.
| Default | 8 |
|---|---|
| Range | 0–64 |
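
The thread-count rule above can be sketched as a small validation function. This is an illustration only: the name `resolve_thread_count` is hypothetical, and `os.cpu_count()` reports the CPUs of the local machine, which stands in here for the CPU count of a server node.

```python
import os


def resolve_thread_count(n_threads=8):
    """Apply the documented rule: integer in 0-64, where 0 means one thread per CPU."""
    if isinstance(n_threads, bool) or not isinstance(n_threads, int):
        raise TypeError("the thread count must be an integer")
    if not 0 <= n_threads <= 64:
        raise ValueError("the thread count must be between 0 and 64")
    if n_threads == 0:
        # Local stand-in for the per-node CPU count; fall back to 1 if unknown.
        return os.cpu_count() or 1
    return n_threads
```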
specifies the name of the output CAS table to contain a compressed representation of the sparse term-by-document matrix.
For more information about specifying the parent parameter, see the common casouttable (Form 1) parameter (Appendix A: Common Parameters).
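
To make "compressed representation of the sparse term-by-document matrix" concrete, the sketch below stores only the nonzero cells as one row per (term, document) pair. The column names `_TermNum_`, `_Document_`, and `_Count_` and the function `to_sparse_rows` are assumptions for illustration; the actual layout of the parent table is not described in this section.

```python
def to_sparse_rows(dense_counts, doc_ids):
    """Convert a dense term-by-document count matrix into rows for nonzero cells only.

    dense_counts[i][j] is the count of term i + 1 in document doc_ids[j].
    """
    rows = []
    for term_num, term_row in enumerate(dense_counts, start=1):
        for doc_id, count in zip(doc_ids, term_row):
            if count:  # zero cells are simply not stored
                rows.append(
                    {"_TermNum_": term_num, "_Document_": doc_id, "_Count_": count}
                )
    return rows


# Three terms by three documents; five of the nine cells are zero.
dense = [
    [2, 0, 1],
    [0, 0, 3],
    [0, 1, 0],
]
sparse = to_sparse_rows(dense, ["d1", "d2", "d3"])
```

Because most terms do not occur in most documents, storing only the nonzero cells is usually far smaller than the full matrix.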
specifies the name of the input CAS table that contains parsing configuration information.
For more information about specifying the parseConfig parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
| Alias | scoreConfig |
|---|---|
specifies the name of the input table that contains information about the terms in the document collection. The table is used to determine which terms to use in the topic calculation.
For more information about specifying the terms parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the input textTopic table that is used for assigning learned topics to new data.
For more information about specifying the termTopics parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the character variable in the documents table that contains the text to be processed.
| Default | "text" |
|---|---|
specifies the input topic table that is used for assigning learned topics to new data.
For more information about specifying the topics parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the input U matrix, which contains the left singular vectors. The matrix is used to form the document projections.
For more information about specifying the u parameter, see the common castable (Form 1) parameter (Appendix A: Common Parameters).
specifies the name of the index variable on the input U matrix.
| Default | "_TermNum_" |
|---|---|
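
The role of the U matrix and its index variable can be illustrated with plain linear algebra: each document's projection is the weighted sum of the U rows for the terms it contains. The sketch below is an assumption-laden illustration, not the action's implementation. The column names `_Document_`, `_Count_`, and `vector`, the function `project_documents`, and the absence of any normalization or scaling are all assumptions.

```python
from collections import defaultdict


def project_documents(parent_rows, u_rows, index_var="_TermNum_"):
    """Project documents onto the columns of U using sparse term-by-document rows."""
    # Look up each term's row of U by the index variable.
    u_by_term = {row[index_var]: row["vector"] for row in u_rows}
    if not u_by_term:
        raise ValueError("the U matrix has no rows")
    k = len(next(iter(u_by_term.values())))

    projections = defaultdict(lambda: [0.0] * k)
    for row in parent_rows:
        vector = u_by_term.get(row["_TermNum_"])
        if vector is None:
            continue  # a term that was not seen in training has no row in U
        doc = projections[row["_Document_"]]
        for j, value in enumerate(vector):
            doc[j] += row["_Count_"] * value
    return dict(projections)


# Toy U with three training terms and two dimensions.
u_rows = [
    {"_TermNum_": 1, "vector": [1.0, 0.0]},
    {"_TermNum_": 2, "vector": [0.0, 1.0]},
    {"_TermNum_": 3, "vector": [0.5, 0.5]},
]
# Weighted cells for two new documents; term 4 does not appear in U.
parent_rows = [
    {"_TermNum_": 1, "_Document_": "d1", "_Count_": 2.0},
    {"_TermNum_": 3, "_Document_": "d1", "_Count_": 2.0},
    {"_TermNum_": 2, "_Document_": "d2", "_Count_": 1.0},
    {"_TermNum_": 4, "_Document_": "d2", "_Count_": 5.0},
]
doc_projections = project_documents(parent_rows, u_rows)
```

Keying U by the index variable rather than by row position keeps the projection correct even when the rows of U arrive in a different order than the terms.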