dataPreprocess

highCardinality

Description

Performs randomized cardinality estimation.

dataPreprocess.highCardinality <result=results> <status=rc> / casOutHighCardinalityDetails={ caslib="string", compress=TRUE | FALSE, indexVars={"variable-name-1" <, "variable-name-2">}, label="string", lifetime=64-bit-integer, maxMemSize=64-bit-integer, memoryFormat="DVR" | "INHERIT" | "STANDARD", name="table-name", promote=TRUE | FALSE, replace=TRUE | FALSE, replication=integer, tableRedistUpPolicy="DEFER" | "NOREDIST" | "REBALANCE", threadBlockSize=64-bit-integer, timeStamp="string", where={"string-1" <, "string-2">} }, freq="variable-name", includeMissingGroup=TRUE | FALSE, inputs={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>}, nRegisterBits=integer, outputTableOptions={ forceTableReturn=TRUE | FALSE, tableNames={"string-1" <, "string-2">} }, table={caslib="string", computedOnDemand=TRUE | FALSE, computedVars={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>}, computedVarsProgram="string", dataSourceOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2>}, groupBy={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>}, groupByMode="NOSORT" | "REDISTRIBUTE", importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}, name="table-name" <*, orderBy={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>}, singlePass=TRUE | FALSE, vars={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>}, where="where-expression", whereTable={casLib="string", dataSourceOptions={adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}, importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}, name="table-name" <*, vars={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>}, where="where-expression">} }, weight="variable-name" ;
Settings
ParameterDescription
casOutHighCardinalityDetails Specifies the settings for an output table. For more information about specifying the casOutHighCardinalityDetails parameter, see the common casouttable parameter.
freq Specifies the frequency variable.
includeMissingGroup When set to True, missing values are allowed as group-by keys. Default is FALSE.
inputs Specifies the variables to use for the analysis. You can specify a subset of the variables from the input table. For more information about specifying the inputs parameter, see the common casinvardesc parameter.
nRegisterBits Specifies the exponent that determines the number of registers to be used by the HyperLogLog++ cardinality estimator. Default is 10. Range is 5–16.
outputTableOptions Specifies options for result tables. You can specify which result tables the server returns and how group-by results are handled. Alias: tblOpts. The outputTableOptions value can be one or more of the following: forceTableReturn: When set to True, result tables are returned to the client even if the output is also saved as an output table. Default is FALSE. tableNames: Specifies the names of result tables to generate. By default, all result tables are returned. Alias: outputTables.
table Specifies the table name, caslib, and other common parameters. For more information about specifying the table parameter, see the common castable parameter.
weight Specifies the weight variable.

Examples

FAQ

What is the highCardinality Action?
casOutHighCardinalityDetails
freq
includeMissingGroup
inputs
nRegisterBits
outputTableOptions
table
weight