dataPreprocess

highCardinality

Description

Performs randomized cardinality estimation.

dataPreprocess.highCardinality <result=results> <status=rc> /
    casOutHighCardinalityDetails={ caslib="string", compress=TRUE | FALSE, indexVars={"variable-name-1" <, "variable-name-2">}, label="string", lifetime=64-bit-integer, maxMemSize=64-bit-integer, memoryFormat="DVR" | "INHERIT" | "STANDARD", name="table-name", promote=TRUE | FALSE, replace=TRUE | FALSE, replication=integer, tableRedistUpPolicy="DEFER" | "NOREDIST" | "REBALANCE", threadBlockSize=64-bit-integer, timeStamp="string", where={"string-1" <, "string-2">} },
    freq="variable-name",
    includeMissingGroup=TRUE | FALSE,
    inputs={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>},
    nRegisterBits=integer,
    outputTableOptions={ forceTableReturn=TRUE | FALSE, tableNames={"string-1" <, "string-2">} },
    table={caslib="string", computedOnDemand=TRUE | FALSE, computedVars={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>}, computedVarsProgram="string", dataSourceOptions={key-1=any-list-or-data-type-1 <, key-2=any-list-or-data-type-2>}, groupBy={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>}, groupByMode="NOSORT" | "REDISTRIBUTE", importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}, name="table-name" <*, orderBy={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>}, singlePass=TRUE | FALSE, vars={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>}, where="where-expression", whereTable={casLib="string", dataSourceOptions={adls_noreq-parameters | bigquery-parameters | cas_noreq-parameters | clouddex-parameters | db2-parameters | dnfs-parameters | esp-parameters | fedsvr-parameters | gcs_noreq-parameters | hadoop-parameters | hana-parameters | impala-parameters | informix-parameters | jdbc-parameters | mongodb-parameters | mysql-parameters | odbc-parameters | oracle-parameters | path-parameters | postgres-parameters | redshift-parameters | s3-parameters | sapiq-parameters | sforce-parameters | singlestore_standard-parameters | snowflake-parameters | spark-parameters | spde-parameters | sqlserver-parameters | ss_noreq-parameters | teradata-parameters | vertica-parameters | yellowbrick-parameters}, importOptions={fileType="ANY" | "AUDIO" | "AUTO" | "BASESAS" | "CSV" | "DELIMITED" | "DOCUMENT" | "DTA" | "ESP" | "EXCEL" | "FMT" | "HDAT" | "IMAGE" | "JMP" | "LASR" | "PARQUET" | "SOUND" | "SPSS" | "VIDEO" | "XLS", fileType-specific-parameters}, name="table-name" <*, vars={{format="string", formattedLength=integer, label="string", name="variable-name" <*, nfd=integer, nfl=integer>} <, {...}>}, where="where-expression">} },
    weight="variable-name" ;

Settings
Parameter: Description

casOutHighCardinalityDetails: Specifies the settings for an output table. For more information about specifying the casOutHighCardinalityDetails parameter, see the common casouttable parameter.

freq: Specifies the frequency variable.

includeMissingGroup: When set to True, missing values are allowed as group-by keys. Default is FALSE.

inputs: Specifies the variables to use for the analysis. You can specify a subset of the variables from the input table. For more information about specifying the inputs parameter, see the common casinvardesc parameter.

nRegisterBits: Specifies the exponent that determines the number of registers to be used by the HyperLogLog++ cardinality estimator. Default is 10. Range is 5–16.

outputTableOptions: Specifies options for result tables. You can specify which result tables the server returns and how group-by results are handled. Alias: tblOpts. The outputTableOptions value can be one or more of the following:
    forceTableReturn: When set to True, result tables are returned to the client even if the output is also saved as an output table. Default is FALSE.
    tableNames: Specifies the names of result tables to generate. By default, all result tables are returned. Alias: outputTables.

table: Specifies the table name, caslib, and other common parameters. For more information about specifying the table parameter, see the common castable parameter.

weight: Specifies the weight variable.
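
To give a feel for what nRegisterBits controls, here is a minimal Python sketch. It assumes the usual HyperLogLog convention that an exponent p yields 2^p registers, and it uses the classic HyperLogLog approximation of 1.04 / sqrt(m) for the relative standard error. The action's documentation does not state either formula, so treat the numbers as a rough guide rather than the action's guaranteed accuracy. The function name hll_register_stats is hypothetical.

```python
import math


def hll_register_stats(n_register_bits: int = 10):
    """Return (register count, approximate relative standard error).

    Assumption: registers = 2 ** nRegisterBits and the error follows the
    classic HyperLogLog estimate 1.04 / sqrt(registers).
    """
    # The documented range for nRegisterBits is 5-16.
    if not 5 <= n_register_bits <= 16:
        raise ValueError("nRegisterBits must be in the range 5-16")
    registers = 2 ** n_register_bits
    rel_std_error = 1.04 / math.sqrt(registers)
    return registers, rel_std_error


# Compare the documented minimum, default, and maximum settings.
for bits in (5, 10, 16):
    m, err = hll_register_stats(bits)
    print(f"nRegisterBits={bits:2d}  registers={m:6d}  approx. error={err:.2%}")
```

Under these assumptions, the default of 10 corresponds to 1024 registers and an error of roughly 3.25%. Each additional bit doubles the memory used for registers and shrinks the expected error by a factor of about 1.41.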

Examples
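
The following is a minimal sketch of how the action might be called from Python through the SWAT package. The table name mydata, the column names customer_id and zip_code, the output table name hc_details, and both helper functions are hypothetical. Only the parameter names (table, inputs, nRegisterBits, casOutHighCardinalityDetails) come from the syntax above. The helper that builds the parameters runs without a server, while run_high_cardinality assumes an existing swat.CAS connection.

```python
def build_high_cardinality_params(table_name, input_vars, n_register_bits=10,
                                  caslib=None, out_table=None):
    """Build a parameter dictionary for dataPreprocess.highCardinality."""
    # The documented range for nRegisterBits is 5-16 (default 10).
    if not 5 <= n_register_bits <= 16:
        raise ValueError("nRegisterBits must be in the range 5-16")

    table = {"name": table_name}
    if caslib is not None:
        table["caslib"] = caslib

    params = {
        "table": table,
        # inputs is a list of variable descriptors; only name is set here.
        "inputs": [{"name": var} for var in input_vars],
        "nRegisterBits": n_register_bits,
    }
    if out_table is not None:
        # Save the details to an output table, replacing any existing copy.
        params["casOutHighCardinalityDetails"] = {"name": out_table,
                                                  "replace": True}
    return params


def run_high_cardinality(conn, params):
    """Run the action on an existing swat.CAS connection (not created here)."""
    conn.loadactionset("dataPreprocess")
    return conn.dataPreprocess.highCardinality(**params)


# Hypothetical table and column names, for illustration only.
params = build_high_cardinality_params(
    "mydata", ["customer_id", "zip_code"], out_table="hc_details"
)
print(params)
```

Keeping parameter construction separate from the server call makes it possible to inspect or validate the request before it is sent.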

FAQ

What is the highCardinality Action?