by default, a greedy search or exhaustive search is used to determine the best split for each variable of each tree node. When set to False, a fast and efficient algorithm that is based on clustering is applied. Setting this parameter to False is recommended for variables with high cardinality. D...
specifies the method for finding a split on a nominal input. Alias: nomSearch. The tkcasdt_nomSearchOpts value can be one or more of the following: handling: "CLASSIC" | "ENHANCED". maxCategories: specifies the maximum number of levels for a splitting rule to include. Aliases: maxCats, maxLevels,...
Performs randomized cardinality estimation.
specifies the settings for an output table. For more information about specifying the casOutHighCardinalityDetails parameter, see the common casouttable parameter. Alias: casout.
specifies the exponent that determines the number of registers to be used by the HyperLogLog++ cardinality estimator. Default: 10. Range: 5–16.
Practical example of merging SAS datasets with a many-to-one cardinality relationship.
This example illustrates how the SAS Viya CARDINALITY procedure handles variables with user-defined formats. It demonstrates the use of predefined formats for level ordering and determining a variable's cardinality.
This SAS script illustrates how to use the CARDINALITY procedure in SAS Viya 4 to analyze variable cardinality and, specifically, how to enforce a sorting order (ascending or descending) for a variable's levels. The CARDINALITY procedure is useful for understanding the distribution and the number...
This script uses the CARDINALITY procedure to analyze the distribution of levels for a numeric or character variable, focusing on specific data segments for deeper exploration.
/* 1. Charger le jeu de données SASHELP.CARS en session CAS */ proc cas; session casauto; caslib _all_ assign; data casuser.cars; set sashelp.cars; run; quit; /* 2. Créer un format défini par l'utilisateur pour engineSize */ proc format casfmtlib='casuser.myfmtlib'; value en...