Adds the clustering results of computeKmeansCluster()
in form
of grouping variables to the object's overall data - making them available for the across
-
argument.
addKmeansClusterVariables(
object,
variable_set,
k,
phase = NULL,
method_kmeans = NULL,
verbose = NULL
)
object | A valid cypro object. |
---|---|
variable_set | Character value. Denotes the variable set of interest. Use |
k | Numeric vector. All k-values of interest. |
phase | Character or numeric. If character, the ordinal value referring to the phase of interest (e.g. 'first', 'second' etc.). referring to the phase of interest or 'all'. If numeric, the number referring to the phase. If set to NULL takes the phase denoted as default with Ignored if the experiment design contains only one phase. |
method_kmeans | Character vector (or value see details for more.) Denotes the algorithms of interest. Defaults to 'Hartigan-Wong'. Use |
verbose | Logical. If set to TRUE informative messages regarding the computational progress will be printed. (Warning messages will always be printed.) |
An updated cypro object that contains the data added.
The last step of the kmeans clustering pipeline. This function iterates
over all combinations of method_kmeans
and k
and
adds the respective clustering variables to the object's overall data named according to
the following syntax: kmeans_method_kmeans
_k_k
_(variable_set)
.
This naming concept results in somewhat bulky but unambiguous clustering names. You can always
rename grouping variables with renameClusterDf()
.
Use getGroupingVariableNames()
afterwards to obtain all grouping variables.