Subset functions allow to conveniently split your data by certain characteristics such as cell lines, conditions, cluster etc. or for specific cell ids. This might be useful if you want apply some machine learning algorithms such as clustering and correlation on only a subset of cells. See details for more information.

subsetByCellId(
  object,
  new_name,
  cell_ids,
  reasoning = NULL,
  verbose = NULL,
  ...
)

Arguments

object

A valid cypro object.

new_name

Character value. Denotes the name of the output object. If set to NULL the name of the input object is taken and suffixed with '_subset'.

cell_ids

Character vector. Denotes the cells to keep unambiguously with their cell ids.

reasoning

Character value. Allows two write a short description of how the cell ids according to which the object is filtered were selected. This description is included in the output of printSubsetHistory(). Ignored if set to NULL.

verbose

Logical. If set to TRUE informative messages regarding the computational progress will be printed.

(Warning messages will always be printed.)

Value

A cypro object that contains the data for the subsetted cells.

Details

Creating subsets of your data affects analysis results such as clustering and correlation which is why these results are reset in the subsetted object and must be computed again. To prevent inadvertent overwriting the default directory is reset as well. Make sure to set a new one via setDefaultDirectory().

The mechanism with which you create the subset is stored in the output object. Use printSubsetHistory() to reconstruct the way from the original object to the current one.