Calculate basic statistics on a matrix
scm_gene_stat(
  mat_id,
  niche_quantile = 0.2,
  downsample_n = NULL,
  K_std_n = 1000
)
niche_quantile: A value between 0 and 1.
mat_id: The input matrix.
Returns a dataset that contains statistics for all genes that passed the filtering stage. Columns starting with ds contain UMI statistics after downsampling; columns starting with n contain UMI statistics after normalizing UMIs so that the UMIs in each cell sum to 1. The columns are:
Total gene expression
Gene variance
Number of cells in which the gene is expressed
Correlation with cell size
sz_cor after subtracting the trend
How many of the gene's UMIs are found in the top X% of most highly expressing cells (regularized)
niche_stat after subtracting the niche_norm trend: median niche_norm value of genes with similar total expression
Mean after normalization
Largest count, after downsampling
2nd largest count, after downsampling
3rd largest count, after downsampling
Mean on downsampled data
Variance on downsampled data
log2 of ds_var/ds_mean
ds_log_varmean after subtracting the trend
Number of cells in which the gene is expressed, after downsampling
Number of UMIs used for downsampling
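To make a few of the descriptions above concrete, the following is a minimal base R sketch on a toy matrix. It is not the package's implementation: the data, the variable names, and the output column names are all hypothetical, the niche fraction is left unregularized, and no trend subtraction or downsampling is attempted.

```r
# Toy UMI matrix (hypothetical data): genes in rows, cells in columns.
umis <- matrix(
  c(0, 2, 1, 5,
    3, 0, 0, 1,
    4, 4, 2, 6),
  nrow = 3, byrow = TRUE,
  dimnames = list(c("geneA", "geneB", "geneC"), paste0("cell", 1:4))
)

cell_size <- colSums(umis)                  # total UMIs per cell

tot          <- rowSums(umis)               # total gene expression
gene_var     <- apply(umis, 1, var)         # gene variance across cells
n_expressing <- rowSums(umis > 0)           # cells in which the gene is expressed
size_cor     <- apply(umis, 1, cor, y = cell_size)  # correlation with cell size

# Normalize so that the UMIs of each cell sum to 1, then average per gene.
norm      <- sweep(umis, 2, cell_size, "/")
norm_mean <- rowMeans(norm)

# log2 of variance over mean (computed here on the raw counts, not downsampled).
log_varmean <- log2(gene_var / rowMeans(umis))

# Fraction of each gene's UMIs found in the top quantile of expressing cells.
# Assumption: an unregularized fraction, with the quantile chosen for illustration.
niche_quantile <- 0.25
top_n <- ceiling(niche_quantile * ncol(umis))
niche_frac <- apply(umis, 1, function(x) {
  sum(sort(x, decreasing = TRUE)[seq_len(top_n)]) / sum(x)
})

stats <- data.frame(tot, gene_var, n_expressing, size_cor,
                    norm_mean, log_varmean, niche_frac)
print(stats)
```

A gene whose UMIs are concentrated in a handful of cells gets a niche fraction close to 1, whereas a uniformly expressed gene gets a value close to the chosen quantile.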