cis_em_genomic.Rd
Get candidates non-promoter cis regulation
cis_em_genomic( meth_mat, expr_mat, gene_tss, min_samples = 100, max_k = 50, max_dist = 500000, min_dist = 200, spearman = FALSE, parallel = FALSE )
meth_mat | Matrix with methylation values. Each row is a locus and each column is a sample. Rownames should contain "chrom", "start" and "end" separated by "_" |
---|---|
expr_mat | Matrix with expression values. Each row is a gene and each column is a sample. |
gene_tss | intervals set with additional "name" column with transcription start site (TSS) for each gene. Used for allocating a genomic coordinate for each gene. In case of duplicate names or coordinates the first one in the matrix will be used. |
min_samples | minimal number of samples per gene (both expression and methylation). Default is 100. |
max_k | maximal number of genes to report per locus. Default is 50. Note that for efficiency (memory and computation) it is best to keep this number small. |
max_dist | maximal distance from TSS to consider "cis" (used for FDR computations) |
min_dist | minimal distance from TSS (in order to exclude promoter regulation). Used for FDR computation. |
spearman | use spearman correlation (if FALSE - use pearson) |
parallel | use mclapply for computation of ranks. Note that for large matrices forking might fail due to lack of memory and therefore parallel should be set to FALSE. |