generateNull {singscore} | R Documentation |
This function generates a number of random gene sets that
have the same number of genes as the scored gene set. It scores each random
gene set and returns a matrix of scores for all samples.
The empirical scores are used to calculate the empirical p-values and plot
the null distribution. The implementation uses BiocParallel::bplapply()
for easy access to parallel backends. Note that one should pass the same
values to the upSet
, downSet
, centerScore
and bidirectional
arguments as what they provide for the simpleScore()
function to generate
a proper null distribution.
generateNull( upSet, downSet = NULL, rankData, subSamples = NULL, centerScore = TRUE, knownDirection = TRUE, B = 1000, ncores = 1, seed = sample.int(1e+06, 1), useBPPARAM = NULL ) ## S4 method for signature 'vector,ANY' generateNull( upSet, downSet = NULL, rankData, subSamples = NULL, centerScore = TRUE, knownDirection = TRUE, B = 1000, ncores = 1, seed = sample.int(1e+06, 1), useBPPARAM = NULL ) ## S4 method for signature 'GeneSet,ANY' generateNull( upSet, downSet = NULL, rankData, subSamples = NULL, centerScore = TRUE, knownDirection = TRUE, B = 1000, ncores = 1, seed = sample.int(1e+06, 1), useBPPARAM = NULL ) ## S4 method for signature 'vector,vector' generateNull( upSet, downSet = NULL, rankData, subSamples = NULL, centerScore = TRUE, knownDirection = TRUE, B = 1000, ncores = 1, seed = sample.int(1e+06, 1), useBPPARAM = NULL ) ## S4 method for signature 'GeneSet,GeneSet' generateNull( upSet, downSet = NULL, rankData, subSamples = NULL, centerScore = TRUE, knownDirection = TRUE, B = 1000, ncores = 1, seed = sample.int(1e+06, 1), useBPPARAM = NULL )
upSet |
A GeneSet object or character vector of gene IDs of up-regulated gene set or a gene set where the nature of genes is not known |
downSet |
A GeneSet object or character vector of gene IDs of down-regulated gene set or NULL where only a single gene set is provided |
rankData |
A matrix object, ranked gene expression matrix data generated
using the |
subSamples |
A vector of sample labels/indices that will be used to subset the rankData matrix. All samples will be scored if not provided |
centerScore |
A Boolean, specifying whether scores should be centered
around 0, default as TRUE. Note: scores never centered if |
knownDirection |
A boolean, determining whether the gene set should be considered to be directional or not. A gene set is directional if the type of genes in it are known i.e. up- or down-regulated. This should be set to TRUE if the gene set is composed of both up- AND down-regulated genes. Defaults to TRUE. This parameter becomes irrelevant when both upSet(Colc) and downSet(Colc) are provided. |
B |
integer, the number of permutation repeats or the number of random gene sets to be generated, default as 1000 |
ncores, |
integer, the number of CPU cores the function can use |
seed |
integer, set the seed for randomisation |
useBPPARAM, |
the backend the function uses, if NULL is provided, the
function uses the default parallel backend which is the first on the list
returned by |
A matrix of empirical scores for all samples
Ruqian Lyu
Post about BiocParallel
browseVignettes("BiocParallel")
ranked <- rankGenes(toy_expr_se) scoredf <- simpleScore(ranked, upSet = toy_gs_up, downSet = toy_gs_dn) # find out what backends can be registered on your machine BiocParallel::registered() # the first one is the default backend # ncores = ncores <- parallel::detectCores() - 2 permuteResult = generateNull(upSet = toy_gs_up, downSet = toy_gs_dn, ranked, centerScore = TRUE, B =10, seed = 1, ncores = 1 )