runReport {RCAS} | R Documentation |
This is the main report generation function for RCAS. This function can take a BED file, a GTF file and optionally an MSIGDB gene set annotation (or any text file containing annotations with the same structure as defined in MSIGDB); and use these input to run multiple RCAS functions to create a summary report regarding the annotation data that overlap the input BED file, enrichment analysis for GO terms, gene sets from MSIGDB, and motif analysis.
runReport(queryFilePath = "testdata", gffFilePath = "testdata", msigdbFilePath = "testdata", annotationSummary = TRUE, goAnalysis = TRUE, msigdbAnalysis = TRUE, motifAnalysis = TRUE, genomeVersion = "hg19", outDir = getwd(), printProcessedTables = FALSE, sampleN = 0, quiet = FALSE, selfContained = TRUE)
queryFilePath |
a BED format file which contains genomic coordinates of protein-RNA binding sites |
gffFilePath |
A GTF format file which contains genome annotations (preferably from ENSEMBL) |
msigdbFilePath |
Gene set annotations for Homo sapiens from Molecular Signatures Database or any text file that has the same structure. Regardless of which species is being studied (see genomeVersion parameter), msigdbFilePath must contain annotations for human genes. The gene sets will be mapped from human to other species if genomeVersion is set to anything except human genome versions (e.g. mm9 or dm3). |
annotationSummary |
TRUE/FALSE (default: TRUE) A switch to decide if RCAS should provide annotation summaries from overlap operations |
goAnalysis |
TRUE/FALSE (default: TRUE) A switch to decide if RCAS should run GO term enrichment analysis |
msigdbAnalysis |
TRUE/FALSE (default: TRUE) A switch to decide if RCAS should run gene set enrichment analysis |
motifAnalysis |
TRUE/FALSE (default: TRUE) A switch to decide if RCAS should run motif analysis |
genomeVersion |
A character string to denote for which genome version the analysis is being done. Available options are hg19/hg38 (human), mm9/mm10 (mouse), ce10 (worm) and dm3 (fly). |
outDir |
Path to the output directory. (default: current working directory) |
printProcessedTables |
boolean value (default: FALSE). If set to TRUE, raw data tables that are used for plots/tables will be printed to text files. |
sampleN |
integer value (default: 0). A parameter to determine if the input query regions should be downsampled to a smaller size in order to make report generation quicker. When set to 0, downsampling won't be done. To activate the sampling a positive integer value that is smaller than the total number of query regions should be given. |
quiet |
boolean value (default: FALSE). If set to TRUE, progress bars and chunk labels will be suppressed while knitting the Rmd file. |
selfContained |
boolean value (default: TRUE). By default, the generated html file will be self-contained, which means that all figures and tables will be embedded in a single html file with no external dependencies (See rmarkdown::html_document) |
An html generated using rmarkdown/knitr/pandoc that contains interactive figures, tables, and text that provide an overview of the experiment
#Default run will generate a report using built-in test data for hg19 genome. ## Not run: runReport() ## End(Not run) #A custom run for human ## Not run: runReport( queryFilePath = 'input.BED', gffFilePath = 'annotation.gtf', msigdbFilePath = 'human_msigdb.gmt') ## End(Not run) # To turn off certain modules of the report ## Not run: runReport( queryFilePath = 'input.BED', gffFilePath = 'annotation.gtf', msigdbFilePath = 'human_msigdb.gmt', motifAnalysis = FALSE, goAnalysis = FALSE ) ## End(Not run) # To run the pipeline for species other than human # If the msigdb module is needed, the msigdbFilePath # must be set to the MSIGDB annotations for 'human'. # MSIGDB datasets for other species will be calculated # in the background using the createOrthologousMsigdbDataset # function ## Not run: runReport( queryFilePath = 'input.mm9.BED', gffFilePath = 'annotation.mm9.gtf', msigdbFilePath = 'msigdb.human.gmt', genomeVersion = 'mm9' ) ## End(Not run)