library(MultiAssayExperiment)
library(SpatialExperiment)
library(SingleCellMultiModal)
The dataset consists of two data types, seq-FISH data was provided by Zhu et al. (2018), while scRNA-seq data was provided by Tasic et al. (2016).
Data have been retrievedas part of the Hackathon in the Mathematical Frameworks for Integrative Analysis of Emerging Biological DataTypes workshop.
The user can see the available dataset by using the default options
SingleCellMultiModal::seqFISH(DataType="mouse_visual_cortex",
modes="*", dry.run=TRUE, version="2.0.0")
## snapshotDate(): 2021-05-18
## ah_id mode file_size rdataclass rdatadateadded
## 1 EH3785 scRNA_Counts 0.2 Mb matrix 2020-09-14
## 2 EH3786 scRNA_Labels 0 Mb data.frame 2020-09-14
## 3 EH3787 seqFISH_Coordinates 0 Mb data.frame 2020-09-14
## 4 EH3788 seqFISH_Counts 0.2 Mb matrix 2020-09-14
## 5 EH3789 seqFISH_Labels 0 Mb data.frame 2020-09-14
## rdatadateremoved
## 1 <NA>
## 2 <NA>
## 3 <NA>
## 4 <NA>
## 5 <NA>
Or simply by running:
mae <- SingleCellMultiModal::seqFISH(DataType="mouse_visual_cortex",
modes="*", dry.run=FALSE, version="2.0.0")
## snapshotDate(): 2021-05-18
## Working on: scRNA_Counts
## Working on: scRNA_Labels
## Working on: seqFISH_Coordinates
## Working on: seqFISH_Counts
## Working on: seqFISH_Labels
## see ?SingleCellMultiModal and browseVignettes('SingleCellMultiModal') for documentation
## loading from cache
## see ?SingleCellMultiModal and browseVignettes('SingleCellMultiModal') for documentation
## loading from cache
## see ?SingleCellMultiModal and browseVignettes('SingleCellMultiModal') for documentation
## loading from cache
## see ?SingleCellMultiModal and browseVignettes('SingleCellMultiModal') for documentation
## loading from cache
## see ?SingleCellMultiModal and browseVignettes('SingleCellMultiModal') for documentation
## loading from cache
## Warning: 'ExperimentList' contains 'data.frame' or 'DataFrame',
## potential for errors with mixed data types
## Warning: 'ExperimentList' contains 'data.frame' or 'DataFrame',
## potential for errors with mixed data types
## Warning: 'ExperimentList' contains 'data.frame' or 'DataFrame',
## potential for errors with mixed data types
mae
## A MultiAssayExperiment object of 2 listed
## experiments with user-defined names and respective classes.
## Containing an ExperimentList class object of length 2:
## [1] seqFISH: SpatialExperiment with 113 rows and 1597 columns
## [2] scRNAseq: SingleCellExperiment with 113 rows and 1723 columns
## Functionality:
## experiments() - obtain the ExperimentList instance
## colData() - the primary/phenotype DataFrame
## sampleMap() - the sample coordination DataFrame
## `$`, `[`, `[[` - extract colData columns, subset, or experiment
## *Format() - convert into a long or wide DataFrame
## assays() - convert ExperimentList to a SimpleList of matrices
## exportClass() - save all data to files
Example with actual data:
experiments(mae)
## ExperimentList class object of length 2:
## [1] seqFISH: SpatialExperiment with 113 rows and 1597 columns
## [2] scRNAseq: SingleCellExperiment with 113 rows and 1723 columns
Check row annotations:
rownames(mae)
## CharacterList of length 2
## [["seqFISH"]] abca15 abca9 acta2 adcy4 aldh3b2 ... wrn zfp182 zfp715 zfp90
## [["scRNAseq"]] abca15 abca9 acta2 adcy4 aldh3b2 ... wrn zfp182 zfp715 zfp90
Take a peek at the sampleMap
:
sampleMap(mae)
## DataFrame with 3320 rows and 3 columns
## assay primary colname
## <factor> <character> <character>
## 1 seqFISH V2 V2
## 2 seqFISH V3 V3
## 3 seqFISH V4 V4
## 4 seqFISH V5 V5
## 5 seqFISH V6 V6
## ... ... ... ...
## 3316 scRNAseq V1719 V1719
## 3317 scRNAseq V1720 V1720
## 3318 scRNAseq V1721 V1721
## 3319 scRNAseq V1722 V1722
## 3320 scRNAseq V1723 V1723
The scRNA-seq data are accessible with $scRNAseq
, which returns a
SingleCellExperiment class object, with all its associated methods.
experiments(mae)$scRNAseq
## class: SingleCellExperiment
## dim: 113 1723
## metadata(0):
## assays(1): counts
## rownames(113): abca15 abca9 ... zfp715 zfp90
## rowData names(1): X
## colnames(1723): V1 V2 ... V1722 V1723
## colData names(3): broad_type sample_name dissection
## reducedDimNames(0):
## mainExpName: NULL
## altExpNames(0):
Otherwhise the assay
function can be used to access the scRNAseq assay
stored in the mae
MultiAssayExperiment object.
head(assay(mae, "scRNAseq"))[,1:4]
## V1 V2 V3 V4
## abca15 11 42 17 42
## abca9 22 46 22 46
## acta2 15 47 15 42
## adcy4 12 45 12 45
## aldh3b2 27 49 27 49
## amigo2 23 43 101 43
The seq-FISH data are accessible with $seqFISH
, which returns a
SpatialExperiment class object.
experiments(mae)$seqFISH
## class: SpatialExperiment
## dim: 113 1597
## metadata(0):
## assays(1): counts
## rownames(113): abca15 abca9 ... zfp715 zfp90
## rowData names(1): X
## colnames(1597): V2 V3 ... V1597 V1598
## colData names(7): cluster class ... Cell_ID Irrelevant
## reducedDimNames(0):
## mainExpName: NULL
## altExpNames(0):
## spatialData names(2) : Cell_ID Irrelevant
## spatialCoords names(2) : x y
## imgData names(0):
Otherwhise the assay
function can be used to access the seqFISH assay
stored in the mae
MultiAssayExperiment object.
head(assay(mae, "seqFISH"))[,1:4]
## V2 V3 V4 V5
## abca15 68 49 50 39
## abca9 41 42 38 36
## acta2 25 23 16 21
## adcy4 39 54 37 18
## aldh3b2 101 47 41 52
## amigo2 93 64 93 93
Spatial data can be retrieved with spatialData
function on the
SpatialExperiment object.
(sd <- spatialData(experiments(mae)$seqFISH))
## DataFrame with 1597 rows and 2 columns
## Cell_ID Irrelevant
## <integer> <character>
## V2 1 -
## V3 2 -
## V4 3 -
## V5 4 -
## V6 5 -
## ... ... ...
## V1594 1593 -
## V1595 1594 -
## V1596 1595 -
## V1597 1596 -
## V1598 1597 -
Spatial coordinates within the spatial data can be retrieved in matrix form with spatialCoords
function on the SpatialExperiment object.
head(sc <- spatialCoords(experiments(mae)$seqFISH))
## x y
## [1,] 265.76 -231.14
## [2,] 290.48 -261.52
## [3,] 257.12 -133.35
## [4,] 753.46 -261.14
## [5,] 700.01 -169.05
## [6,] 415.63 -252.45
Direct access to the colnames of the spacial coordinates with
spatialCoordsNames
function.
spatialCoordsNames(experiments(mae)$seqFISH)
## [1] "x" "y"
The provided seqFISH dataset comes out in two different versions:
The full scRNA-seq data matrix is 24057 rows x 1809 columns.
To access the v1.0.0 simply run
mae <- SingleCellMultiModal::seqFISH(DataType="mouse_visual_cortex",
modes="*", dry.run=FALSE, version="1.0.0")
## snapshotDate(): 2021-05-18
## Working on: scRNA_Full_Counts
## Working on: scRNA_Full_Labels
## Working on: seqFISH_Coordinates
## Working on: seqFISH_Counts
## Working on: seqFISH_Labels
## see ?SingleCellMultiModal and browseVignettes('SingleCellMultiModal') for documentation
## loading from cache
## see ?SingleCellMultiModal and browseVignettes('SingleCellMultiModal') for documentation
## loading from cache
## see ?SingleCellMultiModal and browseVignettes('SingleCellMultiModal') for documentation
## loading from cache
## see ?SingleCellMultiModal and browseVignettes('SingleCellMultiModal') for documentation
## loading from cache
## see ?SingleCellMultiModal and browseVignettes('SingleCellMultiModal') for documentation
## loading from cache
## Warning: 'ExperimentList' contains 'data.frame' or 'DataFrame',
## potential for errors with mixed data types
## Warning: 'ExperimentList' contains 'data.frame' or 'DataFrame',
## potential for errors with mixed data types
## Warning: 'ExperimentList' contains 'data.frame' or 'DataFrame',
## potential for errors with mixed data types
mae
## A MultiAssayExperiment object of 2 listed
## experiments with user-defined names and respective classes.
## Containing an ExperimentList class object of length 2:
## [1] seqFISH: SpatialExperiment with 113 rows and 1597 columns
## [2] scRNAseq: SingleCellExperiment with 24057 rows and 1809 columns
## Functionality:
## experiments() - obtain the ExperimentList instance
## colData() - the primary/phenotype DataFrame
## sampleMap() - the sample coordination DataFrame
## `$`, `[`, `[[` - extract colData columns, subset, or experiment
## *Format() - convert into a long or wide DataFrame
## assays() - convert ExperimentList to a SimpleList of matrices
## exportClass() - save all data to files
sessionInfo()
## R version 4.1.0 (2021-05-18)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 20.04.2 LTS
##
## Matrix products: default
## BLAS: /home/biocbuild/bbs-3.13-bioc/R/lib/libRblas.so
## LAPACK: /home/biocbuild/bbs-3.13-bioc/R/lib/libRlapack.so
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_US.UTF-8 LC_COLLATE=C
## [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
##
## attached base packages:
## [1] parallel stats4 stats graphics grDevices utils datasets
## [8] methods base
##
## other attached packages:
## [1] SpatialExperiment_1.2.0 scater_1.20.0
## [3] ggplot2_3.3.3 scran_1.20.1
## [5] scuttle_1.2.0 SingleCellExperiment_1.14.1
## [7] rhdf5_2.36.0 RaggedExperiment_1.16.0
## [9] SingleCellMultiModal_1.4.1 MultiAssayExperiment_1.18.0
## [11] SummarizedExperiment_1.22.0 Biobase_2.52.0
## [13] GenomicRanges_1.44.0 GenomeInfoDb_1.28.0
## [15] IRanges_2.26.0 S4Vectors_0.30.0
## [17] BiocGenerics_0.38.0 MatrixGenerics_1.4.0
## [19] matrixStats_0.59.0 BiocStyle_2.20.0
##
## loaded via a namespace (and not attached):
## [1] AnnotationHub_3.0.0 BiocFileCache_2.0.0
## [3] plyr_1.8.6 igraph_1.2.6
## [5] BiocParallel_1.26.0 digest_0.6.27
## [7] htmltools_0.5.1.1 viridis_0.6.1
## [9] magick_2.7.2 fansi_0.5.0
## [11] magrittr_2.0.1 memoise_2.0.0
## [13] ScaledMatrix_1.0.0 cluster_2.1.2
## [15] limma_3.48.0 Biostrings_2.60.1
## [17] R.utils_2.10.1 colorspace_2.0-1
## [19] blob_1.2.1 rappdirs_0.3.3
## [21] xfun_0.23 dplyr_1.0.6
## [23] crayon_1.4.1 RCurl_1.98-1.3
## [25] jsonlite_1.7.2 glue_1.4.2
## [27] gtable_0.3.0 zlibbioc_1.38.0
## [29] XVector_0.32.0 UpSetR_1.4.0
## [31] DelayedArray_0.18.0 BiocSingular_1.8.0
## [33] DropletUtils_1.12.1 Rhdf5lib_1.14.0
## [35] HDF5Array_1.20.0 scales_1.1.1
## [37] DBI_1.1.1 edgeR_3.34.0
## [39] Rcpp_1.0.6 viridisLite_0.4.0
## [41] xtable_1.8-4 dqrng_0.3.0
## [43] bit_4.0.4 rsvd_1.0.5
## [45] metapod_1.0.0 httr_1.4.2
## [47] ellipsis_0.3.2 farver_2.1.0
## [49] pkgconfig_2.0.3 R.methodsS3_1.8.1
## [51] uwot_0.1.10 sass_0.4.0
## [53] dbplyr_2.1.1 locfit_1.5-9.4
## [55] utf8_1.2.1 tidyselect_1.1.1
## [57] labeling_0.4.2 rlang_0.4.11
## [59] later_1.2.0 AnnotationDbi_1.54.0
## [61] munsell_0.5.0 BiocVersion_3.13.1
## [63] tools_4.1.0 cachem_1.0.5
## [65] generics_0.1.0 RSQLite_2.2.7
## [67] ExperimentHub_2.0.0 evaluate_0.14
## [69] stringr_1.4.0 fastmap_1.1.0
## [71] yaml_2.2.1 knitr_1.33
## [73] bit64_4.0.5 purrr_0.3.4
## [75] KEGGREST_1.32.0 sparseMatrixStats_1.4.0
## [77] mime_0.10 formatR_1.11
## [79] R.oo_1.24.0 compiler_4.1.0
## [81] beeswarm_0.4.0 filelock_1.0.2
## [83] curl_4.3.1 png_0.1-7
## [85] interactiveDisplayBase_1.30.0 tibble_3.1.2
## [87] statmod_1.4.36 bslib_0.2.5.1
## [89] stringi_1.6.2 highr_0.9
## [91] RSpectra_0.16-0 lattice_0.20-44
## [93] bluster_1.2.1 Matrix_1.3-4
## [95] vctrs_0.3.8 pillar_1.6.1
## [97] lifecycle_1.0.0 rhdf5filters_1.4.0
## [99] BiocManager_1.30.15 jquerylib_0.1.4
## [101] RcppAnnoy_0.0.18 BiocNeighbors_1.10.0
## [103] cowplot_1.1.1 bitops_1.0-7
## [105] irlba_2.3.3 httpuv_1.6.1
## [107] R6_2.5.0 bookdown_0.22
## [109] promises_1.2.0.1 gridExtra_2.3
## [111] vipor_0.4.5 codetools_0.2-18
## [113] assertthat_0.2.1 rjson_0.2.20
## [115] withr_2.4.2 GenomeInfoDbData_1.2.6
## [117] grid_4.1.0 beachmat_2.8.0
## [119] rmarkdown_2.8 DelayedMatrixStats_1.14.0
## [121] shiny_1.6.0 ggbeeswarm_0.6.0
Tasic, Bosiljka, Vilas Menon, Thuc Nghi Nguyen, Tae Kyung Kim, Tim Jarsky, Zizhen Yao, Boaz Levi, et al. 2016. “Adult Mouse Cortical Cell Taxonomy Revealed by Single Cell Transcriptomics.” Nature Neuroscience 19 (2): 335.
Zhu, Qian, Sheel Shah, Ruben Dries, Long Cai, and Guo-Cheng Yuan. 2018. “Identification of Spatially Associated Subpopulations by Combining scRNAseq and Sequential Fluorescence in Situ Hybridization Data.” Nature Biotechnology 36 (12): 1183.