TabulaMurisData 1.12.0
The TabulaMurisData
data package provides access to the 10x and SmartSeq2
single-cell RNA-seq data sets from
the Tabula Muris Consortium.
The contents of the package can be seen by querying the ExperimentHub for the
package name.
suppressPackageStartupMessages({
library(ExperimentHub)
library(SingleCellExperiment)
library(TabulaMurisData)
})
#> snapshotDate(): 2021-10-18
eh <- ExperimentHub()
#> snapshotDate(): 2021-10-18
query(eh, "TabulaMurisData")
#> ExperimentHub with 2 records
#> # snapshotDate(): 2021-10-18
#> # $dataprovider: Tabula Muris Consortium
#> # $species: Mus musculus
#> # $rdataclass: SingleCellExperiment
#> # additional mcols(): taxonomyid, genome, description,
#> # coordinate_1_based, maintainer, rdatadateadded, preparerclass, tags,
#> # rdatapath, sourceurl, sourcetype
#> # retrieve records with, e.g., 'object[["EH1617"]]'
#>
#> title
#> EH1617 | TabulaMurisDroplet
#> EH1618 | TabulaMurisSmartSeq2
The individual data sets can be accessed using either their ExperimentHub accession number, or the convenience functions provided in this package. For example, for the 10x data:
droplet <- eh[["EH1617"]]
#> see ?TabulaMurisData and browseVignettes('TabulaMurisData') for documentation
#> loading from cache
droplet
#> class: SingleCellExperiment
#> dim: 23341 70118
#> metadata(0):
#> assays(1): counts
#> rownames(23341): 0610005C13Rik 0610007C21Rik ... Zzef1 Zzz3
#> rowData names(2): ID Symbol
#> colnames(70118): 10X_P4_0_AAACCTGAGATTACCC 10X_P4_0_AAACCTGAGTGCCAGA
#> ... 10X_P8_15_TTTGTCATCTTACCGC 10X_P8_15_TTTGTCATCTTGTTTG
#> colData names(10): cell channel ... cell_ontology_id free_annotation
#> reducedDimNames(0):
#> mainExpName: NULL
#> altExpNames(0):
droplet <- TabulaMurisDroplet()
#> see ?TabulaMurisData and browseVignettes('TabulaMurisData') for documentation
#> loading from cache
droplet
#> class: SingleCellExperiment
#> dim: 23341 70118
#> metadata(0):
#> assays(1): counts
#> rownames(23341): 0610005C13Rik 0610007C21Rik ... Zzef1 Zzz3
#> rowData names(2): ID Symbol
#> colnames(70118): 10X_P4_0_AAACCTGAGATTACCC 10X_P4_0_AAACCTGAGTGCCAGA
#> ... 10X_P8_15_TTTGTCATCTTACCGC 10X_P8_15_TTTGTCATCTTGTTTG
#> colData names(10): cell channel ... cell_ontology_id free_annotation
#> reducedDimNames(0):
#> mainExpName: NULL
#> altExpNames(0):
iSEE
Each data set is provided in the form of a SingleCellExperiment
object. To
gain further insights into the contents of the data sets, they can be explored
using, e.g., the iSEE package. For the purposes of this vignette,
we first subsample a small subset of the cells in the 10x data set, to reduce
the run time.
set.seed(1234)
se <- droplet[, sample(seq_len(ncol(droplet)), 250, replace = FALSE)]
se
#> class: SingleCellExperiment
#> dim: 23341 250
#> metadata(0):
#> assays(1): counts
#> rownames(23341): 0610005C13Rik 0610007C21Rik ... Zzef1 Zzz3
#> rowData names(2): ID Symbol
#> colnames(250): 10X_P8_12_ACGGGCTGTCAGAGGT 10X_P7_10_CGTCCATGTTATGCGT
#> ... 10X_P7_9_TGACAACGTGTAAGTA 10X_P8_14_GATCTAGCACGGCCAT
#> colData names(10): cell channel ... cell_ontology_id free_annotation
#> reducedDimNames(0):
#> mainExpName: NULL
#> altExpNames(0):
Next, we calculate size factors and normalize the data using the scran and scater packages, and perform dimension reduction using PCA and t-SNE.
se <- scran::computeSumFactors(se)
se <- scater::logNormCounts(se)
se <- scater::runPCA(se)
se <- scater::runTSNE(se)
Finally, we call iSEE
with the subsampled SingleCellExperiment
object. This
opens up an instance of iSEE
containing the provided data set.
if (require(iSEE)) {
iSEE(se)
}
sessionInfo()
#> R version 4.1.1 (2021-08-10)
#> Platform: x86_64-pc-linux-gnu (64-bit)
#> Running under: Ubuntu 20.04.3 LTS
#>
#> Matrix products: default
#> BLAS: /home/biocbuild/bbs-3.14-bioc/R/lib/libRblas.so
#> LAPACK: /home/biocbuild/bbs-3.14-bioc/R/lib/libRlapack.so
#>
#> locale:
#> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
#> [3] LC_TIME=en_GB LC_COLLATE=C
#> [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
#> [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
#> [9] LC_ADDRESS=C LC_TELEPHONE=C
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
#>
#> attached base packages:
#> [1] stats4 stats graphics grDevices utils datasets methods
#> [8] base
#>
#> other attached packages:
#> [1] TabulaMurisData_1.12.0 SingleCellExperiment_1.16.0
#> [3] SummarizedExperiment_1.24.0 Biobase_2.54.0
#> [5] GenomicRanges_1.46.0 GenomeInfoDb_1.30.0
#> [7] IRanges_2.28.0 S4Vectors_0.32.0
#> [9] MatrixGenerics_1.6.0 matrixStats_0.61.0
#> [11] ExperimentHub_2.2.0 AnnotationHub_3.2.0
#> [13] BiocFileCache_2.2.0 dbplyr_2.1.1
#> [15] BiocGenerics_0.40.0 BiocStyle_2.22.0
#>
#> loaded via a namespace (and not attached):
#> [1] Rtsne_0.15 ggbeeswarm_0.6.0
#> [3] colorspace_2.0-2 ellipsis_0.3.2
#> [5] scuttle_1.4.0 bluster_1.4.0
#> [7] XVector_0.34.0 BiocNeighbors_1.12.0
#> [9] ggrepel_0.9.1 bit64_4.0.5
#> [11] interactiveDisplayBase_1.32.0 AnnotationDbi_1.56.1
#> [13] fansi_0.5.0 sparseMatrixStats_1.6.0
#> [15] cachem_1.0.6 knitr_1.36
#> [17] scater_1.22.0 jsonlite_1.7.2
#> [19] cluster_2.1.2 png_0.1-7
#> [21] shiny_1.7.1 BiocManager_1.30.16
#> [23] compiler_4.1.1 httr_1.4.2
#> [25] dqrng_0.3.0 assertthat_0.2.1
#> [27] Matrix_1.3-4 fastmap_1.1.0
#> [29] limma_3.50.0 later_1.3.0
#> [31] BiocSingular_1.10.0 htmltools_0.5.2
#> [33] tools_4.1.1 rsvd_1.0.5
#> [35] igraph_1.2.7 gtable_0.3.0
#> [37] glue_1.4.2 GenomeInfoDbData_1.2.7
#> [39] dplyr_1.0.7 rappdirs_0.3.3
#> [41] Rcpp_1.0.7 jquerylib_0.1.4
#> [43] vctrs_0.3.8 Biostrings_2.62.0
#> [45] DelayedMatrixStats_1.16.0 xfun_0.27
#> [47] stringr_1.4.0 beachmat_2.10.0
#> [49] mime_0.12 lifecycle_1.0.1
#> [51] irlba_2.3.3 statmod_1.4.36
#> [53] edgeR_3.36.0 zlibbioc_1.40.0
#> [55] scales_1.1.1 promises_1.2.0.1
#> [57] parallel_4.1.1 yaml_2.2.1
#> [59] curl_4.3.2 gridExtra_2.3
#> [61] memoise_2.0.0 ggplot2_3.3.5
#> [63] sass_0.4.0 stringi_1.7.5
#> [65] RSQLite_2.2.8 BiocVersion_3.14.0
#> [67] ScaledMatrix_1.2.0 scran_1.22.0
#> [69] filelock_1.0.2 BiocParallel_1.28.0
#> [71] rlang_0.4.12 pkgconfig_2.0.3
#> [73] bitops_1.0-7 evaluate_0.14
#> [75] lattice_0.20-45 purrr_0.3.4
#> [77] bit_4.0.4 tidyselect_1.1.1
#> [79] magrittr_2.0.1 bookdown_0.24
#> [81] R6_2.5.1 generics_0.1.1
#> [83] metapod_1.2.0 DelayedArray_0.20.0
#> [85] DBI_1.1.1 pillar_1.6.4
#> [87] withr_2.4.2 KEGGREST_1.34.0
#> [89] RCurl_1.98-1.5 tibble_3.1.5
#> [91] crayon_1.4.1 utf8_1.2.2
#> [93] rmarkdown_2.11 viridis_0.6.2
#> [95] locfit_1.5-9.4 grid_4.1.1
#> [97] blob_1.2.2 digest_0.6.28
#> [99] xtable_1.8-4 httpuv_1.6.3
#> [101] munsell_0.5.0 viridisLite_0.4.0
#> [103] beeswarm_0.4.0 vipor_0.4.5
#> [105] bslib_0.3.1