The package is not yet available from Bioconductor.
Install the development version of the AnVILAz package from GitHub with
if (!requireNamespace("BiocManager", quietly = TRUE))
install.packages("BiocManager", repos = "https://cran.r-project.org")
BiocManager::install("Bioconductor/AnVILAz")Once installed, load the package with
For this tutorial we will refer to the Azure Blob Storage service as ABS. Within the ABS, we are given access to a Container. For more information, follow this link to Microsoft’s definition of containers and blobs.
The avlist command corresponds to a view of the files in
the Blob container on Azure. They can also be accessed via the Microsoft
Azure Storage Explorer.
As an example, we load the internal mtcars dataset and
save it as an .Rda file with save. We can then
upload this file to the ABS.
Now we can upload the data to the analyses/ folder in
the Azure Blob Storage (ABS) Container.
We can also use a small log file for demonstration purposes. The
jupyter.log file is already present in our workspace
directory.
We can remove the data with avremove and the
relative path to the .Rda file.
The reverse operation is also possible with a remote and local paths as the first and second arguments, respectively.
To upload an entire folder, we can use avbackup. Note
that the entire test folder becomes a subfolder of the
remote analyses folder in this example.
By default, the entire source directory will be copied
to the current working directory ".", i.e., the base
workspace directory.
You may also move this to another folder by providing a folder name as the second argument.
DATA tabmtcars exampleFirst we create an example dataset for uploading to the
DATA tab. We create a model_id column from the
rownames.
The avtable_import command takes an existing R object
(usually a tibble) and uploads to the DATA tab
in the AnVIL User Interface. The table argument will set
the name of the table. We also need to provide the
primaryKey which corresponds to the column name that
uniquely identifies each row in the data. Typically, the
primaryKey column provides a list of patient or UUID
identifiers and is in the first column of the data.
The avtable function will pull the data from the
DATA tab and represent the data locally as a
tibble. It works by using the same type
identifier (i.e., the table argument) that was used when
the data was uploaded.
The API allows deletion of specific rows in the data using
avtable_delete_values. To indicate which row to delete,
provide the a value or set of values that correspond to row identifiers
in the primaryKey. In this example, we remove the
AMC-Javelin entry from the data. We are left with 31
records.
If you experience issues, please feel free to contact us with a reproducible example on GitHub:
## R version 4.5.2 (2025-10-31)
## Platform: x86_64-pc-linux-gnu
## Running under: Ubuntu 24.04.3 LTS
##
## Matrix products: default
## BLAS: /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
## LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.26.so; LAPACK version 3.12.0
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_US.UTF-8 LC_COLLATE=C
## [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
##
## time zone: Etc/UTC
## tzcode source: system (glibc)
##
## attached base packages:
## [1] stats graphics grDevices utils datasets methods base
##
## other attached packages:
## [1] AnVILAz_1.4.0 BiocStyle_2.39.0
##
## loaded via a namespace (and not attached):
## [1] vctrs_0.6.5 httr_1.4.7 cli_3.6.5
## [4] knitr_1.50 rlang_1.1.6 xfun_0.54
## [7] jsonlite_2.0.0 glue_1.8.0 rjsoncons_1.3.2
## [10] buildtools_1.0.0 htmltools_0.5.8.1 maketools_1.3.2
## [13] BiocBaseUtils_1.13.0 sys_3.4.3 sass_0.4.10
## [16] rappdirs_0.3.3 rmarkdown_2.30 tibble_3.3.0
## [19] evaluate_1.0.5 jquerylib_0.1.4 fastmap_1.2.0
## [22] yaml_2.3.10 lifecycle_1.0.4 httr2_1.2.1
## [25] BiocManager_1.30.26 compiler_4.5.2 pkgconfig_2.0.3
## [28] digest_0.6.37 R6_2.6.1 pillar_1.11.1
## [31] magrittr_2.0.4 bslib_0.9.0 tools_4.5.2
## [34] AnVILBase_1.4.0 cachem_1.1.0