TCIApathfinder wraps the REST API of The Cancer Imaging Archive (TCIA). See the TCIApathfinder vignettes for an introduction to package usage. This vignette shows how images downloaded with TCIApathfinder can be processed and analyzed with other R packages.
library(TCIApathfinder)
# Pick a patient of interest
patient <- "TCGA-AR-A1AQ"
# Get information on all image series for this patient
series <- get_series_info(patient_id = patient)
# Pick an image series to download
series_instance_uid <- as.character(series$series[1, "series_instance_uid"])
# Download and unzip the image series
ser <- save_image_series(series_instance_uid = series_instance_uid, out_dir = "~/Desktop", out_file_name = "series1.zip")
dicom_dir <- "~/Desktop/series1/"
unzip("~/Desktop/series1.zip", exdir = dicom_dir)
The oro.dicom package provides functions to process image files in DICOM format, which is the format used by TCIA. See the oro.dicom package documentation for further details.
suppressPackageStartupMessages(library(oro.dicom))
# Read in the DICOM images and create a 3D array of intensities
dicom_list <- readDICOM(dicom_dir)
img_array_3d <- create3D(dicom_list)
# Check the dimensions of the 3D array
dim(img_array_3d)
## [1] 256 256 116
Note that this series consists of 116 DICOM images. Each image is 256x256 pixels.
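Once the series is a 3D array, individual slices and per-slice summaries can be obtained with base R indexing. The following is a minimal sketch, not part of the original workflow: it uses a synthetic array of random values named `img` as a stand-in for `img_array_3d`, so it runs without downloading any data. It assumes the third dimension indexes the slices, as the dimensions above suggest.

```r
# Synthetic stand-in for img_array_3d (assumption: rows x columns x slices).
# Real intensities would come from create3D(); these are random values.
set.seed(1)
img <- array(runif(256 * 256 * 116), dim = c(256, 256, 116))

# Extract the middle slice as a 2D matrix
mid <- ceiling(dim(img)[3] / 2)
slice <- img[, , mid]
dim(slice)

# Mean intensity of each slice, computed over the third dimension
slice_means <- apply(img, 3, mean)
length(slice_means)

# Display the middle slice in grayscale
image(slice, col = gray(0:255 / 255), axes = FALSE, asp = 1)
```

With real data, replacing `img` with `img_array_3d` should give the same structure, although the orientation of the displayed slice may need to be adjusted.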
This patient is included in The Cancer Genome Atlas (TCGA). A variety of germline and somatic genomic data can be downloaded with the Bioconductor package TCGAbiolinks. See the TCGAbiolinks package vignettes for further detail. A sample workflow for analyzing TCGA data is provided in "TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages".