Release notes

DATASET IMPROVEMENTS

TARGET GRCh38 Dataset

The Therapeutically Applicable Research to Generate Effective Treatments (TARGET) dataset provides genomic, transcriptomic, and epigenomic data from patients representing several childhood cancers and serves as a valuable complement to the existing genomic and multi-omic datasets available on the CGC. The complete TARGET GRCh38 dataset, which includes both Open Data accessible to all researchers and Controlled Data, to which access is regulated by the Database for Genotypes and Phenotypes (dbGaP), is now available on the CGC. This dataset can be queried using the Data Browser to generate custom cohorts from within this dataset as well as cohorts derived from multiple similarly aligned datasets such as the TARGET GRCh38 and TCGA GRCh38 datasets.

TARGET GRCh38 dataset and its metadata.

CPTAC Dataset

The CGC now provides access - through both a Public Project and the Data Browser - to mass spectrometry data that were generated by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) as part of the TCGA initiative to characterize and quantify the proteome of cancer samples. This dataset represents 335 samples from patients with Breast Invasive Carcinoma, Colon Adenocarcinoma, Ovarian Serous Cystadenocarcinoma, and Rectum Adenocarcinoma for whom matched genomic data are available. The dataset can be queried using the CGC Data Browser to generate custom cohorts from within this dataset as well as multi-omic cohorts across the TCGA GRCh38 genomic and CPTAC proteomic datasets.

Learn more about the CPTAC public projectCPTAC data, and CPTAC metadata.

Multiple-dataset Querying and Controlled Data Access

To maximize the accessibility and value of the multi-omic datasets available on the CGC, the Data Browser now enables cross-dataset queries for datasets with harmonized metadata. This allows researchers to use the Data Browser to identify cohorts of interest across multiple genomic datasets such as the GRCh38 alignments of TCGA and TARGET. In addition, researchers who use the CGC to access Controlled Data now are able to create projects that include Open and Controlled Data from both TCGA and TARGET, further facilitating cross-dataset analyses.

Learn more about cross-dataset queries through a sample query.

Divya Sain