Data from Quigley lab projects

Datasets in public controlled access repositories

These data can be obtained by qualified parties who make a formal request through the EGA or dbGaP system. We encourage others to use these data in their research! Please refer to the project authors as the West Coast Dream Team when describing this project in the text of a manuscript or abstract.

  • 5hmC analysis of mCRPC (Sjöström et al Cancer Research 2022, The 5-hydroxymethylcytosine Landscape of Prostate Cancer): EGAS00001004942
  • Hi-C analysis of mCRPC (Zhao et al. Nature Genetics 2024): EGAC00001002873, EGAS00001006604
  • Deep RNA-seq analysis of mCRPC (Zhang et al. Nature Cell Biology 2024): EGAC00001002685
  • ATAC-seq analysis of mCRPC (Shrestha et al. Cancer Research 2024): EGAS00001006698
  • WGS and WGBS of mCRPC (Lundberg et al. Cancer Research 2023): EGAS00001006649
  • WGBS of mCRPC (Zhao et al. Nature Genetics 2000): phs001648.v2.p1
  • WGS and RNA-seq of mCRPC (Quigley et al. Cell 2018): WCDT-MCRPC

Clinical Metadata

All public clinical metadata for WCDT projects can be found at our GitHub link.

The genomic and epigenomic landscape of double-negative metastatic prostate cancer

The West Coast Dream Team collaboration group performed whole genome WGS and WGBS and RNA sequencing of 134 tumor biopsies from patients with metastatic castration-resistant prostate cancer (Lundberg et al. Cancer Research 2023). Scripts employed during the analysis are available on github:
https://github.com/DavidQuigley/WCDT_subtypes. A total of 210 RNA-seq experiments were available for this study; only some of these patients also had WGS and WGBS data. The samples with RNA-seq data in Lundberg et al. Cancer Research 2023 are a superset of samples characterized in Quigley et al. Cell 2018, described below.

mRNA data

Genomic Hallmarks and Structural Variation in Metastatic Prostate Cancer

The West Coast Dream Team collaboration group performed whole genome and DNA and RNA sequencing of 101 tumor biopsies from patients with metastatic castration-resistant prostate cancer (Quigley, Dang, Zhao et al. Cell 2018). Scripts employed during the analysis are available on github:
https://github.com/DavidQuigley/WCDT.

Whole genome DNA sequence data

mRNA data

Laser-capture microdissected tumor tissue was subjected to RNA-seq. RNA reads were then aligned against HG38-decoy using STAR as described in the manuscript, producing per-gene count files (see below). RNA data were available for 99 of the 101 samples with DNA-seq data, so please expect to see 99 columns in matrix files for this study. A total of 26,485 transcripts were assessed for counts. Count files were then processed to calculate TPM values using the code at https://github.com/DavidQuigley/WCDT/scripts/calculate_RNA_tpm.R, using code adapted from https://gist.github.com/slowkow/c6ab0348747f86e2748b. This script marked as absent any individual gene if no sample had at least 100 counts for that gene and if the mean number of counts across all 101 samples was less than 100. After filtering, 16,844 genes with TPM calls were included in the analysis. You can re-process the count data to your own satisfaction using the raw data linked below.

Teaching Resources

Exploratory Data Analysis lab