J. Taroni 2018

Analyze the DIPG B matrices from the MultiPLIER model

Set up

`%>%` <- dplyr::`%>%`
source(file.path("util", "test_LV_differences.R"))

Directories for this notebook

# plot and result directory setup for this notebook
plot.dir <- file.path("plots", "36")
dir.create(plot.dir, recursive = TRUE, showWarnings = FALSE)
results.dir <- file.path("results", "36")
dir.create(results.dir, recursive = TRUE, showWarnings = FALSE)

Read in files

GSE50021

gse50021.b.file <- file.path("results", "35", "GSE50021_recount2_B.RDS")
gse50021.b <- readRDS(gse50021.b.file)
gse50021.meta.file <- file.path("data", "sample_info", 
                                "GSE50021_cleaned_metadata.tsv")
gse50021.meta.df <- readr::read_tsv(gse50021.meta.file)
Parsed with column specification:
cols(
  sample_accession = col_character(),
  source_name = col_character(),
  tissue = col_character(),
  gender = col_character(),
  age_at_diagnosis_yrs = col_double(),
  overall_survival_yrs = col_double()
)

GSE26576

gse26576.b.file <- file.path("results", "35", "E-GEOD-26576_recount2_B.RDS")
gse26576.b <- readRDS(gse26576.b.file)
gse26576.meta.file <- file.path("data", "sample_info", 
                                "E-GEOD-26576_cleaned_metadata.tsv")
gse26576.meta.df <- readr::read_tsv(gse26576.meta.file)
Parsed with column specification:
cols(
  sample_id = col_character(),
  sample_file = col_character(),
  sample_title = col_character(),
  age_at_diagnosis = col_double(),
  disease_state = col_character(),
  histology = col_character(),
  sample_collection = col_character()
)

Metadata

We should explore the metadata a little bit, because these datasets are both quite small. So, we’ll need to sort out what kinds of analyses are possible.

gse50021.meta.df %>%
  dplyr::group_by(tissue) %>%
  dplyr::tally()
gse26576.meta.df %>%
  dplyr::group_by(disease_state) %>%
  dplyr::tally()

GSE26576 does not have enough normal samples for a DIPG-normal comparison and the other groups in GSE26576 are not present in GSE50021.

We can do a comparison of DIPG-normal in GSE50021.

Differential expression in GSE50021

gse50021.results <- 
  LVTestWrapper(b.matrix = gse50021.b,
                sample.info.df = dplyr::mutate(gse50021.meta.df,
                                               Sample = sample_accession),
                phenotype.col = "tissue",
                file.lead = "GSE50021_normal_dipg",
                plot.dir = plot.dir,
                results.dir = results.dir)
Column `Sample` joining factor and character vector, coercing into character vector
gse50021.results$limma %>%
  dplyr::filter(adj.P.Val < 0.05)

After revisiting Buczkowicz, et al. and the GSE50021 accession, there is no information about which part of the brain the normal brain came from or how it was obtained. So, I am a bit concerned about drawing any conclusions from this analysis, as the differences we see may not be attributable to disease state.

LS0tCnRpdGxlOiAiRElQRzogQW5hbHlzaXMiCm91dHB1dDogICAKICBodG1sX25vdGVib29rOiAKICAgIHRvYzogdHJ1ZQogICAgdG9jX2Zsb2F0OiB0cnVlCi0tLQoKKipKLiBUYXJvbmkgMjAxOCoqCgpBbmFseXplIHRoZSBESVBHIGBCYCBtYXRyaWNlcyBmcm9tIHRoZSBNdWx0aVBMSUVSIG1vZGVsCgojIyBTZXQgdXAKCmBgYHtyfQpgJT4lYCA8LSBkcGx5cjo6YCU+JWAKc291cmNlKGZpbGUucGF0aCgidXRpbCIsICJ0ZXN0X0xWX2RpZmZlcmVuY2VzLlIiKSkKYGBgCgpEaXJlY3RvcmllcyBmb3IgdGhpcyBub3RlYm9vawoKYGBge3J9CiMgcGxvdCBhbmQgcmVzdWx0IGRpcmVjdG9yeSBzZXR1cCBmb3IgdGhpcyBub3RlYm9vawpwbG90LmRpciA8LSBmaWxlLnBhdGgoInBsb3RzIiwgIjM2IikKZGlyLmNyZWF0ZShwbG90LmRpciwgcmVjdXJzaXZlID0gVFJVRSwgc2hvd1dhcm5pbmdzID0gRkFMU0UpCnJlc3VsdHMuZGlyIDwtIGZpbGUucGF0aCgicmVzdWx0cyIsICIzNiIpCmRpci5jcmVhdGUocmVzdWx0cy5kaXIsIHJlY3Vyc2l2ZSA9IFRSVUUsIHNob3dXYXJuaW5ncyA9IEZBTFNFKQpgYGAKCiMjIFJlYWQgaW4gZmlsZXMKCiMjIyMgR1NFNTAwMjEKCmBgYHtyfQpnc2U1MDAyMS5iLmZpbGUgPC0gZmlsZS5wYXRoKCJyZXN1bHRzIiwgIjM1IiwgIkdTRTUwMDIxX3JlY291bnQyX0IuUkRTIikKZ3NlNTAwMjEuYiA8LSByZWFkUkRTKGdzZTUwMDIxLmIuZmlsZSkKZ3NlNTAwMjEubWV0YS5maWxlIDwtIGZpbGUucGF0aCgiZGF0YSIsICJzYW1wbGVfaW5mbyIsIAogICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICJHU0U1MDAyMV9jbGVhbmVkX21ldGFkYXRhLnRzdiIpCmdzZTUwMDIxLm1ldGEuZGYgPC0gcmVhZHI6OnJlYWRfdHN2KGdzZTUwMDIxLm1ldGEuZmlsZSkKYGBgCgojIyMjIEdTRTI2NTc2CgpgYGB7cn0KZ3NlMjY1NzYuYi5maWxlIDwtIGZpbGUucGF0aCgicmVzdWx0cyIsICIzNSIsICJFLUdFT0QtMjY1NzZfcmVjb3VudDJfQi5SRFMiKQpnc2UyNjU3Ni5iIDwtIHJlYWRSRFMoZ3NlMjY1NzYuYi5maWxlKQpnc2UyNjU3Ni5tZXRhLmZpbGUgPC0gZmlsZS5wYXRoKCJkYXRhIiwgInNhbXBsZV9pbmZvIiwgCiAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIkUtR0VPRC0yNjU3Nl9jbGVhbmVkX21ldGFkYXRhLnRzdiIpCmdzZTI2NTc2Lm1ldGEuZGYgPC0gcmVhZHI6OnJlYWRfdHN2KGdzZTI2NTc2Lm1ldGEuZmlsZSkKYGBgCgojIyMgTWV0YWRhdGEKCldlIHNob3VsZCBleHBsb3JlIHRoZSBtZXRhZGF0YSBhIGxpdHRsZSBiaXQsIGJlY2F1c2UgdGhlc2UgZGF0YXNldHMgYXJlIGJvdGgKcXVpdGUgc21hbGwuClNvLCB3ZSdsbCBuZWVkIHRvIHNvcnQgb3V0IHdoYXQga2luZHMgb2YgYW5hbHlzZXMgYXJlIHBvc3NpYmxlLgoKYGBge3J9CmdzZTUwMDIxLm1ldGEuZGYgJT4lCiAgZHBseXI6Omdyb3VwX2J5KHRpc3N1ZSkgJT4lCiAgZHBseXI6OnRhbGx5KCkKYGBgCgpgYGB7cn0KZ3NlMjY1NzYubWV0YS5kZiAlPiUKICBkcGx5cjo6Z3JvdXBfYnkoZGlzZWFzZV9zdGF0ZSkgJT4lCiAgZHBseXI6OnRhbGx5KCkKYGBgCgpgR1NFMjY1NzZgIGRvZXMgbm90IGhhdmUgZW5vdWdoIG5vcm1hbCBzYW1wbGVzIGZvciBhIERJUEctbm9ybWFsIGNvbXBhcmlzb24gYW5kCnRoZSBvdGhlciBncm91cHMgaW4gYEdTRTI2NTc2YCBhcmUgbm90IHByZXNlbnQgaW4gYEdTRTUwMDIxYC4KCldlIGNhbiBkbyBhIGNvbXBhcmlzb24gb2YgRElQRy1ub3JtYWwgaW4gYEdTRTUwMDIxYC4KCiMjIERpZmZlcmVudGlhbCBleHByZXNzaW9uIGluIGBHU0U1MDAyMWAKCmBgYHtyfQpnc2U1MDAyMS5yZXN1bHRzIDwtIAogIExWVGVzdFdyYXBwZXIoYi5tYXRyaXggPSBnc2U1MDAyMS5iLAogICAgICAgICAgICAgICAgc2FtcGxlLmluZm8uZGYgPSBkcGx5cjo6bXV0YXRlKGdzZTUwMDIxLm1ldGEuZGYsCiAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgU2FtcGxlID0gc2FtcGxlX2FjY2Vzc2lvbiksCiAgICAgICAgICAgICAgICBwaGVub3R5cGUuY29sID0gInRpc3N1ZSIsCiAgICAgICAgICAgICAgICBmaWxlLmxlYWQgPSAiR1NFNTAwMjFfbm9ybWFsX2RpcGciLAogICAgICAgICAgICAgICAgcGxvdC5kaXIgPSBwbG90LmRpciwKICAgICAgICAgICAgICAgIHJlc3VsdHMuZGlyID0gcmVzdWx0cy5kaXIpCmBgYAoKYGBge3J9CmdzZTUwMDIxLnJlc3VsdHMkbGltbWEgJT4lCiAgZHBseXI6OmZpbHRlcihhZGouUC5WYWwgPCAwLjA1KQpgYGAKCkFmdGVyIHJldmlzaXRpbmcgW0J1Y3prb3dpY3osIGV0IGFsLl0oaHR0cHM6Ly9keC5kb2kub3JnLzEwLjEwMzgvbmcuMjkzNikgYW5kCnRoZSBbYEdTRTUwMDIxYF0oaHR0cHM6Ly93d3cubmNiaS5ubG0ubmloLmdvdi9nZW8vcXVlcnkvYWNjLmNnaT9hY2M9R1NFNTAwMjEpCmFjY2Vzc2lvbiwgdGhlcmUgaXMgbm8gaW5mb3JtYXRpb24gYWJvdXQgd2hpY2ggX3BhcnRfIG9mIHRoZSBicmFpbiB0aGUgbm9ybWFsCmJyYWluIGNhbWUgZnJvbSBvciBob3cgaXQgd2FzIG9idGFpbmVkLiAKU28sIEkgYW0gYSBiaXQgY29uY2VybmVkIGFib3V0IGRyYXdpbmcgYW55IGNvbmNsdXNpb25zIGZyb20gdGhpcyBhbmFseXNpcywgYXMKdGhlIGRpZmZlcmVuY2VzIHdlIHNlZSBtYXkgbm90IGJlIGF0dHJpYnV0YWJsZSB0byBkaXNlYXNlIHN0YXRlLgoK