1 Analyses of 4 S. cerevisiae strains: wild type, and mutants for pseudouridylation (CBF5), UPF1, and both.

The goal of this project is to look for changes in the yeast transcriptome that result from mutations in CBF5, in UPF1, or in both. This document is intended to make the analyses easier to reproduce and improve.

  1. preprocessing.html: The steps performed to preprocess the data.
  2. annotation.html: Annotation data shared by all the downstream analyses.
  3. sample_estimation.html: Checks of the samples for suitability.
  4. differential_expression.html: The differential expression (DE) analyses.
  5. ontology.html: The ontology searches.
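
The pages above are listed in the order they are meant to be read, and presumably built. As a minimal sketch of rebuilding them in that order, the following assumes each page is knitted from an .Rmd file with the same stem (for example, preprocessing.Rmd); those source file names and the render_in_order helper are hypothetical and do not come from this project.

```r
## Hypothetical source names: each linked .html page is assumed to be
## knitted from an .Rmd file with the same stem.
pages <- c("preprocessing", "annotation", "sample_estimation",
           "differential_expression", "ontology")
rmd_files <- paste0(pages, ".Rmd")

## Render each file in the given order, skipping any that do not exist.
## Returns (invisibly) the names of the files that were rendered.
render_in_order <- function(files) {
  rendered <- character(0)
  for (f in files) {
    if (!file.exists(f)) {
      message("Skipping missing file: ", f)
      next
    }
    ## rmarkdown::render() uses the output settings in the file's YAML header.
    rmarkdown::render(f)
    rendered <- c(rendered, f)
  }
  invisible(rendered)
}
```

Calling render_in_order(rmd_files) from the project directory would then rebuild whichever of the five documents are present, in the listed order.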

2 TODO list

The following are requests I have received, along with whether I believe I have completed them:

  • 2017-05-18
      • Plot jacobson vs. wcmu/wcwu scatter plot
      • Split old and new analyses and clean them up
      • Plot old/new data scatter plot for cbf5

3 Finished TODO

  • 2017-05-18
      • Plot jacobson vs. wcmu/wcwu scatter plot, except something is wonky.

4 Installation and setup

These are R Markdown documents that make heavy use of the hpgltools package. The following code shows how to set it up in a clean R environment.

## Use R's install.packages to install devtools.
install.packages("devtools")
## Use devtools to install hpgltools.
devtools::install_github("abelew/hpgltools")
## Load hpgltools into the R environment.
library(hpgltools)

## Report the R session used to build this document.
library("pander")
pander(sessionInfo())

R version 3.4.4 (2018-03-15)

**Platform:** x86_64-pc-linux-gnu (64-bit)

locale: LC_CTYPE=en_US.utf8, LC_NUMERIC=C, LC_TIME=en_US.utf8, LC_COLLATE=en_US.utf8, LC_MONETARY=en_US.utf8, LC_MESSAGES=en_US.utf8, LC_PAPER=en_US.utf8, LC_NAME=C, LC_ADDRESS=C, LC_TELEPHONE=C, LC_MEASUREMENT=en_US.utf8 and LC_IDENTIFICATION=C

attached base packages: stats, graphics, grDevices, utils, datasets, methods and base

other attached packages: pander(v.0.6.1) and hpgltools(v.2018.03)

loaded via a namespace (and not attached): Rcpp(v.0.12.16), xml2(v.1.2.0), roxygen2(v.6.0.1), knitr(v.1.20), magrittr(v.1.5), devtools(v.1.13.5), BiocGenerics(v.0.24.0), munsell(v.0.4.3), colorspace(v.1.3-2), R6(v.2.2.2), rlang(v.0.2.0), foreach(v.1.4.4), plyr(v.1.8.4), stringr(v.1.3.0), tools(v.3.4.4), parallel(v.3.4.4), grid(v.3.4.4), Biobase(v.2.38.0), data.table(v.1.10.4-3), gtable(v.0.2.0), withr(v.2.1.2), commonmark(v.1.4), htmltools(v.0.3.6), iterators(v.1.0.9), lazyeval(v.0.2.1), yaml(v.2.1.18), rprojroot(v.1.3-2), digest(v.0.6.15), tibble(v.1.4.2), ggplot2(v.2.2.1), base64enc(v.0.1-3), codetools(v.0.2-15), memoise(v.1.1.0), evaluate(v.0.10.1), rmarkdown(v.1.9), stringi(v.1.1.7), pillar(v.1.2.1), compiler(v.3.4.4), scales(v.0.5.0) and backports(v.1.1.2)
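
The setup code above installs devtools and hpgltools unconditionally. As a minimal sketch, the helpers below check which of the two packages are missing before installing anything. Both missing_packages and install_if_missing are hypothetical names introduced here for illustration; they are not part of hpgltools.

```r
## Return the subset of `pkgs` whose namespaces cannot be loaded.
## Hypothetical helper, not part of hpgltools.
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

## Install devtools and hpgltools only when they are absent.
## With dry_run = TRUE (the default) nothing is installed; the function
## only reports which packages would be installed.
install_if_missing <- function(dry_run = TRUE) {
  todo <- missing_packages(c("devtools", "hpgltools"))
  if (dry_run) {
    return(todo)
  }
  if ("devtools" %in% todo) {
    install.packages("devtools")
  }
  if ("hpgltools" %in% todo) {
    devtools::install_github("abelew/hpgltools")
  }
  invisible(todo)
}
```

Running install_if_missing() first shows what would be installed; install_if_missing(dry_run = FALSE) then performs the installation.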
