1 Analyses of 4 S.cerevisiae strains, wt and mutant for pseudouridylation(CBF5), UPF1, and double.

The goal of this project is to look for changes in the yeast transcriptome as a result of a mutation(s) in CBF5, UPF1, and both. This document is intended to make it easier to reproduce/improve the analyses performed.

  1. preprocessing.html The steps performed to preprocess the data.
  2. annotation.html Data shared by all the downstream analyses.
  3. sample_estimation.html Check the samples for suitability.
  4. differential_expression.html Performing the DE analyses.
  5. ontology.html Perform ontology searches.

2 TODO list

The following are some requests I have received and whether or not I think I did them:

3 Installation and setup

These are rmarkdown documents which make heavy use of the hpgltools package. The following section demonstrates how to set that up in a clean R environment.

## Use R's install.packages to install devtools.
install.packages("devtools")
## Use devtools to install hpgltools.
devtools::install_github("abelew/hpgltools")
## Load hpgltools into the R environment.
library(hpgltools)
library('pander')
pander(sessionInfo())

R version 3.3.3 (2017-03-06)

**Platform:** x86_64-pc-linux-gnu (64-bit)

locale: LC_CTYPE=en_US.UTF-8, LC_NUMERIC=C, LC_TIME=en_US.UTF-8, LC_COLLATE=en_US.UTF-8, LC_MONETARY=en_US.UTF-8, LC_MESSAGES=en_US.UTF-8, LC_PAPER=en_US.UTF-8, LC_NAME=C, LC_ADDRESS=C, LC_TELEPHONE=C, LC_MEASUREMENT=en_US.UTF-8 and LC_IDENTIFICATION=C

attached base packages: stats4, parallel, stats, graphics, grDevices, utils, datasets, methods and base

other attached packages: hpgltools(v.2017.01), ruv(v.0.9.6), pander(v.0.6.0), TxDb.Scerevisiae.UCSC.sacCer3.sgdGene(v.3.2.2), GenomicFeatures(v.1.26.4), GenomicRanges(v.1.26.4), GenomeInfoDb(v.1.10.3), BiocInstaller(v.1.24.0), AnnotationDbi(v.1.36.2), IRanges(v.2.8.2), S4Vectors(v.0.12.2), Biobase(v.2.34.0), AnnotationHub(v.2.6.5) and BiocGenerics(v.0.20.0)

loaded via a namespace (and not attached): nlme(v.3.1-131), bitops(v.1.0-6), matrixStats(v.0.52.2), devtools(v.1.13.1), RColorBrewer(v.1.1-2), httr(v.1.2.1), rprojroot(v.1.2), tools(v.3.3.3), backports(v.1.0.5), R6(v.2.2.1), DBI(v.0.6-1), lazyeval(v.0.2.0), mgcv(v.1.8-17), colorspace(v.1.3-2), withr(v.1.0.2), curl(v.2.6), compiler(v.3.3.3), preprocessCore(v.1.36.0), xml2(v.1.1.1), rtracklayer(v.1.34.2), labeling(v.0.3), scales(v.0.4.1), genefilter(v.1.56.0), commonmark(v.1.2), stringr(v.1.2.0), digest(v.0.6.12), Rsamtools(v.1.26.2), rmarkdown(v.1.5), XVector(v.0.14.1), base64enc(v.0.1-3), htmltools(v.0.3.6), limma(v.3.30.13), RSQLite(v.1.1-2), shiny(v.1.0.3), BiocParallel(v.1.8.2), gtools(v.3.5.0), dplyr(v.0.5.0), RCurl(v.1.95-4.8), magrittr(v.1.5), GO.db(v.3.4.0), Matrix(v.1.2-10), Rcpp(v.0.12.10), munsell(v.0.4.3), stringi(v.1.1.5), yaml(v.2.1.14), edgeR(v.3.16.5), SummarizedExperiment(v.1.4.0), zlibbioc(v.1.20.0), plyr(v.1.8.4), grid(v.3.3.3), ggrepel(v.0.6.5), crayon(v.1.3.2), lattice(v.0.20-35), Biostrings(v.2.42.1), splines(v.3.3.3), annotate(v.1.52.1), locfit(v.1.5-9.1), knitr(v.1.15.1), corpcor(v.1.6.9), reshape2(v.1.4.2), codetools(v.0.2-15), biomaRt(v.2.30.0), XML(v.3.98-1.7), evaluate(v.0.10), data.table(v.1.10.4), httpuv(v.1.3.3), foreach(v.1.4.3), testthat(v.1.0.2), gtable(v.0.2.0), assertthat(v.0.2.0), ggplot2(v.2.2.1), openxlsx(v.4.0.17), mime(v.0.5), xtable(v.1.8-2), roxygen2(v.6.0.1), survival(v.2.41-3), tibble(v.1.3.0), iterators(v.1.0.8), GenomicAlignments(v.1.10.1), memoise(v.1.1.0), sva(v.3.22.0) and interactiveDisplayBase(v.1.12.0)

LS0tCnRpdGxlOiAiUk5Bc2VxIG9mIFNhY2NoYXJvbXljZXMgY2VyZXZpc2lhZSBzdHJhaW5zLiIKYXV0aG9yOiAiYXRiIGFiZWxld0BnbWFpbC5jb20iCmRhdGU6ICJgciBTeXMuRGF0ZSgpYCIKb3V0cHV0OgogaHRtbF9kb2N1bWVudDoKICBjb2RlX2Rvd25sb2FkOiB0cnVlCiAgY29kZV9mb2xkaW5nOiBzaG93CiAgZmlnX2NhcHRpb246IHRydWUKICBmaWdfaGVpZ2h0OiA3CiAgZmlnX3dpZHRoOiA3CiAgaGlnaGxpZ2h0OiBkZWZhdWx0CiAga2VlcF9tZDogZmFsc2UKICBtb2RlOiBzZWxmY29udGFpbmVkCiAgbnVtYmVyX3NlY3Rpb25zOiB0cnVlCiAgc2VsZl9jb250YWluZWQ6IHRydWUKICB0aGVtZTogcmVhZGFibGUKICB0b2M6IHRydWUKICB0b2NfZmxvYXQ6CiAgICBjb2xsYXBzZWQ6IGZhbHNlCiAgICBzbW9vdGhfc2Nyb2xsOiBmYWxzZQotLS0KCjxzdHlsZT4KICA8IS0tIERvY3VtZW50IHByZWx1ZGUgcmV2aXNpb24gMjAxNy0wMiAtLT4KICBib2R5IC5tYWluLWNvbnRhaW5lciB7CiAgICBtYXgtd2lkdGg6IDE2MDBweDsKfQo8L3N0eWxlPgoKYGBge3Igb3B0aW9ucywgaW5jbHVkZT1GQUxTRX0KIyMgVGhlc2UgYXJlIHRoZSBvcHRpb25zIEkgdGVuZCB0byBmYXZvcgpsaWJyYXJ5KCJocGdsdG9vbHMiKQp0dCA8LSBkZXZ0b29sczo6bG9hZF9hbGwoIn4vaHBnbHRvb2xzIikKa25pdHI6Om9wdHNfa25pdCRzZXQoCiAgICBwcm9ncmVzcyA9IFRSVUUsCiAgICB2ZXJib3NlID0gVFJVRSwKICAgIHdpZHRoID0gOTAsCiAgICBlY2hvID0gVFJVRSkKa25pdHI6Om9wdHNfY2h1bmskc2V0KAogICAgZXJyb3IgPSBUUlVFLAogICAgZmlnLndpZHRoID0gOCwKICAgIGZpZy5oZWlnaHQgPSA4LAogICAgZHBpID0gOTYpCm9wdGlvbnMoCiAgICBkaWdpdHMgPSA0LAogICAgc3RyaW5nc0FzRmFjdG9ycyA9IEZBTFNFLAogICAga25pdHIuZHVwbGljYXRlLmxhYmVsID0gImFsbG93IikKZ2dwbG90Mjo6dGhlbWVfc2V0KGdncGxvdDI6OnRoZW1lX2J3KGJhc2Vfc2l6ZT0xMCkpCnNldC5zZWVkKDEpCnZlciA8LSAiMjAxNzA1MTUiCnByZXZpb3VzX2ZpbGUgPC0gImluZGV4LlJtZCIKYGBgCgpgYGB7ciBsb2FkbWUsIGluY2x1ZGU9RkFMU0V9CnRtcCA8LSBzbSh0cnkobG9hZG1lKGZpbGVuYW1lPXBhc3RlMChnc3ViKHBhdHRlcm49IlxcLlJtZCIsIHJlcGxhY2U9IiIsIHg9cHJldmlvdXNfZmlsZSksICItdiIsIHZlciwgIi5yZGEueHoiKSkpKQoKcm1kX2ZpbGUgPC0gImluZGV4LlJtZCIKdGhpc19zYXZlIDwtIHBhc3RlMChnc3ViKHBhdHRlcm49IlxcLlJtZCIsIHJlcGxhY2U9IiIsIHg9cm1kX2ZpbGUpLCAiLXYiLCB2ZXIsICIucmRhLnh6IikKYGBgCgpgYGB7ciByZW5kZXIsIGV2YWw9RkFMU0UsIGluY2x1ZGU9RkFMU0V9CiMjIFRoaXMgYmxvY2sgaXMgdXNlZCB0byByZW5kZXIgYSBkb2N1bWVudCBmcm9tIHdpdGhpbiBpdC4Kcm1hcmtkb3duOjpyZW5kZXIocm1kX2ZpbGUpCgojIyBBbiBleHRyYSByZW5kZXJlciBmb3IgcGRmIG91dHB1dApybWFya2Rvd246OnJlbmRlcihybWRfZmlsZSwgb3V0cHV0X2Zvcm1hdD0icGRmX2RvY3VtZW50Iiwgb3V0cHV0X29wdGlvbnM9Yygic2tpcF9odG1sIikpCmBgYAoKQW5hbHlzZXMgb2YgNCBTLmNlcmV2aXNpYWUgc3RyYWlucywgd3QgYW5kIG11dGFudCBmb3IgcHNldWRvdXJpZHlsYXRpb24oQ0JGNSksIFVQRjEsIGFuZCBkb3VibGUuCj09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PT09PQoKVGhlIGdvYWwgb2YgdGhpcyBwcm9qZWN0IGlzIHRvIGxvb2sgZm9yIGNoYW5nZXMgaW4gdGhlIHllYXN0IHRyYW5zY3JpcHRvbWUgYXMgYSByZXN1bHQgb2YgYQptdXRhdGlvbihzKSBpbiBDQkY1LCBVUEYxLCBhbmQgYm90aC4gIFRoaXMgZG9jdW1lbnQgaXMgaW50ZW5kZWQgdG8gbWFrZSBpdCBlYXNpZXIgdG8KcmVwcm9kdWNlL2ltcHJvdmUgdGhlIGFuYWx5c2VzIHBlcmZvcm1lZC4KCjEuICBbcHJlcHJvY2Vzc2luZy5odG1sXShwcmVwcm9jZXNzaW5nLmh0bWwpICBUaGUgc3RlcHMgcGVyZm9ybWVkIHRvIHByZXByb2Nlc3MgdGhlIGRhdGEuCjIuICBbYW5ub3RhdGlvbi5odG1sXShhbm5vdGF0aW9uLmh0bWwpICBEYXRhIHNoYXJlZCBieSBhbGwgdGhlIGRvd25zdHJlYW0gYW5hbHlzZXMuCjMuICBbc2FtcGxlX2VzdGltYXRpb24uaHRtbF0oc2FtcGxlX2VzdGltYXRpb24uaHRtbCkgIENoZWNrIHRoZSBzYW1wbGVzIGZvciBzdWl0YWJpbGl0eS4KNC4gIFtkaWZmZXJlbnRpYWxfZXhwcmVzc2lvbi5odG1sXShkaWZmZXJlbnRpYWxfZXhwcmVzc2lvbi5odG1sKSAgUGVyZm9ybWluZyB0aGUgREUgYW5hbHlzZXMuCjUuICBbb250b2xvZ3kuaHRtbF0ob250b2xvZ3kuaHRtbCkgIFBlcmZvcm0gb250b2xvZ3kgc2VhcmNoZXMuCgojIFRPRE8gbGlzdAoKVGhlIGZvbGxvd2luZyBhcmUgc29tZSByZXF1ZXN0cyBJIGhhdmUgcmVjZWl2ZWQgYW5kIHdoZXRoZXIgb3Igbm90IEkgdGhpbmsgSSBkaWQgdGhlbToKCgojIEluc3RhbGxhdGlvbiBhbmQgc2V0dXAKClRoZXNlIGFyZSBybWFya2Rvd24gZG9jdW1lbnRzIHdoaWNoIG1ha2UgaGVhdnkgdXNlIG9mIHRoZSBocGdsdG9vbHMgcGFja2FnZS4gIFRoZSBmb2xsb3dpbmcgc2VjdGlvbgpkZW1vbnN0cmF0ZXMgaG93IHRvIHNldCB0aGF0IHVwIGluIGEgY2xlYW4gUiBlbnZpcm9ubWVudC4KCmBgYHtyIHNldHVwLCBldmFsPUZBTFNFfQojIyBVc2UgUidzIGluc3RhbGwucGFja2FnZXMgdG8gaW5zdGFsbCBkZXZ0b29scy4KaW5zdGFsbC5wYWNrYWdlcygiZGV2dG9vbHMiKQojIyBVc2UgZGV2dG9vbHMgdG8gaW5zdGFsbCBocGdsdG9vbHMuCmRldnRvb2xzOjppbnN0YWxsX2dpdGh1YigiYWJlbGV3L2hwZ2x0b29scyIpCiMjIExvYWQgaHBnbHRvb2xzIGludG8gdGhlIFIgZW52aXJvbm1lbnQuCmxpYnJhcnkoaHBnbHRvb2xzKQpgYGAKCmBgYHtyIHN5c2luZm8sIHJlc3VsdHM9J2FzaXMnfQpsaWJyYXJ5KCdwYW5kZXInKQpwYW5kZXIoc2Vzc2lvbkluZm8oKSkKYGBgCg==