1 Which genes are DE in human macrophages at 4 hours upon infection with L. major?

2 Gather annotation data

I want to perform a series of comparisons among the host cells: human and mouse. Thus I need to collect annotation data for both species and get the set of orthologs between them.

2.1 Start with the human annotation data

hs_annot <- load_biomart_annotations()$annotation

## The biomart annotations file already exists, loading from it.

rownames(hs_annot) <- make.names(
  paste0(hs_annot[["ensembl_transcript_id"]], ".",
         hs_annot[["transcript_version"]]),
  unique=TRUE)
hs_tx_gene <- hs_annot[, c("ensembl_gene_id", "ensembl_transcript_id")]
hs_tx_gene[["id"]] <- rownames(hs_tx_gene)
hs_tx_gene <- hs_tx_gene[, c("id", "ensembl_gene_id")]
new_hs_annot <- hs_annot
rownames(new_hs_annot) <- make.names(hs_annot[["ensembl_gene_id"]], unique=TRUE)

2.2 Generate expressionsets

The question is reasonably self-contained. I want to compare the uninfected human samples against any samples which were infected for 4 hours. So let us first pull those samples and then poke at them a bit.

sample_sheet <- "sample_sheets/leishmania_host_metasheet_20190401.xlsx"
hs_expt <- create_expt(sample_sheet,
                       file_column="hsapiensfile",
                       gene_info=new_hs_annot,
                       tx_gene_map=hs_tx_gene)

## Reading the sample metadata.

## The sample definitions comprises: 437 rows(samples) and 55 columns(metadata fields).

## Reading count tables.

## Using the transcript to gene mapping.

## Reading salmon data with tximport.

## Finished reading count tables.

## Matched 19629 annotations and counts.

## Bringing together the count matrix and gene information.

## The mapped IDs are not the rownames of your gene information, changing them now.

## Some annotations were lost in merging, setting them to 'undefined'.

hs_expt_noskipped <- subset_expt(hs_expt, subset="skipped!='yes'")

## There were 267, now there are 247 samples.

hs_t4h_expt <- subset_expt(hs_expt_noskipped, subset="expttime=='t4h'")

## There were 247, now there are 64 samples.

hs_t4h_expt <- set_expt_conditions(hs_t4h_expt, fact="infectstate")
hs_t4h_expt <- set_expt_batches(hs_t4h_expt, fact="study")
table(hs_t4h_expt$conditions)

## 
## bead   no stim  yes 
##    3   18   35    8

table(hs_t4h_expt$batches)

## 
## lps-timecourse       m-gm-csf           mbio 
##              8             39             17

2.3 Examine t4h vs uninfected

hs_t4h_plots <- sm(graph_metrics(hs_t4h_expt))

hs_t4h_norm <- normalize_expt(hs_t4h_expt, norm="quant", convert="cpm",
                              transform="log2", filter=TRUE, batch="svaseq")

## This function will replace the expt$expressionset slot with:

## log2(svaseq(cpm(quant(cbcb(data)))))

## It backs up the current data into a slot named:
##  expt$backup_expressionset. It will also save copies of each step along the way
##  in expt$normalized with the corresponding libsizes. Keep the libsizes in mind
##  when invoking limma.  The appropriate libsize is the non-log(cpm(normalized)).
##  This is most likely kept at:
##  'new_expt$normalized$intermediate_counts$normalization$libsizes'
##  A copy of this may also be found at:
##  new_expt$best_libsize

## Warning in normalize_expt(hs_t4h_expt, norm = "quant", convert = "cpm", :
## Quantile normalization and sva do not always play well together.

## Step 1: performing count filter with option: cbcb

## Removing 7360 low-count genes (12269 remaining).

## Step 2: normalizing the data with quant.

## Using normalize.quantiles.robust due to a thread error in preprocessCore.

## Step 3: converting the data with cpm.

## Step 4: transforming the data with log2.

## transform_counts: Found 3822 values equal to 0, adding 1 to the matrix.

## Step 5: doing batch correction with svaseq.

## Note to self:  If you get an error like 'x contains missing values' The data has too many 0's and needs a stronger low-count filter applied.

## Passing off to all_adjusters.

## batch_counts: Before batch/surrogate estimation, 711279 entries are x>1: 90.6%.

## batch_counts: Before batch/surrogate estimation, 3822 entries are x==0: 0.487%.

## batch_counts: Before batch/surrogate estimation, 70115 entries are 0<x<1: 8.93%.

## The be method chose 12 surrogate variable(s).

## Attempting svaseq estimation with 12 surrogates.

## There are 1760 (0.224%) elements which are < 0 after batch correction.

hs_t4h_norm_plots <- sm(graph_metrics(hs_t4h_norm))

2.3.1 Print some of the plots

hs_t4h_plots$legend

hs_t4h_plots$libsize

hs_t4h_plots$boxplot

hs_t4h_norm_plots$pc_plot

2.4 Remove stimulated samples

hs_t4h_inf <- subset_expt(hs_t4h_expt, subset="condition!='stim'")

## There were 64, now there are 29 samples.

hs_t4h_inf_norm <- normalize_expt(hs_t4h_inf, transform="log2", convert="cpm",
                                  filter=TRUE, batch="svaseq")

## This function will replace the expt$expressionset slot with:

## log2(svaseq(cpm(cbcb(data))))

## It backs up the current data into a slot named:
##  expt$backup_expressionset. It will also save copies of each step along the way
##  in expt$normalized with the corresponding libsizes. Keep the libsizes in mind
##  when invoking limma.  The appropriate libsize is the non-log(cpm(normalized)).
##  This is most likely kept at:
##  'new_expt$normalized$intermediate_counts$normalization$libsizes'
##  A copy of this may also be found at:
##  new_expt$best_libsize

## Leaving the data unnormalized.  This is necessary for DESeq, but
##  EdgeR/limma might benefit from normalization.  Good choices include quantile,
##  size-factor, tmm, etc.

## Step 1: performing count filter with option: cbcb

## Removing 7759 low-count genes (11870 remaining).

## Step 2: not normalizing the data.

## Step 3: converting the data with cpm.

## Step 4: transforming the data with log2.

## transform_counts: Found 3276 values equal to 0, adding 1 to the matrix.

## Step 5: doing batch correction with svaseq.

## Note to self:  If you get an error like 'x contains missing values' The data has too many 0's and needs a stronger low-count filter applied.

## Passing off to all_adjusters.

## batch_counts: Before batch/surrogate estimation, 319013 entries are x>1: 92.7%.

## batch_counts: Before batch/surrogate estimation, 3276 entries are x==0: 0.952%.

## batch_counts: Before batch/surrogate estimation, 21941 entries are 0<x<1: 6.37%.

## The be method chose 6 surrogate variable(s).

## Attempting svaseq estimation with 6 surrogates.

## There are 557 (0.162%) elements which are < 0 after batch correction.

hs_t4h_pca <- plot_pca(hs_t4h_inf_norm, plot_title="H. sapiens, L. major, t4h")
hs_t4h_pca$plot

hs_t4h_de <- all_pairwise(hs_t4h_inf, model_batch="svaseq", filter=TRUE)

## batch_counts: Before batch/surrogate estimation, 339179 entries are x>1: 98.5%.

## batch_counts: Before batch/surrogate estimation, 3276 entries are x==0: 0.952%.

## batch_counts: Before batch/surrogate estimation, 216 entries are 0<x<1: 0.0627%.

## The be method chose 5 surrogate variable(s).

## Attempting svaseq estimation with 5 surrogates.

## This function will replace the expt$expressionset slot with:

## log2(cpm(quant(cbcb(data))))

## It backs up the current data into a slot named:
##  expt$backup_expressionset. It will also save copies of each step along the way
##  in expt$normalized with the corresponding libsizes. Keep the libsizes in mind
##  when invoking limma.  The appropriate libsize is the non-log(cpm(normalized)).
##  This is most likely kept at:
##  'new_expt$normalized$intermediate_counts$normalization$libsizes'
##  A copy of this may also be found at:
##  new_expt$best_libsize

## Not correcting the count-data for batch effects.  If batch is
##  included in EdgerR/limma's model, then this is probably wise; but in extreme
##  batch effects this is a good parameter to play with.

## Step 1: performing count filter with option: cbcb

## Removing 0 low-count genes (11870 remaining).

## Step 2: normalizing the data with quant.

## Using normalize.quantiles.robust due to a thread error in preprocessCore.

## Step 3: converting the data with cpm.

## Step 4: transforming the data with log2.

## transform_counts: Found 1964 values equal to 0, adding 1 to the matrix.

## Step 5: not doing batch correction.

## Plotting a PCA before surrogates/batch inclusion.

## Using svaseq to visualize before/after batch inclusion.

## Performing a test normalization with: raw

## This function will replace the expt$expressionset slot with:

## log2(svaseq(cpm(cbcb(data))))

## It backs up the current data into a slot named:
##  expt$backup_expressionset. It will also save copies of each step along the way
##  in expt$normalized with the corresponding libsizes. Keep the libsizes in mind
##  when invoking limma.  The appropriate libsize is the non-log(cpm(normalized)).
##  This is most likely kept at:
##  'new_expt$normalized$intermediate_counts$normalization$libsizes'
##  A copy of this may also be found at:
##  new_expt$best_libsize

## Leaving the data unnormalized.  This is necessary for DESeq, but
##  EdgeR/limma might benefit from normalization.  Good choices include quantile,
##  size-factor, tmm, etc.

## Step 1: performing count filter with option: cbcb

## Removing 0 low-count genes (11870 remaining).

## Step 2: not normalizing the data.

## Step 3: converting the data with cpm.

## Step 4: transforming the data with log2.

## transform_counts: Found 3276 values equal to 0, adding 1 to the matrix.

## Step 5: doing batch correction with svaseq.

## Note to self:  If you get an error like 'x contains missing values' The data has too many 0's and needs a stronger low-count filter applied.

## Passing off to all_adjusters.

## batch_counts: Before batch/surrogate estimation, 319013 entries are x>1: 92.7%.

## batch_counts: Before batch/surrogate estimation, 3276 entries are x==0: 0.952%.

## batch_counts: Before batch/surrogate estimation, 21941 entries are 0<x<1: 6.37%.

## The be method chose 6 surrogate variable(s).

## Attempting svaseq estimation with 6 surrogates.

## There are 557 (0.162%) elements which are < 0 after batch correction.

## Error in checkForRemoteErrors(val): one node produced an error: c("Error in DESeqDataSet(se, design = design, ignoreRank) : \n  some values in assay are not integers\n", "deseq")

hs_t4h_table <- combine_de_tables

3 Which genes are DE in mouse macrophages at 4 hours upon infection with L. major?

4 Gather annotation data

I want to perform a series of comparisons among the host cells: human and mouse. Thus I need to collect annotation data for both species and get the set of orthologs between them.

4.1 Start with the human annotation data

mm_annot <- load_biomart_annotations(species="mmusculus")$annotation

## The biomart annotations file already exists, loading from it.

rownames(mm_annot) <- make.names(
  paste0(mm_annot[["ensembl_transcript_id"]], ".",
         mm_annot[["transcript_version"]]),
  unique=TRUE)
mm_tx_gene <- mm_annot[, c("ensembl_gene_id", "ensembl_transcript_id")]
mm_tx_gene[["id"]] <- rownames(mm_tx_gene)
mm_tx_gene <- mm_tx_gene[, c("id", "ensembl_gene_id")]
new_mm_annot <- mm_annot
rownames(new_mm_annot) <- make.names(mm_annot[["ensembl_gene_id"]], unique=TRUE)

4.2 Generate expressionsets

mm_expt <- create_expt(sample_sheet,
                       file_column="mmusculusfile",
                       gene_info=new_mm_annot,
                       tx_gene_map=mm_tx_gene)

## Reading the sample metadata.

## The sample definitions comprises: 437 rows(samples) and 55 columns(metadata fields).

## Reading count tables.

## Using the transcript to gene mapping.

## Reading salmon data with tximport.

## Finished reading count tables.

## Matched 19660 annotations and counts.

## Bringing together the count matrix and gene information.

## The mapped IDs are not the rownames of your gene information, changing them now.

## Some annotations were lost in merging, setting them to 'undefined'.

mm_t4h_expt <- subset_expt(mm_expt, subset="expttime=='t4h'")

## There were 105, now there are 41 samples.

mm_t4h_expt <- set_expt_conditions(mm_t4h_expt, fact="infectstate")
table(mm_t4h_expt$conditions)

## 
##  no yes 
##  35   6

table(mm_t4h_expt$batches)

## 
## undefined 
##        41

4.3 Examine t4h vs uninfected

mm_t4h_plots <- sm(graph_metrics(mm_t4h_expt))

mm_t4h_norm <- normalize_expt(mm_t4h_expt, norm="quant", convert="cpm",
                              transform="log2", filter=TRUE, batch="svaseq")

## This function will replace the expt$expressionset slot with:

## log2(svaseq(cpm(quant(cbcb(data)))))

## It backs up the current data into a slot named:
##  expt$backup_expressionset. It will also save copies of each step along the way
##  in expt$normalized with the corresponding libsizes. Keep the libsizes in mind
##  when invoking limma.  The appropriate libsize is the non-log(cpm(normalized)).
##  This is most likely kept at:
##  'new_expt$normalized$intermediate_counts$normalization$libsizes'
##  A copy of this may also be found at:
##  new_expt$best_libsize

## Warning in normalize_expt(mm_t4h_expt, norm = "quant", convert = "cpm", :
## Quantile normalization and sva do not always play well together.

## Step 1: performing count filter with option: cbcb

## Removing 9350 low-count genes (10310 remaining).

## Step 2: normalizing the data with quant.

## Using normalize.quantiles.robust due to a thread error in preprocessCore.

## Step 3: converting the data with cpm.

## Step 4: transforming the data with log2.

## transform_counts: Found 38 values equal to 0, adding 1 to the matrix.

## Step 5: doing batch correction with svaseq.

## Note to self:  If you get an error like 'x contains missing values' The data has too many 0's and needs a stronger low-count filter applied.

## Passing off to all_adjusters.

## batch_counts: Before batch/surrogate estimation, 390888 entries are x>1: 92.5%.

## batch_counts: Before batch/surrogate estimation, 38 entries are x==0: 0.00899%.

## batch_counts: Before batch/surrogate estimation, 31784 entries are 0<x<1: 7.52%.

## The be method chose 7 surrogate variable(s).

## Attempting svaseq estimation with 7 surrogates.

## There are 648 (0.153%) elements which are < 0 after batch correction.

mm_t4h_norm_plots <- sm(graph_metrics(mm_t4h_norm))

4.3.1 Print some of the plots

mm_t4h_plots$legend

mm_t4h_plots$libsize

mm_t4h_plots$boxplot

mm_t4h_norm_plots$pc_plot

pander::pander(sessionInfo())

R version 3.5.3 (2019-03-11)

Platform: x86_64-pc-linux-gnu (64-bit)

locale: LC_CTYPE=en_US.utf8, LC_NUMERIC=C, LC_TIME=en_US.utf8, LC_COLLATE=en_US.utf8, LC_MONETARY=en_US.utf8, LC_MESSAGES=en_US.utf8, LC_PAPER=en_US.utf8, LC_NAME=C, LC_ADDRESS=C, LC_TELEPHONE=C, LC_MEASUREMENT=en_US.utf8 and LC_IDENTIFICATION=C

attached base packages: parallel, stats, graphics, grDevices, utils, datasets, methods and base

other attached packages: ruv(v.0.9.7), hpgltools(v.1.0), Biobase(v.2.42.0) and BiocGenerics(v.0.28.0)

loaded via a namespace (and not attached): backports(v.1.1.3), fastmatch(v.1.1-0), plyr(v.1.8.4), igraph(v.1.2.4), lazyeval(v.0.2.1), splines(v.3.5.3), BiocParallel(v.1.16.6), usethis(v.1.4.0), GenomeInfoDb(v.1.18.2), ggplot2(v.3.1.0), urltools(v.1.7.2), sva(v.3.30.1), digest(v.0.6.18), foreach(v.1.4.4), htmltools(v.0.3.6), GOSemSim(v.2.8.0), viridis(v.0.5.1), GO.db(v.3.7.0), gdata(v.2.18.0), magrittr(v.1.5), memoise(v.1.1.0), doParallel(v.1.0.14), openxlsx(v.4.1.0), limma(v.3.38.3), remotes(v.2.0.2), readr(v.1.3.1), Biostrings(v.2.50.2), annotate(v.1.60.1), matrixStats(v.0.54.0), enrichplot(v.1.2.0), prettyunits(v.1.0.2), colorspace(v.1.4-0), blob(v.1.1.1), ggrepel(v.0.8.0), xfun(v.0.5), dplyr(v.0.8.0.1), tximport(v.1.10.1), callr(v.3.1.1), crayon(v.1.3.4), RCurl(v.1.95-4.12), jsonlite(v.1.6), genefilter(v.1.64.0), lme4(v.1.1-21), survival(v.2.43-3), iterators(v.1.0.10), glue(v.1.3.1), polyclip(v.1.9-1), gtable(v.0.2.0), zlibbioc(v.1.28.0), XVector(v.0.22.0), UpSetR(v.1.3.3), DelayedArray(v.0.8.0), pkgbuild(v.1.0.2), scales(v.1.0.0), DOSE(v.3.8.2), edgeR(v.3.24.3), DBI(v.1.0.0), Rcpp(v.1.0.1), viridisLite(v.0.3.0), xtable(v.1.8-3), progress(v.1.2.0), gridGraphics(v.0.3-0), bit(v.1.1-14), europepmc(v.0.3), preprocessCore(v.1.45.0), stats4(v.3.5.3), httr(v.1.4.0), fgsea(v.1.8.0), gplots(v.3.0.1.1), RColorBrewer(v.1.1-2), pkgconfig(v.2.0.2), XML(v.3.98-1.19), farver(v.1.1.0), locfit(v.1.5-9.1), labeling(v.0.3), ggplotify(v.0.0.3), tidyselect(v.0.2.5), rlang(v.0.3.1), reshape2(v.1.4.3), AnnotationDbi(v.1.44.0), munsell(v.0.5.0), tools(v.3.5.3), cli(v.1.0.1), RSQLite(v.2.1.1), ggridges(v.0.5.1), devtools(v.2.0.1), evaluate(v.0.13), stringr(v.1.4.0), yaml(v.2.2.0), processx(v.3.3.0), knitr(v.1.22), bit64(v.0.9-7), fs(v.1.2.6), pander(v.0.6.3), zip(v.2.0.1), caTools(v.1.17.1.2), purrr(v.0.3.1), ggraph(v.1.0.2), packrat(v.0.5.0), nlme(v.3.1-137), xml2(v.1.2.0), DO.db(v.2.9), biomaRt(v.2.38.0), compiler(v.3.5.3), pbkrtest(v.0.4-7), rstudioapi(v.0.9.0), variancePartition(v.1.12.1), testthat(v.2.0.1), tibble(v.2.0.1), tweenr(v.1.0.1), stringi(v.1.4.3), ps(v.1.3.0), GenomicFeatures(v.1.34.4), desc(v.1.2.0), lattice(v.0.20-38), Matrix(v.1.2-16), nloptr(v.1.2.1), pillar(v.1.3.1), triebeard(v.0.3.0), corpcor(v.1.6.9), data.table(v.1.12.0), cowplot(v.0.9.4), bitops(v.1.0-6), rtracklayer(v.1.42.2), GenomicRanges(v.1.34.0), qvalue(v.2.14.1), colorRamps(v.2.3), directlabels(v.2018.05.22), R6(v.2.4.0), KernSmooth(v.2.23-15), gridExtra(v.2.3), IRanges(v.2.16.0), sessioninfo(v.1.1.1), codetools(v.0.2-16), boot(v.1.3-20), MASS(v.7.3-51.1), gtools(v.3.8.1), assertthat(v.0.2.0), pkgload(v.1.0.2), SummarizedExperiment(v.1.12.0), rprojroot(v.1.3-2), withr(v.2.1.2), GenomicAlignments(v.1.18.1), Rsamtools(v.1.34.1), S4Vectors(v.0.20.1), GenomeInfoDbData(v.1.2.0), mgcv(v.1.8-27), hms(v.0.4.2), clusterProfiler(v.3.10.1), quadprog(v.1.5-5), grid(v.3.5.3), tidyr(v.0.8.3), minqa(v.1.2.4), rvcheck(v.0.1.3), rmarkdown(v.1.11), Rtsne(v.0.15), ggforce(v.0.2.1) and base64enc(v.0.1-3)

message("This is hpgltools commit: ", get_git_commit())

## If you wish to reproduce this exact build of hpgltools, invoke the following:

## > git clone http://github.com/abelew/hpgltools.git

## > git reset 8686ac4e2fa0e16c090af923d169e9658cbca2fb

## This is hpgltools commit: Tue Mar 26 12:06:09 2019 -0400: 8686ac4e2fa0e16c090af923d169e9658cbca2fb

## message(paste0("Saving to ", savefile))
## tmp <- sm(saveme(filename=savefile))

LS0tCnRpdGxlOiAiMjAxOTAzMjYgQW4gYXR0ZW1wdCB0byBhbnN3ZXIgb25lIG9mIHRoZSBiaWcgcXVlc3Rpb25zIGZyb20gTmFqaWIuIgphdXRob3I6ICJhdGIgYWJlbGV3QGdtYWlsLmNvbSIKZGF0ZTogImByIFN5cy5EYXRlKClgIgpvdXRwdXQ6CiAgaHRtbF9kb2N1bWVudDoKICAgIGNvZGVfZG93bmxvYWQ6IHRydWUKICAgIGNvZGVfZm9sZGluZzogc2hvdwogICAgZmlnX2NhcHRpb246IHRydWUKICAgIGZpZ19oZWlnaHQ6IDcKICAgIGZpZ193aWR0aDogNwogICAgaGlnaGxpZ2h0OiB0YW5nbwogICAga2VlcF9tZDogZmFsc2UKICAgIG1vZGU6IHNlbGZjb250YWluZWQKICAgIG51bWJlcl9zZWN0aW9uczogdHJ1ZQogICAgc2VsZl9jb250YWluZWQ6IHRydWUKICAgIHRoZW1lOiByZWFkYWJsZQogICAgdG9jOiB0cnVlCiAgICB0b2NfZmxvYXQ6CiAgICAgIGNvbGxhcHNlZDogZmFsc2UKICAgICAgc21vb3RoX3Njcm9sbDogZmFsc2UKICBybWRmb3JtYXRzOjpyZWFkdGhlZG93bjoKICAgIGNvZGVfZG93bmxvYWQ6IHRydWUKICAgIGNvZGVfZm9sZGluZzogc2hvdwogICAgZGZfcHJpbnQ6IHBhZ2VkCiAgICBmaWdfY2FwdGlvbjogdHJ1ZQogICAgZmlnX2hlaWdodDogNwogICAgZmlnX3dpZHRoOiA3CiAgICBoaWdobGlnaHQ6IHRhbmdvCiAgICB3aWR0aDogMzAwCiAgICBrZWVwX21kOiBmYWxzZQogICAgbW9kZTogc2VsZmNvbnRhaW5lZAogICAgdG9jX2Zsb2F0OiB0cnVlCiAgQmlvY1N0eWxlOjpodG1sX2RvY3VtZW50OgogICAgY29kZV9kb3dubG9hZDogdHJ1ZQogICAgY29kZV9mb2xkaW5nOiBzaG93CiAgICBmaWdfY2FwdGlvbjogdHJ1ZQogICAgZmlnX2hlaWdodDogNwogICAgZmlnX3dpZHRoOiA3CiAgICBoaWdobGlnaHQ6IHRhbmdvCiAgICBrZWVwX21kOiBmYWxzZQogICAgbW9kZTogc2VsZmNvbnRhaW5lZAogICAgdG9jX2Zsb2F0OiB0cnVlCi0tLQoKPHN0eWxlIHR5cGU9InRleHQvY3NzIj4KYm9keSwgdGQgewogIGZvbnQtc2l6ZTogMTZweDsKfQpjb2RlLnJ7CiAgZm9udC1zaXplOiAxNnB4Owp9CnByZSB7CiBmb250LXNpemU6IDE2cHgKfQo8L3N0eWxlPgoKYGBge3Igb3B0aW9ucywgaW5jbHVkZT1GQUxTRX0KbGlicmFyeShocGdsdG9vbHMpCnR0IDwtIHNtKGRldnRvb2xzOjpsb2FkX2FsbCgifi9ocGdsdG9vbHMiKSkKa25pdHI6Om9wdHNfa25pdCRzZXQocHJvZ3Jlc3M9VFJVRSwKICAgICAgICAgICAgICAgICAgICAgdmVyYm9zZT1UUlVFLAogICAgICAgICAgICAgICAgICAgICB3aWR0aD0xMjAsCiAgICAgICAgICAgICAgICAgICAgIGVjaG89VFJVRSkKa25pdHI6Om9wdHNfY2h1bmskc2V0KGVycm9yPVRSVUUsCiAgICAgICAgICAgICAgICAgICAgICBmaWcud2lkdGg9OCwKICAgICAgICAgICAgICAgICAgICAgIGZpZy5oZWlnaHQ9OCwKICAgICAgICAgICAgICAgICAgICAgIGRwaT05NikKb2xkX29wdGlvbnMgPC0gb3B0aW9ucyhkaWdpdHM9NCwKICAgICAgICAgICAgICAgICAgICAgICBzdHJpbmdzQXNGYWN0b3JzPUZBTFNFLAogICAgICAgICAgICAgICAgICAgICAgIGtuaXRyLmR1cGxpY2F0ZS5sYWJlbD0iYWxsb3ciKQpnZ3Bsb3QyOjp0aGVtZV9zZXQoZ2dwbG90Mjo6dGhlbWVfYncoYmFzZV9zaXplPTEyKSkKdmVyIDwtICIyMDE5MDMyNiIKcnVuZGF0ZSA8LSBmb3JtYXQoU3lzLkRhdGUoKSwgZm9ybWF0PSIlWSVtJWQiKQpybWRfZmlsZSA8LSBwYXN0ZTAoIjIwMTkwMzI2X2h1bWFuX21hY3JvcGhhZ2VzLlJtZCIpCmBgYAoKIyBXaGljaCBnZW5lcyBhcmUgREUgaW4gaHVtYW4gbWFjcm9waGFnZXMgYXQgNCBob3VycyB1cG9uIGluZmVjdGlvbiB3aXRoIEwuIG1ham9yPwoKIyBHYXRoZXIgYW5ub3RhdGlvbiBkYXRhCgpJIHdhbnQgdG8gcGVyZm9ybSBhIHNlcmllcyBvZiBjb21wYXJpc29ucyBhbW9uZyB0aGUgaG9zdCBjZWxsczogaHVtYW4gYW5kIG1vdXNlLgpUaHVzIEkgbmVlZCB0byBjb2xsZWN0IGFubm90YXRpb24gZGF0YSBmb3IgYm90aCBzcGVjaWVzIGFuZCBnZXQgdGhlIHNldCBvZgpvcnRob2xvZ3MgYmV0d2VlbiB0aGVtLgoKIyMgU3RhcnQgd2l0aCB0aGUgaHVtYW4gYW5ub3RhdGlvbiBkYXRhCgpgYGB7ciBodW1hbl9hbm5vdGF0aW9uc30KaHNfYW5ub3QgPC0gbG9hZF9iaW9tYXJ0X2Fubm90YXRpb25zKCkkYW5ub3RhdGlvbgpyb3duYW1lcyhoc19hbm5vdCkgPC0gbWFrZS5uYW1lcygKICBwYXN0ZTAoaHNfYW5ub3RbWyJlbnNlbWJsX3RyYW5zY3JpcHRfaWQiXV0sICIuIiwKICAgICAgICAgaHNfYW5ub3RbWyJ0cmFuc2NyaXB0X3ZlcnNpb24iXV0pLAogIHVuaXF1ZT1UUlVFKQpoc190eF9nZW5lIDwtIGhzX2Fubm90WywgYygiZW5zZW1ibF9nZW5lX2lkIiwgImVuc2VtYmxfdHJhbnNjcmlwdF9pZCIpXQpoc190eF9nZW5lW1siaWQiXV0gPC0gcm93bmFtZXMoaHNfdHhfZ2VuZSkKaHNfdHhfZ2VuZSA8LSBoc190eF9nZW5lWywgYygiaWQiLCAiZW5zZW1ibF9nZW5lX2lkIildCm5ld19oc19hbm5vdCA8LSBoc19hbm5vdApyb3duYW1lcyhuZXdfaHNfYW5ub3QpIDwtIG1ha2UubmFtZXMoaHNfYW5ub3RbWyJlbnNlbWJsX2dlbmVfaWQiXV0sIHVuaXF1ZT1UUlVFKQpgYGAKCiMjIEdlbmVyYXRlIGV4cHJlc3Npb25zZXRzCgpUaGUgcXVlc3Rpb24gaXMgcmVhc29uYWJseSBzZWxmLWNvbnRhaW5lZC4gIEkgd2FudCB0byBjb21wYXJlIHRoZSB1bmluZmVjdGVkCmh1bWFuIHNhbXBsZXMgYWdhaW5zdCBhbnkgc2FtcGxlcyB3aGljaCB3ZXJlIGluZmVjdGVkIGZvciA0IGhvdXJzLgpTbyBsZXQgdXMgZmlyc3QgcHVsbCB0aG9zZSBzYW1wbGVzIGFuZCB0aGVuIHBva2UgYXQgdGhlbSBhIGJpdC4KCmBgYHtyIGV4cHRzfQpzYW1wbGVfc2hlZXQgPC0gInNhbXBsZV9zaGVldHMvbGVpc2htYW5pYV9ob3N0X21ldGFzaGVldF8yMDE5MDQwMS54bHN4Igpoc19leHB0IDwtIGNyZWF0ZV9leHB0KHNhbXBsZV9zaGVldCwKICAgICAgICAgICAgICAgICAgICAgICBmaWxlX2NvbHVtbj0iaHNhcGllbnNmaWxlIiwKICAgICAgICAgICAgICAgICAgICAgICBnZW5lX2luZm89bmV3X2hzX2Fubm90LAogICAgICAgICAgICAgICAgICAgICAgIHR4X2dlbmVfbWFwPWhzX3R4X2dlbmUpCgpoc19leHB0X25vc2tpcHBlZCA8LSBzdWJzZXRfZXhwdChoc19leHB0LCBzdWJzZXQ9InNraXBwZWQhPSd5ZXMnIikKaHNfdDRoX2V4cHQgPC0gc3Vic2V0X2V4cHQoaHNfZXhwdF9ub3NraXBwZWQsIHN1YnNldD0iZXhwdHRpbWU9PSd0NGgnIikKaHNfdDRoX2V4cHQgPC0gc2V0X2V4cHRfY29uZGl0aW9ucyhoc190NGhfZXhwdCwgZmFjdD0iaW5mZWN0c3RhdGUiKQpoc190NGhfZXhwdCA8LSBzZXRfZXhwdF9iYXRjaGVzKGhzX3Q0aF9leHB0LCBmYWN0PSJzdHVkeSIpCnRhYmxlKGhzX3Q0aF9leHB0JGNvbmRpdGlvbnMpCnRhYmxlKGhzX3Q0aF9leHB0JGJhdGNoZXMpCmBgYAoKIyMgRXhhbWluZSB0NGggdnMgdW5pbmZlY3RlZAoKYGBge3IgaHNfZXhhbWluZV90NGgsIGZpZy5zaG93PSdoaWRlJ30KaHNfdDRoX3Bsb3RzIDwtIHNtKGdyYXBoX21ldHJpY3MoaHNfdDRoX2V4cHQpKQoKaHNfdDRoX25vcm0gPC0gbm9ybWFsaXplX2V4cHQoaHNfdDRoX2V4cHQsIG5vcm09InF1YW50IiwgY29udmVydD0iY3BtIiwKICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgdHJhbnNmb3JtPSJsb2cyIiwgZmlsdGVyPVRSVUUsIGJhdGNoPSJzdmFzZXEiKQpoc190NGhfbm9ybV9wbG90cyA8LSBzbShncmFwaF9tZXRyaWNzKGhzX3Q0aF9ub3JtKSkKYGBgCgojIyMgUHJpbnQgc29tZSBvZiB0aGUgcGxvdHMKCmBgYHtyIGhzX2V4YW1pbmVfcGxvdHN9CmhzX3Q0aF9wbG90cyRsZWdlbmQKaHNfdDRoX3Bsb3RzJGxpYnNpemUKaHNfdDRoX3Bsb3RzJGJveHBsb3QKCmhzX3Q0aF9ub3JtX3Bsb3RzJHBjX3Bsb3QKYGBgCgojIyBSZW1vdmUgc3RpbXVsYXRlZCBzYW1wbGVzCgpgYGB7ciBoc190NGhfbm91bnN0aW19CmhzX3Q0aF9pbmYgPC0gc3Vic2V0X2V4cHQoaHNfdDRoX2V4cHQsIHN1YnNldD0iY29uZGl0aW9uIT0nc3RpbSciKQpoc190NGhfaW5mX25vcm0gPC0gbm9ybWFsaXplX2V4cHQoaHNfdDRoX2luZiwgdHJhbnNmb3JtPSJsb2cyIiwgY29udmVydD0iY3BtIiwKICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgIGZpbHRlcj1UUlVFLCBiYXRjaD0ic3Zhc2VxIikKCmhzX3Q0aF9wY2EgPC0gcGxvdF9wY2EoaHNfdDRoX2luZl9ub3JtLCBwbG90X3RpdGxlPSJILiBzYXBpZW5zLCBMLiBtYWpvciwgdDRoIikKaHNfdDRoX3BjYSRwbG90Cgpoc190NGhfZGUgPC0gYWxsX3BhaXJ3aXNlKGhzX3Q0aF9pbmYsIG1vZGVsX2JhdGNoPSJzdmFzZXEiLCBmaWx0ZXI9VFJVRSkKaHNfdDRoX3RhYmxlIDwtIGNvbWJpbmVfZGVfdGFibGVzCmBgYAoKIyBXaGljaCBnZW5lcyBhcmUgREUgaW4gbW91c2UgbWFjcm9waGFnZXMgYXQgNCBob3VycyB1cG9uIGluZmVjdGlvbiB3aXRoIEwuIG1ham9yPwoKIyBHYXRoZXIgYW5ub3RhdGlvbiBkYXRhCgpJIHdhbnQgdG8gcGVyZm9ybSBhIHNlcmllcyBvZiBjb21wYXJpc29ucyBhbW9uZyB0aGUgaG9zdCBjZWxsczogaHVtYW4gYW5kIG1vdXNlLgpUaHVzIEkgbmVlZCB0byBjb2xsZWN0IGFubm90YXRpb24gZGF0YSBmb3IgYm90aCBzcGVjaWVzIGFuZCBnZXQgdGhlIHNldCBvZgpvcnRob2xvZ3MgYmV0d2VlbiB0aGVtLgoKIyMgU3RhcnQgd2l0aCB0aGUgaHVtYW4gYW5ub3RhdGlvbiBkYXRhCgpgYGB7ciBtb3VzZV9hbm5vdGF0aW9uc30KbW1fYW5ub3QgPC0gbG9hZF9iaW9tYXJ0X2Fubm90YXRpb25zKHNwZWNpZXM9Im1tdXNjdWx1cyIpJGFubm90YXRpb24Kcm93bmFtZXMobW1fYW5ub3QpIDwtIG1ha2UubmFtZXMoCiAgcGFzdGUwKG1tX2Fubm90W1siZW5zZW1ibF90cmFuc2NyaXB0X2lkIl1dLCAiLiIsCiAgICAgICAgIG1tX2Fubm90W1sidHJhbnNjcmlwdF92ZXJzaW9uIl1dKSwKICB1bmlxdWU9VFJVRSkKbW1fdHhfZ2VuZSA8LSBtbV9hbm5vdFssIGMoImVuc2VtYmxfZ2VuZV9pZCIsICJlbnNlbWJsX3RyYW5zY3JpcHRfaWQiKV0KbW1fdHhfZ2VuZVtbImlkIl1dIDwtIHJvd25hbWVzKG1tX3R4X2dlbmUpCm1tX3R4X2dlbmUgPC0gbW1fdHhfZ2VuZVssIGMoImlkIiwgImVuc2VtYmxfZ2VuZV9pZCIpXQpuZXdfbW1fYW5ub3QgPC0gbW1fYW5ub3QKcm93bmFtZXMobmV3X21tX2Fubm90KSA8LSBtYWtlLm5hbWVzKG1tX2Fubm90W1siZW5zZW1ibF9nZW5lX2lkIl1dLCB1bmlxdWU9VFJVRSkKYGBgCgojIyBHZW5lcmF0ZSBleHByZXNzaW9uc2V0cwoKVGhlIHF1ZXN0aW9uIGlzIHJlYXNvbmFibHkgc2VsZi1jb250YWluZWQuICBJIHdhbnQgdG8gY29tcGFyZSB0aGUgdW5pbmZlY3RlZApodW1hbiBzYW1wbGVzIGFnYWluc3QgYW55IHNhbXBsZXMgd2hpY2ggd2VyZSBpbmZlY3RlZCBmb3IgNCBob3Vycy4KU28gbGV0IHVzIGZpcnN0IHB1bGwgdGhvc2Ugc2FtcGxlcyBhbmQgdGhlbiBwb2tlIGF0IHRoZW0gYSBiaXQuCgpgYGB7ciBtb3VzZV9leHB0c30KbW1fZXhwdCA8LSBjcmVhdGVfZXhwdChzYW1wbGVfc2hlZXQsCiAgICAgICAgICAgICAgICAgICAgICAgZmlsZV9jb2x1bW49Im1tdXNjdWx1c2ZpbGUiLAogICAgICAgICAgICAgICAgICAgICAgIGdlbmVfaW5mbz1uZXdfbW1fYW5ub3QsCiAgICAgICAgICAgICAgICAgICAgICAgdHhfZ2VuZV9tYXA9bW1fdHhfZ2VuZSkKbW1fdDRoX2V4cHQgPC0gc3Vic2V0X2V4cHQobW1fZXhwdCwgc3Vic2V0PSJleHB0dGltZT09J3Q0aCciKQptbV90NGhfZXhwdCA8LSBzZXRfZXhwdF9jb25kaXRpb25zKG1tX3Q0aF9leHB0LCBmYWN0PSJpbmZlY3RzdGF0ZSIpCnRhYmxlKG1tX3Q0aF9leHB0JGNvbmRpdGlvbnMpCnRhYmxlKG1tX3Q0aF9leHB0JGJhdGNoZXMpCmBgYAoKIyMgRXhhbWluZSB0NGggdnMgdW5pbmZlY3RlZAoKYGBge3IgZXhhbWluZV90NGgsIGZpZy5zaG93PSdoaWRlJ30KbW1fdDRoX3Bsb3RzIDwtIHNtKGdyYXBoX21ldHJpY3MobW1fdDRoX2V4cHQpKQptbV90NGhfbm9ybSA8LSBub3JtYWxpemVfZXhwdChtbV90NGhfZXhwdCwgbm9ybT0icXVhbnQiLCBjb252ZXJ0PSJjcG0iLAogICAgICAgICAgICAgICAgICAgICAgICAgICAgICB0cmFuc2Zvcm09ImxvZzIiLCBmaWx0ZXI9VFJVRSwgYmF0Y2g9InN2YXNlcSIpCm1tX3Q0aF9ub3JtX3Bsb3RzIDwtIHNtKGdyYXBoX21ldHJpY3MobW1fdDRoX25vcm0pKQpgYGAKCiMjIyBQcmludCBzb21lIG9mIHRoZSBwbG90cwoKYGBge3IgZXhhbWluZV9wbG90c30KbW1fdDRoX3Bsb3RzJGxlZ2VuZAptbV90NGhfcGxvdHMkbGlic2l6ZQptbV90NGhfcGxvdHMkYm94cGxvdAoKbW1fdDRoX25vcm1fcGxvdHMkcGNfcGxvdApgYGAKCgoKCmBgYHtyIHNhdmVtZX0KcGFuZGVyOjpwYW5kZXIoc2Vzc2lvbkluZm8oKSkKbWVzc2FnZSgiVGhpcyBpcyBocGdsdG9vbHMgY29tbWl0OiAiLCBnZXRfZ2l0X2NvbW1pdCgpKQojIyBtZXNzYWdlKHBhc3RlMCgiU2F2aW5nIHRvICIsIHNhdmVmaWxlKSkKIyMgdG1wIDwtIHNtKHNhdmVtZShmaWxlbmFtZT1zYXZlZmlsZSkpCmBgYAo=

20190326 An attempt to answer one of the big questions from Najib.

atb abelew@gmail.com

2019-04-01