Comparing my DE-ish data with IGV
The goal of this worksheet is to compare the most changed WT/KO regions of the genome between my differential-expression based examination and what I actually see in IGV. I sort of assume that the differences are going to be due to peculiarities already observed in the WT samples, but lets find out!
KB664734 first hit
The most different region of all is near the gene ID: ASTE002813
Looking at it in IGV, the reason is immediately obvious. The WT 3 sample has coverage in this region when all other samples (including the other wild types) do not.
KB664942
This is a much more interesting region. It should be noted though that this is a pretty small scaffold, thus the relevant reads may actually be on one of the actual chromosomes. Though I think this is not terribly likely and I am not even sure how to properly test that hypothesis… hmm I guess I can blast a piece of this against the entire genome and see if there is another region which is sufficiently similar. However, the only way that should be relevant is if the mapper (hisat2) mapped the relevant reads differently, which is theoretically possible if we don’t have the random seed set, it randomly assigned the reads to different places on a per-sample basis so that only the KO samples got reassigned differently, and no reads got assigned to this region. So, basically the chances are like 1 in the number of stars – or impossible.
Thus I think this might be a real difference which is related to the difference between the WT and KO samples. Whether this is actually CRISPR I am not sure how to discern.
KB664734 second hit
This looks quite similar to the previous observation. This region at the final 5-10kb of the contig has been lost in the KO samples.
KB664542 first hit
This is much more similar to the first observation. WT sample 1 is very different from all other samples. I would guess that there is 0 chance that this is related to the CRISPR.
KB664488 first hit
This seems an interesting candidate for a funky off-target effect.
KB664488 second hit
This seems to be irrelevant. It appears that different samples lost coverage in this region (both WT and KO), but a bit more in the KO samples than WT, so it got picked up. It seems like this entire contig has lower coverage than the others I have looked at, which is presumably contributing to these false positives (there are a bunch more which I am not including).
KB664439 first hit
This one is neat, but I would be willing to bet that it is not CRISPR related. Even though all the KO samples are similar in pattern, they share this pattern with WT1.
KB665365 first hit
I might stop here, there are 22 more candidates according to my completely arbitrary 4 fold cutoff, but all the ones I looked at are similar in pattern to this one, where one or more WT samples diverged from an otherwise established coverage pattern. Thus I am reasonably confident that there is no significant chance that the differences are CRISPR related.
I am, however, taking pictures of them so that you may judge for yourself. The naming convention is: \({contig}_\){observation} where config is the name of the assembly’s contig and observation is just the arbitrary order in which I find them.
LS0tCnRpdGxlOiAiTG9va2luZyBmb3Igb3RoZXIgQ0RTIGNoYW5nZXMgYmV0d2VlbiB0aGUgd3QgYW5kIGtvIHNhbXBsZXMiCmF1dGhvcjogImF0YiBhYmVsZXdAZ21haWwuY29tIgpkYXRlOiAiYHIgU3lzLkRhdGUoKWAiCm91dHB1dDoKICBodG1sX2RvY3VtZW50OgogICAgY29kZV9kb3dubG9hZDogdHJ1ZQogICAgY29kZV9mb2xkaW5nOiBzaG93CiAgICBmaWdfY2FwdGlvbjogdHJ1ZQogICAgZmlnX2hlaWdodDogNwogICAgZmlnX3dpZHRoOiA3CiAgICBoaWdobGlnaHQ6IHRhbmdvCiAgICBrZWVwX21kOiBmYWxzZQogICAgbW9kZTogc2VsZmNvbnRhaW5lZAogICAgbnVtYmVyX3NlY3Rpb25zOiB0cnVlCiAgICBzZWxmX2NvbnRhaW5lZDogdHJ1ZQogICAgdGhlbWU6IHJlYWRhYmxlCiAgICB0b2M6IHRydWUKICAgIHRvY19mbG9hdDoKICAgICAgY29sbGFwc2VkOiBmYWxzZQogICAgICBzbW9vdGhfc2Nyb2xsOiBmYWxzZQogIHJtZGZvcm1hdHM6OnJlYWR0aGVkb3duOgogICAgY29kZV9kb3dubG9hZDogdHJ1ZQogICAgY29kZV9mb2xkaW5nOiBzaG93CiAgICBkZl9wcmludDogcGFnZWQKICAgIGZpZ19jYXB0aW9uOiB0cnVlCiAgICBmaWdfaGVpZ2h0OiA3CiAgICBmaWdfd2lkdGg6IDcKICAgIGhpZ2hsaWdodDogdGFuZ28KICAgIHdpZHRoOiAzMDAKICAgIGtlZXBfbWQ6IGZhbHNlCiAgICBtb2RlOiBzZWxmY29udGFpbmVkCiAgICB0b2NfZmxvYXQ6IHRydWUKICBCaW9jU3R5bGU6Omh0bWxfZG9jdW1lbnQ6CiAgICBjb2RlX2Rvd25sb2FkOiB0cnVlCiAgICBjb2RlX2ZvbGRpbmc6IHNob3cKICAgIGZpZ19jYXB0aW9uOiB0cnVlCiAgICBmaWdfaGVpZ2h0OiA3CiAgICBmaWdfd2lkdGg6IDcKICAgIGhpZ2hsaWdodDogdGFuZ28KICAgIGtlZXBfbWQ6IGZhbHNlCiAgICBtb2RlOiBzZWxmY29udGFpbmVkCiAgICB0b2NfZmxvYXQ6IHRydWUKLS0tCgo8c3R5bGUgdHlwZT0idGV4dC9jc3MiPgpib2R5LCB0ZCB7CiAgZm9udC1zaXplOiAxNnB4Owp9CmNvZGUucnsKICBmb250LXNpemU6IDE2cHg7Cn0KcHJlIHsKIGZvbnQtc2l6ZTogMTZweAp9Cjwvc3R5bGU+CgpgYGB7ciBvcHRpb25zLCBpbmNsdWRlPUZBTFNFfQpsaWJyYXJ5KCJocGdsdG9vbHMiKQp0dCA8LSBkZXZ0b29sczo6bG9hZF9hbGwoIn4vaHBnbHRvb2xzIikKa25pdHI6Om9wdHNfa25pdCRzZXQod2lkdGg9MTIwLAogICAgICAgICAgICAgICAgICAgICBwcm9ncmVzcz1UUlVFLAogICAgICAgICAgICAgICAgICAgICB2ZXJib3NlPVRSVUUsCiAgICAgICAgICAgICAgICAgICAgIGVjaG89VFJVRSkKa25pdHI6Om9wdHNfY2h1bmskc2V0KGVycm9yPVRSVUUsCiAgICAgICAgICAgICAgICAgICAgICBkcGk9OTYpCm9sZF9vcHRpb25zIDwtIG9wdGlvbnMoZGlnaXRzPTQsCiAgICAgICAgICAgICAgICAgICAgICAgc3RyaW5nc0FzRmFjdG9ycz1GQUxTRSwKICAgICAgICAgICAgICAgICAgICAgICBrbml0ci5kdXBsaWNhdGUubGFiZWw9ImFsbG93IikKZ2dwbG90Mjo6dGhlbWVfc2V0KGdncGxvdDI6OnRoZW1lX2J3KGJhc2Vfc2l6ZT0xMCkpCnJ1bmRhdGUgPC0gZm9ybWF0KFN5cy5EYXRlKCksIGZvcm1hdD0iJVklbSVkIikKcHJldmlvdXNfZmlsZSA8LSAiIgp2ZXIgPC0gIjIwMjAwMzE1IgoKIyN0bXAgPC0gc20obG9hZG1lKGZpbGVuYW1lPXBhc3RlMChnc3ViKHBhdHRlcm49IlxcLlJtZCIsIHJlcGxhY2U9IiIsIHg9cHJldmlvdXNfZmlsZSksICItdiIsIHZlciwgIi5yZGEueHoiKSkpCnJtZF9maWxlIDwtICJvdGhlcl9jaGFuZ2VzLlJtZCIKYGBgCgojIENvbXBhcmluZyBteSBERS1pc2ggZGF0YSB3aXRoIElHVgoKVGhlIGdvYWwgb2YgdGhpcyB3b3Jrc2hlZXQgaXMgdG8gY29tcGFyZSB0aGUgbW9zdCBjaGFuZ2VkIFdUL0tPIHJlZ2lvbnMgb2YgdGhlCmdlbm9tZSBiZXR3ZWVuIG15IGRpZmZlcmVudGlhbC1leHByZXNzaW9uIGJhc2VkIGV4YW1pbmF0aW9uIGFuZCB3aGF0IEkgYWN0dWFsbHkKc2VlIGluIElHVi4gIEkgc29ydCBvZiBhc3N1bWUgdGhhdCB0aGUgZGlmZmVyZW5jZXMgYXJlIGdvaW5nIHRvIGJlIGR1ZSB0bwpwZWN1bGlhcml0aWVzIGFscmVhZHkgb2JzZXJ2ZWQgaW4gdGhlIFdUIHNhbXBsZXMsIGJ1dCBsZXRzIGZpbmQgb3V0IQoKIyMgS0I2NjQ3MzQgZmlyc3QgaGl0CgpUaGUgbW9zdCBkaWZmZXJlbnQgcmVnaW9uIG9mIGFsbCBpcyBuZWFyIHRoZSBnZW5lIElEOiBBU1RFMDAyODEzCgpMb29raW5nIGF0IGl0IGluIElHViwgdGhlIHJlYXNvbiBpcyBpbW1lZGlhdGVseSBvYnZpb3VzLiAgVGhlIFdUIDMgc2FtcGxlIGhhcwpjb3ZlcmFnZSBpbiB0aGlzIHJlZ2lvbiB3aGVuIGFsbCBvdGhlciBzYW1wbGVzIChpbmNsdWRpbmcgdGhlIG90aGVyIHdpbGQgdHlwZXMpCmRvIG5vdC4KCiFbZmlyc3RdKGlndi9LQjY2NDczNF8wMS5wbmcpCgojIyBLQjY2NDk0MgoKVGhpcyBpcyBhIG11Y2ggbW9yZSBpbnRlcmVzdGluZyByZWdpb24uICBJdCBzaG91bGQgYmUgbm90ZWQgdGhvdWdoIHRoYXQgdGhpcyBpcwphIHByZXR0eSBzbWFsbCBzY2FmZm9sZCwgdGh1cyB0aGUgcmVsZXZhbnQgcmVhZHMgbWF5IGFjdHVhbGx5IGJlIG9uIG9uZSBvZiB0aGUKYWN0dWFsIGNocm9tb3NvbWVzLiBUaG91Z2ggSSB0aGluayB0aGlzIGlzIG5vdCB0ZXJyaWJseSBsaWtlbHkgYW5kIEkgYW0gbm90IGV2ZW4Kc3VyZSBob3cgdG8gcHJvcGVybHkgdGVzdCB0aGF0IGh5cG90aGVzaXMuLi4gIGhtbSBJIGd1ZXNzIEkgY2FuIGJsYXN0IGEgcGllY2Ugb2YKdGhpcyBhZ2FpbnN0IHRoZSBlbnRpcmUgZ2Vub21lIGFuZCBzZWUgaWYgdGhlcmUgaXMgYW5vdGhlciByZWdpb24gd2hpY2ggaXMKc3VmZmljaWVudGx5IHNpbWlsYXIuICBIb3dldmVyLCB0aGUgb25seSB3YXkgdGhhdCBzaG91bGQgYmUgcmVsZXZhbnQgaXMgaWYgdGhlCm1hcHBlciAoaGlzYXQyKSBtYXBwZWQgdGhlIHJlbGV2YW50IHJlYWRzIGRpZmZlcmVudGx5LCB3aGljaCBpcyB0aGVvcmV0aWNhbGx5CnBvc3NpYmxlIGlmIHdlIGRvbid0IGhhdmUgdGhlIHJhbmRvbSBzZWVkIHNldCwgaXQgcmFuZG9tbHkgYXNzaWduZWQgdGhlIHJlYWRzIHRvCmRpZmZlcmVudCBwbGFjZXMgb24gYSBwZXItc2FtcGxlIGJhc2lzIHNvIHRoYXQgb25seSB0aGUgS08gc2FtcGxlcyBnb3QKcmVhc3NpZ25lZCBkaWZmZXJlbnRseSwgYW5kIG5vIHJlYWRzIGdvdCBhc3NpZ25lZCB0byB0aGlzIHJlZ2lvbi4gIFNvLCBiYXNpY2FsbHkKdGhlIGNoYW5jZXMgYXJlIGxpa2UgMSBpbiB0aGUgbnVtYmVyIG9mIHN0YXJzIC0tIG9yIGltcG9zc2libGUuCgpUaHVzIEkgdGhpbmsgdGhpcyBtaWdodCBiZSBhIHJlYWwgZGlmZmVyZW5jZSB3aGljaCBpcyByZWxhdGVkIHRvIHRoZSBkaWZmZXJlbmNlCmJldHdlZW4gdGhlIFdUIGFuZCBLTyBzYW1wbGVzLiAgV2hldGhlciB0aGlzIGlzIGFjdHVhbGx5IENSSVNQUiBJIGFtIG5vdCBzdXJlCmhvdyB0byBkaXNjZXJuLgoKIVtzZWNvbmRdKGlndi9LQjY2NDk0Ml8wMS5wbmcpCgojIyBLQjY2NDczNCBzZWNvbmQgaGl0CgpUaGlzIGxvb2tzIHF1aXRlIHNpbWlsYXIgdG8gdGhlIHByZXZpb3VzIG9ic2VydmF0aW9uLiAgVGhpcyByZWdpb24gYXQgdGhlIGZpbmFsCjUtMTBrYiBvZiB0aGUgY29udGlnIGhhcyBiZWVuIGxvc3QgaW4gdGhlIEtPIHNhbXBsZXMuCgohW3RoaXJkXShpZ3YvS0I2NjQ3MzRfMDIucG5nKQoKIyMgS0I2NjQ1NDIgZmlyc3QgaGl0CgpUaGlzIGlzIG11Y2ggbW9yZSBzaW1pbGFyIHRvIHRoZSBmaXJzdCBvYnNlcnZhdGlvbi4gIFdUIHNhbXBsZSAxIGlzIHZlcnkKZGlmZmVyZW50IGZyb20gYWxsIG90aGVyIHNhbXBsZXMuICBJIHdvdWxkIGd1ZXNzIHRoYXQgdGhlcmUgaXMgMCBjaGFuY2UgdGhhdAp0aGlzIGlzIHJlbGF0ZWQgdG8gdGhlIENSSVNQUi4KCiFbdGhpcmRdKGlndi9LQjY2NDU0Ml8wMS5wbmcpCgojIyBLQjY2NDQ4OCBmaXJzdCBoaXQKClRoaXMgc2VlbXMgYW4gaW50ZXJlc3RpbmcgY2FuZGlkYXRlIGZvciBhIGZ1bmt5IG9mZi10YXJnZXQgZWZmZWN0LgoKIVtmb3VydGhdKGlndi9LQjY2NDQ4OF8wMS5wbmcpCgoKIyMgS0I2NjQ0ODggc2Vjb25kIGhpdAoKVGhpcyBzZWVtcyB0byBiZSBpcnJlbGV2YW50LiAgSXQgYXBwZWFycyB0aGF0IGRpZmZlcmVudCBzYW1wbGVzIGxvc3QgY292ZXJhZ2UgaW4KdGhpcyByZWdpb24gKGJvdGggV1QgYW5kIEtPKSwgYnV0IGEgYml0IG1vcmUgaW4gdGhlIEtPIHNhbXBsZXMgdGhhbiBXVCwgc28gaXQKZ290IHBpY2tlZCB1cC4gIEl0IHNlZW1zIGxpa2UgdGhpcyBlbnRpcmUgY29udGlnIGhhcyBsb3dlciBjb3ZlcmFnZSB0aGFuIHRoZQpvdGhlcnMgSSBoYXZlIGxvb2tlZCBhdCwgd2hpY2ggaXMgcHJlc3VtYWJseSBjb250cmlidXRpbmcgdG8gdGhlc2UgZmFsc2UKcG9zaXRpdmVzICh0aGVyZSBhcmUgYSBidW5jaCBtb3JlIHdoaWNoIEkgYW0gbm90IGluY2x1ZGluZykuCgohW2ZpZnRoXShpZ3YvS0I2NjQ0ODhfMDIucG5nKQoKIyMgS0I2NjQ0MzkgZmlyc3QgaGl0CgpUaGlzIG9uZSBpcyBuZWF0LCBidXQgSSB3b3VsZCBiZSB3aWxsaW5nIHRvIGJldCB0aGF0IGl0IGlzIG5vdCBDUklTUFIgcmVsYXRlZC4KRXZlbiB0aG91Z2ggYWxsIHRoZSBLTyBzYW1wbGVzIGFyZSBzaW1pbGFyIGluIHBhdHRlcm4sIHRoZXkgc2hhcmUgdGhpcyBwYXR0ZXJuCndpdGggV1QxLgoKIVtzaXh0aF0oaWd2L0tCNjY0NDM5XzAxLnBuZykKCiMjIEtCNjY1MzY1IGZpcnN0IGhpdAoKSSBtaWdodCBzdG9wIGhlcmUsIHRoZXJlIGFyZSAyMiBtb3JlIGNhbmRpZGF0ZXMgYWNjb3JkaW5nIHRvIG15IGNvbXBsZXRlbHkKYXJiaXRyYXJ5IDQgZm9sZCBjdXRvZmYsIGJ1dCBhbGwgdGhlIG9uZXMgSSBsb29rZWQgYXQgYXJlIHNpbWlsYXIgaW4gcGF0dGVybiB0bwp0aGlzIG9uZSwgd2hlcmUgb25lIG9yIG1vcmUgV1Qgc2FtcGxlcyBkaXZlcmdlZCBmcm9tIGFuIG90aGVyd2lzZSBlc3RhYmxpc2hlZApjb3ZlcmFnZSBwYXR0ZXJuLiAgVGh1cyBJIGFtIHJlYXNvbmFibHkgY29uZmlkZW50IHRoYXQgdGhlcmUgaXMgbm8gc2lnbmlmaWNhbnQKY2hhbmNlIHRoYXQgdGhlIGRpZmZlcmVuY2VzIGFyZSBDUklTUFIgcmVsYXRlZC4KCiFbc2V2ZW50aF0oaWd2L0tCNjY1MzY1XzAxLnBuZykKCkkgYW0sIGhvd2V2ZXIsIHRha2luZyBwaWN0dXJlcyBvZiB0aGVtIHNvIHRoYXQgeW91IG1heSBqdWRnZSBmb3IgeW91cnNlbGYuICBUaGUKbmFtaW5nIGNvbnZlbnRpb24gaXM6ICR7Y29udGlnfV8ke29ic2VydmF0aW9ufSAgd2hlcmUgY29uZmlnIGlzIHRoZSBuYW1lIG9mIHRoZQphc3NlbWJseSdzIGNvbnRpZyBhbmQgb2JzZXJ2YXRpb24gaXMganVzdCB0aGUgYXJiaXRyYXJ5IG9yZGVyIGluIHdoaWNoIEkgZmluZAp0aGVtLgoKCgohW2ZvdXJ0aF0oaWd2L0tCNjY0NDg4XzAxLnBuZykKCgoKCgoKCmBgYHtyIHNhdmVtZSwgZXZhbD1GQUxTRX0KcGFuZGVyOjpwYW5kZXIoc2Vzc2lvbkluZm8oKSkKbWVzc2FnZShwYXN0ZTAoIlRoaXMgaXMgaHBnbHRvb2xzIGNvbW1pdDogIiwgZ2V0X2dpdF9jb21taXQoKSkpCnRoaXNfc2F2ZSA8LSBwYXN0ZTAoZ3N1YihwYXR0ZXJuPSJcXC5SbWQiLCByZXBsYWNlPSIiLCB4PXJtZF9maWxlKSwgIi12IiwgdmVyLCAiLnJkYS54eiIpCm1lc3NhZ2UocGFzdGUwKCJTYXZpbmcgdG8gIiwgdGhpc19zYXZlKSkKdG1wIDwtIHNtKHNhdmVtZShmaWxlbmFtZT10aGlzX3NhdmUpKQpgYGAK