Chapter 9 Mutations Bordeaux
This chapter describes the reanalyses of data from Plomion et al. (2018) currently done in the bordeaux
branch of the detectMutations
repository.
9.1 Original mutations on 3P
I reported the original mutations (Tab. 9.1) from the supplementary table 5 from Plomion et al. (2018) and plotted them on the 3P genome (Fig. 9.1) after realignement from the haplome v2.3 to the public 3P genome.
locus | Mutation | origin | f(alt)pool | CHROM | POS |
---|---|---|---|---|---|
Sc0000080_1329750 | G/A | 3P – between XL1 and XL2 | 0,0330 | Qrob_Chr08 | 58757092 |
Sc0000573_185294 | A/T | 3P – between XL1 and XL2 | 0,0000 | Qrob_H2.3_Sc0000573 | 185294 |
Sc0000010_1057132 | G/A | 3P – between XL1 and XL2 | 0,1154 | Qrob_Chr03 | 13766652 |
Sc0000003_4011526 | A/T | 3P – between XL1 and XL2 | 0,0000 | Qrob_Chr02 | 84928601 |
Sc0000010_758473 | G/A | 3P – between XL1 and XL2 | 0,0000 | Qrob_Chr03 | 13467993 |
Sc0000015_2644541 | G/T | 3P – between XL1 and XL2 | 0,0000 | Qrob_Chr03 | 50836623 |
![Original mutations on the 3P genome.](treemutation_files/figure-html/bdxFig-1.png)
Figure 9.1: Original mutations on the 3P genome.
9.2 Coverage
Coverage are around 160X, this is high but strangely lower than expected, besides all data have been used this time. (Fig. 9.2).
![Coverage for Bordeaux's libraries on chromosomes 1 to 4.](treemutation_files/figure-html/covbdx-1.png)
Figure 9.2: Coverage for Bordeaux’s libraries on chromosomes 1 to 4.
9.3 Strelka2
Strelka2
produced 3 millions of candidate mutations with unique position across the 3 branches.
9.3.1 Overlap with original mutations
We found back all the 60 expected mutations (100%) (Tab. 9.2).
Chromosome | Position | Mutated | Normal | Ref | Alt | Mutated AltCount | Mutated RefCount | Normal AltCount | Normal RefCount | Allelic fraction |
---|---|---|---|---|---|---|---|---|---|---|
Qrob_Chr08 | 58757092 | L1 | L2 | G | C | 0 | 119 | 0 | 60 | 0.0000000 |
Qrob_Chr08 | 58757092 | L2 | L1 | G | A | 42 | 60 | 0 | 119 | 0.4117647 |
Qrob_Chr08 | 58757092 | L1 | L3 | G | C | 0 | 119 | 0 | 60 | 0.0000000 |
Qrob_Chr08 | 58757092 | L3 | L1 | G | A | 34 | 60 | 0 | 119 | 0.3617021 |
Qrob_H2.3_Sc0000573 | 185294 | L2 | L1 | A | T | 14 | 57 | 0 | 86 | 0.1971831 |
Qrob_H2.3_Sc0000573 | 185294 | L1 | L3 | A | T | 0 | 86 | 19 | 50 | 0.0000000 |
9.3.2 Filtering
We filtered mutations with the original and a robust filters, adjusting the coverage to the coverage obtained:
Original:
- A read depth for the two sample between half and two times the mean coverage (
normal_DP <= 320, normal_DP >= 80, mutation_DP <= 320, mutation_DP >= 80
) - A null number of alternate allele count in the normal sample (
normal_altCount == 0
) - A minimum of 10 alternate allele count in the mutated sample (
mutation_altCount >= 10
) - An allelic frequency inferior to 0.5 (
tumor_AF <= 0.5
) - A coherent temporal pattern (“XL1-XL2”, “XL2-L3”, “XL1-L1”, “XL2-L2”)
Robust:
- A read depth for the two sample between half and two times the mean coverage (
normal_DP <= 320, normal_DP >= 80, mutation_DP <= 320, mutation_DP >= 80
) - A null number of alternate allele count in the normal sample (
normal_altCount == 0
) - A minimum of 10 alternate allele count in the mutated sample (
mutation_altCount >= 10
) - An allelic frequency inferior to 0.5 (
tumor_AF <= 0.5
) Strelka 2
automatic filtering (Filter == PASS
)- A coherent temporal pattern (“XL1-XL2”, “XL2-L3”, “XL1-L1”, “XL2-L2”)
We obtained 460 candidates (Fig 9.3) for the original filter, And in a robust dataset of 108 mutations (Fig 9.4).
![Mutations retained after original filtering: allele frequency (A) and positions on the 3P genome (B).](treemutation_files/figure-html/mutFilteredSB-1.png)
Figure 9.3: Mutations retained after original filtering: allele frequency (A) and positions on the 3P genome (B).
![Mutations retained after robust filtering: allele frequency (A) and positions on the 3P genome (B).](treemutation_files/figure-html/mutRobustSB-1.png)
Figure 9.4: Mutations retained after robust filtering: allele frequency (A) and positions on the 3P genome (B).
9.4 Mutect2
Mutect2
produced 4.6 millions of candidate mutations with unique position across the 3 branches.
9.4.1 Overlap with original mutations
We tried to find back original mutations to have a look to their metrics.
We found back 57 out of the 60 expected mutations (95%) (Tab. 9.3.
which are exactly interestingly the same as Strelka2
.
Chromosome | Position | Mutated | Normal | Ref | Alt | Mutated AltCount | Mutated RefCount | Normal AltCount | Normal RefCount | Allelic fraction |
---|---|---|---|---|---|---|---|---|---|---|
Qrob_Chr08 | 58757092 | L2 | L1 | G | A | 35 | 55 | 0 | 113 | 0.3888889 |
Qrob_Chr08 | 58757092 | L3 | L1 | G | A | 39 | 61 | 0 | 113 | 0.3900000 |
Qrob_H2.3_Sc0000573 | 185294 | L2 | L1 | A | T | 15 | 52 | 0 | 84 | 0.2238806 |
Qrob_H2.3_Sc0000573 | 185294 | L3 | L1 | A | T | 19 | 56 | 0 | 84 | 0.2533333 |
Qrob_Chr03 | 13766652 | L2 | L1 | G | A | 43 | 5 | 0 | 77 | 0.8958333 |
Qrob_Chr03 | 13766652 | L3 | L1 | G | A | 35 | 16 | 0 | 77 | 0.6862745 |
9.4.2 Filtering
We filtered mutations with the original filter, adjusting the coverage to preliminary low coverage obtained:
Original:
- A read depth for the two sample between half and two times the mean coverage (
normal_DP <= 320, normal_DP >= 80, mutation_DP <= 320, mutation_DP >= 80
) - A null number of alternate allele count in the normal sample (
normal_altCount == 0
) - A minimum of 10 alternate allele count in the mutated sample (
mutation_altCount >= 10
) - An allelic frequency inferior to 0.5 (
tumor_AF <= 0.5
) - A coherent temporal pattern (“XL1-XL2”, “XL2-L3”, “XL1-L1”, “XL2-L2”)
We obtained 948 candidates (Fig 9.5) for the original filter.
![Mutations retained after original filtering: allele frequency (A) and positions on the 3P genome (B).](treemutation_files/figure-html/mutFilteredFB-1.png)
Figure 9.5: Mutations retained after original filtering: allele frequency (A) and positions on the 3P genome (B).