Single Marker Analysis Using F2 Progeny

 

  Click here for a
printer-friendly version
Single Marker Analysis Using F2 Progeny

Reference: Ben Hui Lui. Statistical Genomics: Linkage, Mapping and QTL Analysis. pgs. 402-405.

In this situation we have one marker linked to a QTL. We do not have a marker on either side of the QTL, which would be the case for flanking markers. Marker genotypes are MM, Mm, mm and the QTL genotypes are QQ, Qq, and qq. The marker and the QTL are in coupling phase linkage.

In the F2 population the relative frequencies and means of the three marker classes are:

Marker Class Relative Frequency Marker Mean
MM 0.25 u1
Mm 0.5 u2
mm 0.25 u3

In this same F2 population the relative frequencies and means of the three QTL classes are:

QTL Class Relative Frequency QTL Mean
QQ 0.25 m + a
Qq 0.5 m + d
qq 0.25 m - a

The observed recombination fraction between the marker and the QTL is ‘r'. The relative frequency of crossovers events between the marker and QTL loci is 2r. This is because for each crossover event only two out of four strands are involved in recombination. The gametic relative frequencies for each marker-QTL combination are:

Type of Gamete Relative Frequency
QM (1 - r)/2
qm (1 - r)/2
Qm r/2
qM r/2







A Punnett square shows the genotypes in the F2 population.

                             (1 - r)/2    (1 - r)/2      r/2           r/2
    QM qm Qm qM
(1 - r)/2 QM QQMM QqMm QQMm QqMM
(1 - r)/2 qm QqMm qqmm Qqmm qqMm
r/2 Qm QQMm Qqmm QQmm QqMm
r/2 qM QqMM qqMm QqMm qqMM

We can collect like genotypes to derive the relative frequencies of each class.

  Genotypes                             Relative Frequency
QQMM (1 - r)2/4
QqMM (r - r2)/2
qqMM r2/4
QQMm (r - r2)/2
QqMm (1 - r)2/2 + r2/2
qqMm (r - r2)/2
QQmm (r/2)2
Qqmm (r - r2)/2
qqmm (1 - r)2/4

We can rearrange the results from the above table into a 3 X 3 table.

QQ Qq qq
MM (1 - r)2/4 (r - r2)/2 r2/4
Mm (r - r2)/2 {(1 - r)2 + r2}/2 (r - r2)/2
mm r2/4 (r - r2)/2 (1 - r)2/4

The marginal frequency for the MM marker class is the relative frequency of this marker genotype averaged across the three QTL genotypes:

(1-r)2 + (r-r2) +  r 2 =(1/4)[1-2r+r2+2r+2r2+r2]=1/4
4 2

The marginal frequency for the Mm marker class is:

(r-r2) + [(1-r2) + r2] + (r-r2) =(1/2){r-r2+1-2r+r2+r2+r-r2}=1/2
2 2 2

The marginal frequency for the mm marker class is:

r2  + (r-r2) + (1-r2) =(1/4){r2+2r-2r2+1-2r+r2}=1/4
4 2 4

The MM marker class can be identified with banding patterns and includes QTL genotypes QQ, Qq, and qq. The MM marker class is a mixture of QTL classes due to recombination between the marker and the QTL. We can only identify the means of each marker class, because we cannot identify the QTL genotypes associated with a given marker in an F2 plant. We can only estimate the means of each marker class.

To determine the means of each marker class we must first find the conditional probability of each QTL class for a given marker class:

P(Qj/Mi) = P(QjnMi)
P(Mi)

The conditional probability of the QQ genotype given that the MM marker genotype is present in that F2 is:

P(QQ/MM)= P(QQMM) = {(1-r)2/4} =(1-r)2
P(MM) 4

We are dividing the probability of each marker-QTL genotype by the probability of the marker genotype. We can develop the following table:

QQ Qq qq
MM 1/4 (1 - r)2 2r(1 - r) r2
Mm 1/2 (r - r2) {(1 - r)2 + r2} (r - r2)
mm 1/4 r2 2r(1 - r) (1 - r)2


Now we can calculate the mean of each marker class. The markers do not contribute to the mean value. Only the QTL loci contribute to the mean value.

uMM = ( 1 - r)2u1 + 2r( 1 - r)u2 + r2u3

Now u1 = u + a; u2 = u + d; u3 = u - a;

uMM = (1 - r)2(u + a) + 2r(1 - r)(u + d) + r2 ( u - a)
       = [(1 - r)2 + r2 + 2r(1 - r)]u + (1 - r)2a - r2a + 2r(1 - r)d
       = (1 - 2r + r2 + r2 + 2r - 2r2)u + (1 - r)2a - r2a + 2r(1 - r)d
       = u + [1 - 2r + r2 - r2]a + 2r(1 - r)d
       = u + (1 - 2r)a + 2r(1 - r)d

We can show that
UMm = r(1 - r)(u + a) + [(1 - r)2 + r2](u + d) + r(1 - r)( u - a)
       = u + [(1 - r)2 + r2]d

We can also show that
Umm = r2(u + a) + 2r(1 - r)(u + d) + (1 - r)2(u - a)
       = u - (1 - 2r)a + 2r(1 - r)d

We consolidate these results into a table.

  Marker       ni           pi                Marker mean value
MM n1 0.25 UMM = u + (1 - 2r)a + 2r(1 - r)d
Mm n2 0.50 UMm = u + [(1 - r)2 + r2]d
mm n3 0.25 Umm = u - (1 - 2r)a + 2r(1 - r)d

We have four unknowns to estimate that include: u, a , d, and r. We only have the three equations above. With three equations we cannot estimate four unknowns.

The genetic effects which include u, a, and d are confounded with the estimate of the recombination fraction (r). We cannot determine the recombination between the marker locus and the QTL locus. A one-way analysis of variance will only provide evidence that the marker locus is linked to a QTL. The R2 is the amount of variation explained by the marker, not the QTL.

The markers are the independent variable and the mean of each marker class is the dependent variable. The marker means are a mixture of QTL genotypes. For example, the mean of the MM marker class is UMM = (1 - r)2u1 + 2r(1 - r)u2 + r2u3. This equation indicates that there is a proportion (1 - r)2 of the QQ genotype, 2r(1 - r) of the Qq genotype, and a proportion r2 of the qq genotype associated with the MM marker genotype.

R2 = Sum of Squares due to regression
Sum of Squares Total

The R2 does not provide information on the amount of variation explained by the QTL locus.

We can also show that the additive and dominance effects cannot be determined when we use data from a single marker-QTL association. The linear contrasts for the additive and dominance effects are so S ci = 0. Where ci is the contrast coefficient.

    Contrast                              Marker genotype
                          MM                Mm                 mm
additive 1 0 -1
dominance 1 -2 1

The additive contrast is the difference in the means of the two homozygous genotypes.
Additive contrast = SciYi = (1)uAA + (-1)uaa

Where uAA = u1(1 - r)2 + 2u2 r(1 - r) + u3 r2 and
uaa = u1 r2 + 2r(1 - r)u2 + (1 - r)2 u3.

The contrast is then uAA - uaa = u1(1 - 2r) + (2r -1)u3 = (u +a)
(1 - 2r) + (2r -1)(u - a) = 2( 1 - 2r)a.

This shows that the additive effect cannot be separated from the recombination (r) value. Now we will derive the dominance contrast = SciYi

=(1)[u + (1 - 2r)a + 2r(1 - r)d]-(2){u + [(1 - r)2 + r2]d}+(1)
          [u - (1 - 2r)a + 2r(1 - r)d]

= (-2)(1 - 2r)2d

This shows that the dominance effect cannot be separated from the recombination (r) value. For both the additive and dominance contrasts the genetic effects are confounded with the recombination value. When we have a marker located on only one side of the QTL, we cannot determine the additive or dominance effects or the recombination fraction between the marker and the QTL. Because of recombination between the marker and the QTL, a marker may be associated with either the + or - alleles of the QTL locus. Each marker mean includes the effects of homozygous and heterozygous QTL genotypes. Each marker class is a mixture of QTL genotypes. Because we cannot determine the recombination between the marker and QTL, the R2 for regression of marker genotype on the mean of each marker class is only a measure of the amount of variation explained by the marker. The amount of variation explained by the QTL cannot be determined for the case of a single marker-QTL association.

"The hypothesis test based on the contrasts for the marker genotypes corresponds to the marker and the QTL being independent or no genetic effects can be detected for the putative QTL, which is r = 0.5 or a = 0 and d=0." From Ben Hui Lui (pg. 405).

Copyright 2000©, Ted Helms

Back | Home | Top | Next
Home Forward Back