Speciation (1996)

The key to the origin of species is the phenomenon of hybrid sterility. A mule is the hybrid formed by crossing a healthy fertile horse and a healthy fertile ass. The mule is sterile showing that, despite their health, the parents are reproductively isolated from each other (but not necessarily from other members of their respective species). Mule being a trifle stubborn. Photo by Cortis corporation Theories of this phenomenon are either "genic" or "chromosomal". Until 1996, chromosomal theories had required differences in large segments (e.g. deletions, translocations), which might sometimes be seen on examination with a standard light microscope.

A modified chromosomal theory requiring diffuse differences only in single bases was presented in the Journal of Theoretical Biology (1996). This postulated that (C+G)% is the "accent" of DNA, which, like the accent of human beings (metaphorically speaking), can affect reproductive success (see Eliza Doolittle, below). An important path to this theory was to follow the approach of the molecular biologists in the 1940s and 1950s. They studied the simplest possible biological forms - the viruses that infect bacteria. So, regarding the speciation question, we sought evidence on this from viruses - in this case viruses that infected eukaryotic cells.

In chemical terms, (C+G)% differences have a profound affect on the ability of a duplex DNA molecule to extrude the stem-loop structures by which homologous chromosomes first recognize each other at meiosis.

Donald Forsdyke

Different Biological Species "Broadcast" their DNAs at Different (C+G)% "Wavelengths"

by D. R. FORSDYKE

J. Theor. Biol. (1996) 178, 405-417. (With copyright permission from Academic Press.)

ABSTRACT

1. INTRODUCTION

2. PRIMARY AND SECONDARY COMPONENTS OF INFORMATION

3. CODON CHOICE AND THE NEUTRALIST SELECTIONIST DEBATE.

4. SEX AND RECOMBINATION REPAIR

5. (C+G)% AND SPECIATION

6. EVIDENCE FOR STEM-LOOP "KISSING" MODEL

7. HOW (C+G)% DIFFERENCES MIGHT IMPEDE RECOMBINATION.

8. THE EARLY EVOLUTION OF INTRONS

9. (C+G)% AND PHYLOGENY

10. MAJOR PRESSURES ON DNA ARE SELECTIVE

11. THE DOMINANCE OF THE GENOME.

REFERENCES

End Note March 2008

End Note Jan 2010

End Note September 2010

End_Note_Feb_2013

ABSTRACT

Radio can be used as a metaphor for the transmission of information by DNA through time and space. Just as different radio transmitters broadcast at different wavelengths to prevent interference, so different biological species "broadcast" their DNAs at different (G+C)% "wavelengths" to prevent recombination.

It is postulated that species differences in (G+C)% prevent recombination. First, evidence is presented supporting the early Crick-Sobell stem-loop model for genetic recombination, which proposes that the rate-limiting step in recombination is the recognition ("kissing") of complementary sequences in the loops of stem-loop structures extruded from supercoiled DNA. Then, various ways in which differences in (G+C)% might impede complementary loop interactions are outlined.

The strength of the postulate is that it brings together a variety of disparate observations in fields that have not previously been seen as related. Thus explanations are apparent for why most mutations are not selectively neutral (the "neutralist/selectionist" debate), why introns were present in the earliest genes (the "introns-early / introns-late" debate), and the origin of species.

1. INTRODUCTION.

2. PRIMARY AND SECONDARY COMPONENTS OF INFORMATION

3. CODON CHOICE AND THE NEUTRALIST SELECTIONIST DEBATE.



5. (C+G)% AND SPECIATION.

6. EVIDENCE FOR STEM-LOOP "KISSING" MODEL



7. HOW (C+G)% DIFFERENCES MIGHT IMPEDE RECOMBINATION


8. THE EARLY EVOLUTION OF INTRONS

TABLE 1. MOLAR PROPORTIONS OF BASES IN INSECT VIRUS DNAs
VIRUS TYPE	Virus host	A/T	G/C	R/Y*	(C+G)%
Polyhedral. . . . .	P. dispar	1.06	1.08	1.07	58.5
L. monacha	1.03	1.08	1.06	51.5
*C. fumiferana*	1.03	1.09	1.06	51.3
M. americanum	1.04	1.11	1.07	42.4
B. mori	1.04	1.11	1.07	42.7
C. P. eurytheme	1.08	1.11	1.09	42.5
N. sertifer	1.07	1.09	1.07	37.4
Capsule .	C. murinana	1.05	1.11	1.07	37.4
*C. fumiferana*	1.01	1.12	1.05	34.8
* R = purine (A or G); Y = pyrimidine (C or T).

The upward pointing arrows in Figure 2 are in distinct regions, symbolizing the later-evolving localized pressure for the encoding of specific function. Here there is a conflict. A sequence required to encode a protein might not at the same time be able locally to optimize its folding propensity. The conflict might have been meet in three ways:

First, because of the redundancy of the genetic code, particular synonymous codons could have been preferred.
Second, amino acids with similar functions could have been interchanged to widen the range of codon choice.
Third, the sequences encoding a protein could have been diffused over a wider region, by permitting encoding to occur only in discrete segments.

If the first two options were not sufficient, then only the third option would have been left. Thus, introns might correspond to parts of a gene where the constraints on the first two options were most severe. Introns would have allowed the interspersing of selectively advantageous stem-loops in coding regions of DNA.

Evidence supporting this is presented elsewhere (Forsdyke, 1995b-e). As an example, Figure 3 (upper) shows FORS-D plots for the human troponin-c gene, which may have been under positive Darwinian evolutionary selection (Ohta, 1994). Negative FORS-D values are associated with certain exons. For exons which are not associated with negative FORS-D values, it can be assumed that it was possible to accommodate FORS-D pressure by the use of synonymous codons and conservative amino acids. Negative FORS-D values in parts of the first intron and 5' flank suggests functions for these regions, perhaps regulatory.

Figure 3 (lower) shows that profiles for the folding of the natural sequence (FONS values) and the mean value for randomized sequences (FORS-M values), closely follow each other. This implies that the genome characteristic which controls the FORS-M value (base composition), is a major factor influencing the energetics of stem-loop formation (Forsdyke, 1995b-e). Once introns are removed, the cDNA product (not shown) has generally lower FORS-D values (average 2.31.9 kcal/mol) than the corresponding genomic segment shown in Figure 3 (average 4.40.7 kcal/mol).

Secondary structure analysis of a troponin gene

End Note Feb 2013

Beautiful work on the role of nucleic structures in recombination between polioviruses (Romanova et al. 1986, see above; Tolskaya et al. 1987) has been confirmed and extended by Runckel et al. (2013). Furthermore, they provide strong evidence that GC%, which would tend to stabilize such structures, positively supports recombination. However, for poliovirus they were unable to support the idea that recombination preferentially occurs at gene boundaries, so tending to preserve intact genes.

Runckel C, Westesson O, Andino R, DeRisi JL (2013) Identification and manipulation of the molecular determinants influencing poliovirus recombination. PLOS Pathogens 9, e1003164.

Tolskaya EA, Romanova LI, Blinov VM, Viktorova EG, Sinyakov AN et al. (1987) Studies on the recombination between RNA genomes of poliovirus. The primary structure and nonrandom distribution of crossover regions in the genomes of intertypic poliovirus recombinants. Virology 161, 54-61.

Next: Thinking about Stem-Loops (1998) (Click Here)

Return to Bioinformatics Index (Click Here)

Return to Evolution Index (Click Here)

Return to HomePage (Click Here)

This page was established circa 1998 and was last edited 23 Nov 2014 by D. R. Forsdyke