Professional Documents
Culture Documents
Introduction
Assessing genetic variation is an important parameter in genetic studies, in studying biodiversity, germplasm characterization and generation of genetic variability in plant breeding. Moreover, estimation of genetic variation is important for the selection of desirable genotypes. The understanding that high amounts of genetic variation is not expressed on the phenotype rendered obvious that new methods should be developed to estimate and use this variation in favor of a breeding program. Recently, due to several technical advances made in molecular genetics, genetic variation could be measured at the DNA level by developing different molecular markers. Microsatellites which are the subject of this chapter are based on different repetitive sequences present in the genome. Marker assisted selection could help breeders avoid the traditional phenotype based selections in the eld, thus speeding up the breeding programs and maximizing its progress. By denition, molecular marker is any site (locus) in the genome of an organism where the
Stella K. Kantartzi (ed.), Microsatellites: Methods and Protocols, Methods in Molecular Biology, vol. 1006, DOI 10.1007/978-1-62703-389-3_1, Springer Science+Business Media, LLC 2013
DNA base sequence differs among the individuals of a population. The arrival of genetic tools like restriction enzymes and the polymerase chain reaction plus the growing abundance of DNA sequence data, coupled with automated high-throughput assays, have revealed several classes of molecular markers, including restriction fragment length polymorphisms (RFLPs), variable number tandem repeats (VNTRs), microsatellite DNA, and single nucleotide polymorphisms (SNPs) (molecular marker, http://www. encyclopedia.com/doc/1O6-molecularmarker.html). The genome of higher organisms contains three types of simple repetitive DNA sequences (satellite DNAs, minisatellites, and microsatellites), organized in clusters of differing sizes (1, 2). Microsatellites rst described by Litt and Luty (3) can be found under the term simple sequence repeats (SSRs), simple repetitive sequences (SRS), or simple tandem repeats (STRs) (4) (SSLPs) (5).
Microsatellites
Microsatellites are tandem repeats of very short 16 bp patterns which are not repeated many times at a particular locus but are distributed relatively evenly at many different genomic loci (6). Other scientists dene microsatellites as 28 or even 15 bp repeats (7, 8). The most abundant patterns found in the plant genome are (AT)n, (GA)n, and (GAA)n where n refers to the total number of repeats, usually ranging from 10 to 100. In addition, mononucleotide repeats consisting of A/T repeats are also present in the chloroplast genome. Increased intra- and inter-genetic variation is observed when the number of repeats is increasing (9). Between the two types of mononucleotide repeats, A/T was the most abundant in all plant species, while G/C was comparatively limited. In the mononucleotide repeats category, the maximum (99 %) A/T repeats have been found in the Arabidopsis genome and the minimum (78 %) in the Brachypodium genome. In the dinucleotide repeat category, the distribution of SSRs in different motif types was not uniform and the most frequent motif type was different for each plant species. For example, AG/CT repeats were more frequent in Brachypodium and rice, with 50.7 % and 41.9 % frequency respectively, whereas AT/AT repeats were more frequent in Populus (60.5 %) and Medicago (59.9 %). In rice, both AG/CT and AT/ AT repeats were the most abundant among the other dinucleotide repeats. Interestingly, the CG/CG motif contributed less than 0.5 % in dicots, whereas it was 3.17.0 % in all dinucleotide repeats identied in the monocots. The analysis of mononucleotide and dinucleotide repeats concluded that CG-rich motifs were least preferred in both monocot and dicot genomes. However, for trinucleotide repeats the AGC/CGT, AGG/CCT, and CCG/CGG were observed more frequently in all monocot species, whereas A/T-rich
repeats, such as AAC/GTT, AAG/CTT, and AAT/ATT, were more frequent in dicots. The frequency of tetranucleotide, pentanucleotide, and hexanucleotide repeats was very low in all the plant genomes (10). Overall, there are 501 possibilities of nonredundant monomeric to hexameric repeats. In plants, the frequency and number of microsatellites have been estimated for a number of species and results indicate that the most frequent microsatellite is (GT)n, while in mammals is (AT)n (10). Microsatellites are sometimes associated with other genomic repeats, especially transposable elements. In humans, microsatellites are associated with repetitive DNA, especially non-LTR retrotransposons (11, 12). In plants, however, including Arabidopsis, rice, soybean, maize, and wheat, microsatellites are preferentially located in nonrepetitive DNA regions, which indicates that they reside in regions predating genome expansion (13, 14). Nevertheless, microsatellites are evenly spaced in the genome, although they are highly variable in number of repeat units among individuals (7). Because unique DNA sequences ank individual microsatellites they could be genotyped via STS (Sequence Tagged Site) PCR (15). In species with low levels of genetic diversity identication of a fast mutating locus would be the optimal resource for the development of markers, markers which would thus be ideal for breeding programs. At the moment there are a vast number of SSR markers publicly available for research concerning the most important agricultural crops (1619).
Fig. 1 Replication leads to new alleles with less (deletion) or more (expansion) repeats depending on the strand containing the error
DNA slippage during DNA replication (24), caused by mismatches between DNA strands when they are being replicated during meiosis (25). It has been estimated that replication slippage at each microsatellite occurs about once per 1,000 generations (26). The second involves recombination between DNA strands (27).
Replication Slippage
Replication slippage accounts for many mutations at SSR loci (28). This type of mutation occurs when one DNA strand is mispairing (slip strand) during DNA replication. The mispairing refers to a repeat unit hybridizing to a repeat in such a way where a loop is formed in the nascent strand resulting in the addition of a repeat (22). If the loop occurs in the template strand, then there will be a decrease in the number of units (29). These can lead to gain or loss of certain repeats. The mismatch repair mechanism and exonuclease activity of polymerase corrects a number of errors but many escape and become mutations (Fig. 1).
Recombination
Another mechanism of mutation is the recombination process which could change the SSR length by asymmetrical crossing over or by gene conversion (2, 3033). Asymmetric exchanges, random
Fig. 2 (a) Amplication of microsatellites using a pair of SSR markers. PCR products are analyzed on polyacrylamide gels. (b) Amplication of microsatellites using one ISSR markers. PCR products are analyzed on polyacrylamide gels or simple 1.5 % agarose gels
genetic drift and selection can have a signicant effect on the accumulation of tandem-repetitive sequences in the genome (34). Non reciprocal recombination also mutates tandem repeat number (for both microsatellites and minisatellites) (32, 33).
must be cloned and only a small number of these will be useful for the development of the SSR markers. Moreover, only a number of these markers will give informative results, especially for species with large genomes (3638). In addition, problems that might occur are, for instance, as follows: (a) the primer may not amplify any PCR product; (b) the primer may produce very complex, weak, or nonspecic amplication patterns; (c) the amplication product may not be polymorphic. Other possible problems using SSR markers are as follows: the difculty to resolve bands differing only in one or two base pairs, the cost of polyacrylamide gels and labeled primers, and the differences in identifying band size and their calling between laboratories, making comparisons between results very hard. Yet, despite any problems, SSRs are now the marker of choice in many areas of molecular genetics due to their codominant and polymorphic nature, even between closely related lines, their requirement for low amounts of DNA, and the possibility of being automated for high-throughput screening make them attractive. In addition they can be easily exchanged between laboratories, and are highly transferable between populations (39). For example, a total of 18,828 SSR sequences have been detected in the rice genome (40), of which only 1015 % have yet been used, suggesting the high potential available for such marker systems. SSRs are mostly codominant markers and are indeed excellent for studies of population genetics and mapping (31, 41). Another technical development like the use of uorescent primers in combination with automatic capillary or gel-based DNA sequencers has facilitated the detection of bands and their analysis.
Advances in Microsatellites
Although microsatellites mainly occur in noncoding sequences, the development of EST databases revealed that microsatellite repetitive sequences also occur inside coding sequences (4244). The information obtained by EST libraries has been recently used for the development of SSR markers (4549). Microsatellites designed EST are expected to be slightly less polymorphic than genomic library derived SSRs, as there is selection pressure for sequence conservation in coding regions (5053). This also explains why the most abundant microsatellites in genes are trinucleotides and hexanucleotides and the less frequent are mononucleotides and dinucleotides, as these types cause frame shift and most probably premature stop codons. While technology progresses and new genomes and EST libraries become available with the help of bioinformatics approaches, the development of SSR markers based on ESTs through data mining has become a fast, efcient and relatively inexpensive, compared to development of
genomic SSRs (54). However, these approaches require the existence of sequence information.
Fig. 3 (a) A double stranded DNA fragment melts at a specic temperature (Tm) which is specic for each DNA fragment. The highest rate of uorescence decrease is generally at the melting temperature of the DNA sample (Tm). The Tm is dened as the temperature at which 50 % of the DNA sample is double stranded and 50 % is single stranded. (b) Different PCR products in size have different melting curves and can be distinguished having even one single point mutation
HRM analysis is based on the fact that although DNA melting curves are used primarily for the determination of the melting temperature (Tm) of amplied double-stranded DNA, the precise shape of a melting curve is typical of each DNA sequence (57). By making very small temperature size steps, accurate melting curves can be produced, while normalization and comparison of the melting curves can clarify whether different amplicons have the same or different sequences (58). Different amplicons can have the same Tm values but the advantage of the HRM method is that these different amplicons can be distinguished by the shape of HRM melting curves (59) (Fig. 3b). Even not being strictly a banding pattern-based method, HRM analysis is categorized as such because it relies on PCR amplication and detection of sequence variants without sequencing or hybridization procedures (60). HRM could be used as an alternative method to detect microsatellites, especially for those laboratories that do not have immediate access to capillary sequencers (61). Its main advantage is the fast, accurate, and closed-tube determination of SNPs and sequence variations (62). The sensitivity of HRM analysis has already been broadly veried (63). Besides numerous applications in clinical mutation screening, HRM was suggested as another population genetics genotyping system (64) and has been used to discriminate closely related plant cultivars (61, 6569). Mader et al. (69) proved the ability of HRM application to SSR analysis in principle, but also its limitations in comparison to CE (Capillary Electrophoresis). Specically, only low-complexity SSRs with a few alleles in a population can be fully detected with HRM. The need for production of unknown PCR products articial mixtures with already genotyped standards makes the procedure more complex and labor intensive. It may therefore be unlikely that HRM will replace CE for genotyping the highly complex SSRs typically used in population genetics. Ganopoulos et al. (67) suggested that HRM is able to detect and screen single locus markers without the need of labeled primers, product fractionation, DNA restriction or individual sequence analyses. This makes the technique ideal for cultivar identication studies where large populations are to be scored with numerous SSR loci. There are numerous advantages of the HRM method of scoring SNPs/microsatellites comparing to existing systems that are based on high-resolution gel or CE (70). First, there is no a priori requirement to identify the position or identity of the SNP/microsatellite; any SNP or length polymorphism giving rise to a melt polymorphism can be scored without characterization. Second, there are no additional reagent costs for labeled primers. Third, the capacity to perform HRM directly after PCR makes the need for further handling of samples unnecessary. The capacity of HRM analysis instruments to perform more assays in the same
time means that more data points can be generated within the 1520 min required to perform an HRM following the end of PCR, thus increasing the overall throughput. Finally, the fact that melting curves shapes depends not only to amplied size fragments but also to base composition and SNP position, is more sensitive to distinguish closely related genotypes such as cultivars of the same species.
10
Applications of Microsatellites
Microsatellites have become a marker of choice for a huge range of applications in plants with a vast literature; refer all this literature is beyond the scope of this article. SSR markers are useful for a variety of applications in plant genetics and breeding because of their reproducibility, multiallelic nature, codominant inheritance, relative abundance, and good genome coverage (71). Furthermore, SSR markers have been useful for integrating the genetic physical and sequenced based physical maps in plant species and at the same time they have provided breeders and geneticists with an efcient tool to link phenotypic and genotypic variation (72). Microsatellites are also used in order to estimate genetic variation at molecular level in a germplasm collection which will help towards the correct choice of parents for crosses in a breeding program (i.e., hybrid breeding), mapping and tagging of genes or QTLs (quantitative trait loci) for agronomic and disease resistance traits, genome mapping, MAS of promising lines and Marker Assisted Backcrossing (MAB) during breeding programs, gender identication, studying the population structure and taxonomic and phylogenetic relationships. In addition, the knowledge of genetic variation is mostly useful for characterization of accessions in plant germplasm collections and taxonomic studies and phylogenetic studies (73). In phylogenetic studies organelle specic markers (i.e., cpSSR and mtSSR) have also been used making great impact on the determination of structure and variation within a natural population too. Organelle microsatellites are attractive targets for phylogenetic studies or evolution studies and even migration histories due to uniparental mode of inheritance, conserved gene order and lack of heteroplasmy and recombination of organelle (74). Microsatellites have also been used for hybrid determination and characterization of allelic contribution of each parent (71). Moreover, microsatellites have been used for mapping of specic genomic regions responsible for agronomic traits or mapping of specic genes (75).
10
11
Conclusions
Ever since their development, microsatellite markers are constantly being isolated and characterized in a wide range of plants including cereals, legumes, vegetables, forest trees, fruit plants, conifers, and other economically important plant species. Arrival of new technologies did not eliminate the use of microsatellites instead they have rendered microsatellites a useful multi-tool in plant breeding. Microsatellites are still the method of choice for marker assisted selection, population genetics, estimation of genetic diversity, ngerprinting, mapping, and gene association studies. SSR based association mapping holds a great promise for exploiting genetic diversity, characterizing accumulated phenotypic variation, and associating markers with traits in plant germplasm especially with the progress made in the genome programs. They owe their broad use to their cost-effectiveness easy to use and their excellent results. Microsatellite markers not only are involved in genetic diversity studies, and evolutionary studies, but are also being used in fundamental research like genome analysis, gene mapping, markerassisted selection, etc., yet there are several limitations limiting their use like the need to isolate them de novo although genome projects are expected to solve this problem, the presence of stutter bands, null alleles, and heterologous amplicons (76, 77). In conclusion, genomic progress and advancement in microsatellites markers will make their use even more attractive for molecular breeding and plant genetics and eventually they will have great contribution in major crop improvement.
References
1. Armour J et al (1999) Minisatellites and mutation processes in tandemly repetitive DNA. Oxford University Press, Oxford 2. Hancock JM (1999) Microsatellites and other simple sequences: genomic context and mutational mechanisms. Oxford University Press, Oxford 3. Litt M, Luty JA (1989) A hypervariable microsatellite revealed by in vitro amplication of a dinucleotide repeat within the cardiac muscle actin gene. Am J Hum Genet 44: 397401 4. Tautz D (1989) Hypervariabity of simple sequences as a general source for polymorphic DNA markers. Nucleic Acids Res 17: 64636471 5. McDonald DB, Potts WK (1997) DNA microsatellites as genetic markers for several scales. Academic, New York 6. Tautz D, Renz M (1984) Simple sequences are ubiquitous repetitive components of eukaryotic genomes. Nucleic Acids Res 12:41274138 7. Goldstein DB, Pollock DD (1997) Launching microsatellites: a review of mutation processes and methods of phylogenetic inference. J Hered 88:335342 8. Schltterer C (1998) Microsatellites. IRL, Oxford 9. Queller DC et al (1993) Microsatellites and kinship. Trends Ecol Evol 8:285288 10. Sonah H et al (2011) Genome-wide distribution and organization of microsatellites in plants: an insight into marker development in Brachypodium. PLoS One 6:e21298 11. Kelkar YD et al (2011) A matter of life or death: how microsatellites emerge in and vanish from the human genome. Genome Res 21:20382048
Microsatellites: Evolution and Contribution 12. Nadir E et al (1996) Microsatellite spreading in the human genome: evolutionary mechanisms and structural implications. Proc Natl Acad Sci 93:64706475 13. Morgante M et al (2002) Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat Genet 30:194200 14. Temnykh S et al (2001) Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential. Genome Res 11:14411452 15. Weber J, May P (1989) Abundant class of human DNA polymorphisms which can be typed using the polymerase chain reaction. Am J Hum Genet 44:388396 16. Milbourne D et al (1998) Isolation, characterisation and mapping of simple sequence repeat loci in potato. Mol Gen Genet 259:233245 17. Sharopova N et al (2002) Development and mapping of SSR markers for maize. Plant Mol Biol 48:463481 18. Song QJ et al (2002) Characterization of trinucleotide SSR motifs in wheat. Theor Appl Genet 104:286293 19. Temnykh S et al (2000) Mapping and genome organization of microsatellite sequences in rice (Oryza sativa L.). Theor Appl Genet 100: 697712 20. Crow J (1993) How much do we know about spontaneous human mutation rates? Environ Mol Mutagen 21:122129 21. Zhu Y et al (2000) A phylogenetic perspective on sequence evolution in microsatellite loci. J Mol Evol 50:324338 22. Ellegren H (2000) Microsatellite mutations in the germline: implications for evolutionary inference. Trends Genet 16:551558 23. Jin L et al (1996) Mutation rate varies among alleles at a microsatellite locus:Phylogenetic evidence. Proc Natl Acad Sci 93: 1528515288 24. Tachida H, Iizuka M (1992) Persistence of repeated sequences that evolve by replication slippage. Genetics 131:471478 25. Tautz D, Schltterer C (1994) Simple sequences. Curr Opin Genet Dev 4:832837 26. Weber JL, Wong C (1993) Mutation of human short tandem repeats. Hum Mol Genet 2: 11231128 27. Harding RM et al (1992) The evolution of tandemly repetitive DNA: recombination rules. Genetics 132:847859 28. Levinson G, Gutman GA (1987) Slippedstrand mispairing: a major mechanism for DNA sequence evolution. Mol Biol Evol 4: 203221
11
29. Eisen J (1999) Mechanistic basis for microsatellite instability. Oxford University Press, Oxford 30. Brohede J, Ellegren H (1999) Microsatellite evolution: polarity of substitutions within repeats and neutrality of anking sequences. Proc Biol Sci 266:825833 31. Goldstein D, Schlotterer C (1999) Microsatellites, evolution and applications. Oxford University Press, Oxford 32. Jakupciak JP, Wells RD (1999) Genetic instabilities in (CTG CAG) repeats occur by recombination. J Biol Chem 274:2346823479 33. Richard GF, Paques F (2000) Mini- and microsatellite expansions: the recombination connection. EMBO Rep 1:122126 34. Charlesworth B et al (1994) The evolutionary dynamics of repetitive DNA in eukaryotes. Nature 371:215220 35. Bruford M et al (1996) Microsatellites and their application to conservation genetics. Oxford University Press, Oxford 36. Kostia S et al (1995) Microsatellite sequences in a conifer, Pinus sylvestris. Genome 38: 12441248 37. Rder MS et al (1995) Abundance, variability and chromosomal location of microsatellites in wheat. Mol Gen Genet 246:327333 38. Smith DN, Devey ME (1994) Occurrence and inheritance of microsatellites in Pinus radiata. Genome 37:977983 39. Gupta PK et al (1999) Molecular markers and their applications in wheat breeding. Plant Breed 118:369390 40. International Rice Genome Sequencing Project (2005) The map-based sequence of the rice genome Nature 436:793800 41. Jarne P, Lagoda PJL (1996) Microsatellites, from molecules to populations and back. Trends Ecol Evol 11:424429 42. Eujayl I et al (2004) Medicago truncatula ESTSSRs reveal cross-species genetic markers for Medicago spp. Theor Appl Genet 108:414422 43. Hackauf B, Wehling P (2002) Identication of microsatellite polymorphisms in an expressed portion of the rye genome. Plant Breed 121:1725 44. Thiel TT et al (2003) Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare). Theor Appl Genet 106: 411422 45. Chapman M et al (2009) Development, polymorphism, and cross-taxon utility of ESTSSR markers from safower (Carthamus tinctorius L.). Theor Appl Genet 120:8591
12
Panagiotis Madesis et al. of gene variants in complex DNA fragments. Hum Mutat 30:876883 Mackay JF et al (2008) A new approach to varietal identication in plants by microsatellite high resolution melting analysis: application to the verication of grapevine and olive cultivars. Plant Meth 4:8 Wu SB et al (2008) High resolution melting analysis of almond SNPs derived from ESTs. Theor Appl Genet 118:114 Reed GH, Wittwer CT (2004) Sensitivity and specicity of single-nucleotide polymorphism scanning by high-resolution melting analysis. Clin Chem 50:17481754 Smith BL et al (2010) High-resolution melting analysis (HRMA): a highly sensitive inexpensive genotyping alternative for population studies. Mol Ecol Resour 10:193196 Bosmali I et al (2012) Microsatellite and DNAbarcode regions typing combined with high resolution melting (HRM) analysis for food forensic uses: a case study on lentils (Lens culinaris). Food Res Int 46:141147 Ganopoulos I et al (2011) Adulterations in Basmati rice detected quantitatively by combined use of microsatellite and fragrance typing with high resolution melting (HRM) analysis. Food Chem 129:652659 Ganopoulos I et al (2011) Microsatellite high resolution melting (SSR-HRM) analysis for authenticity testing of protected designation of origin (PDO) sweet cherry products. Food Contr 22:532541 Ganopoulos I et al (2012) Microsatellite genotyping with HRM (high resolution melting) analysis for identication of the PGI common bean variety Plake Megalosperma Prespon. Eur Food Res Tech 234:501508 Mader E et al (2008) A strategy to setup codominant microsatellite analysis for highresolution-melting-curve-analysis (HRM). BMC Genet 9:69 Reed GH et al (2007) High-resolution DNA melting analysis for simple and efcient molecular diagnostics. Pharmacogenomics 8:597608 Powell W et al (1996) The comparison of RFLP, RAPD, AFLP and SSR (microsatellite) markers for germplasm analysis. Mol Breed 2:225238 Gupta PK, Varshney RK (2000) The development and use of microsatellite markers for genetic analysis and plant breeding with emphasis on bread wheat. Euphytica 113: 163185 Joshi SP et al (1999) Molecular markers in plant genome analysis. Curr Sci 77:230240
46. Choudhary S et al (2009) Development of chickpea EST-SSR markers and analysis of allelic variation across related species. Theor Appl Genet 118:591608 47. Gadaleta A et al (2010) Development and characterization of EST-derived SSRs from a totipotent cDNA library of durum wheat. Plant Breed 129:715717 48. Nunome T et al (2009) Development of SSR markers derived from SSR-enriched genomic library of eggplant (Solanum melongena L.). Theor Appl Genet 119:11431153 49. Wei W et al (2011) Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers. BMC Genomics 12:451 50. Chabane K et al (2005) EST versus genomic derived microsatellite markers for genotyping wild and cultivated barley. Genet Resour Crop Evol 52:903909 51. Cho YG et al (2000) Diversity of microsatellites derived from genomic libraries and GenBank sequences in rice (Oryza sativa L.). Theor Appl Genet 100:713722 52. Eujayl I et al (2001) Assessment of genotypic variation among cultivated durum wheat based on EST-SSRS and genomic SSRS. Euphytica 119:3943 53. Scott KD et al (2000) Analysis of SSRs derived from grape ESTs. Theor Appl Genet 100: 723726 54. Gupta PK et al (2003) Transferable EST-SSR markers for the study of polymorphism and genetic diversity in bread wheat. Mol Genet Genomics 270:315323 55. Wilhelm J et al (2003) Validation of an algorithm for automatic quantication of nucleic acid copy numbers by real-time polymerase chain reaction. Anal Biochem 317:218225 56. Wittwer CT (2009) High-resolution DNA melting analysis: advancements and limitations. Hum Mutat 30:857859 57. Vossen RHAM et al (2009) High-resolution melting analysis (HRMA)more than just sequence variant screening. Hum Mutat 30: 860866 58. Wittwer CT et al (2003) High-resolution genotyping by amplicon melting analysis using LCGreen. Clin Chem 49:853860 59. Stephens AJ et al (2008) High-resolution melting analysis of the spa repeat region of Staphylococcus aureus. Clin Chem 54: 432436 60. Tindall EA et al (2009) Assessing high-resolution melt curve analysis for accurate detection
61.
62.
63.
64.
65.
66.
67.
68.
69.
70.
71.
72.
73.
Microsatellites: Evolution and Contribution 74. Provan J et al (2001) Chloroplast microsatellites: new tools for studies in plant ecology and evolution. Trends Ecol Evol 16:142147 75. Neeraja C et al (2007) A marker-assisted backcross approach for developing submergence-tolerant rice cultivars. Theor Appl Genet 115:767776
13
76. Kalia R et al (2011) Microsatellite markers: an overview of the recent progress in plants. Euphytica 177:309334 77. Wang M et al (2009) Microsatellite markers in plants and insects. Part I: applications of biotechnology. Genes Genomes Genomics 3: 5467