
Revista Cubana de Informática Médica

ISSN 1684-1859 (Print)

2019, Number 1


Revista Cubana de Informática Médica 2019; 11 (1)

Methodology for in silico mining of microsatellite polymorphic loci

Martínez OCM, Rivero BA

Language: Spanish
References: 12
Page: 1-17
PDF size: 1865.73 Kb.


Key words:

SSR, VNTR, molecular marker, data mining, algorithm.

ABSTRACT

Variable number of tandem repeat (VNTR) polymorphisms are genetic markers used in areas of genomics such as evolutionary, epidemiological, and population genetics studies. The growth of genomic sequence data in public databases and the development of bioinformatics tools allow these markers to be mined without experimental methods, extending the analysis to non-model organisms of medical or economic importance. Because these sequences have low complexity and large-scale inspection of one or several genomes yields a high number of candidates, two difficulties arise: processing the volume of data generated, and detecting polymorphisms in candidate markers by visual inspection.
This article describes a methodology and its algorithmic details, implemented as a software pipeline, for the fast and reliable identification of polymorphic SSR loci. Processing chains the programs MIDAS and BLAST with the PSSR-Extractor script. The inputs are directory paths containing multiple sequence files in FASTA or GBFF format. The outputs are the SSRs, their database accession codes, positions in the genome, number of repeats, and the degree of polymorphism expressed as range of variation, allele frequency, number of alleles, and polymorphic information content (PIC). An optional script, SSRMerge, identifies unique (non-redundant) loci across the set of processed genome sequences from closely related taxa.
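The polymorphism measures listed above can be illustrated with a minimal sketch. It assumes the commonly used PIC definition, PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2, where p_i is the frequency of allele i; the abstract does not state which formula PSSR-Extractor applies, so this is an assumption. The function name `polymorphism_summary` and its input (a list of repeat counts observed for one locus across genomes) are hypothetical and are not part of the published pipeline.

```python
from collections import Counter


def polymorphism_summary(repeat_counts):
    """Summarize one SSR locus from repeat counts observed across genomes.

    Hypothetical helper for illustration only; not the PSSR-Extractor code.
    Each distinct repeat count is treated as one allele.
    """
    if not repeat_counts:
        raise ValueError("need at least one observed repeat count")

    counts = Counter(repeat_counts)
    total = len(repeat_counts)

    # Allele frequency: share of genomes carrying each repeat count.
    freqs = {allele: n / total for allele, n in sorted(counts.items())}
    p = list(freqs.values())

    # Assumed PIC formula: 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    sum_sq = sum(x * x for x in p)
    cross = sum(
        2 * p[i] ** 2 * p[j] ** 2
        for i in range(len(p))
        for j in range(i + 1, len(p))
    )

    return {
        "allele_number": len(freqs),
        "range": (min(counts), max(counts)),
        "allele_frequencies": freqs,
        "pic": 1 - sum_sq - cross,
    }


if __name__ == "__main__":
    # Invented repeat counts for one locus in four genomes.
    print(polymorphism_summary([3, 3, 4, 5]))
```

A locus with the same repeat count in every genome has a single allele and a PIC of 0, so it would be reported as monomorphic under this definition.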
Twenty-three complete genomes (NCBI RefSeq) from various isolates of Mycobacterium tuberculosis were processed. A total of 4433 SSRs were detected, from which 414 non-redundant loci were extracted within the species. Polymorphisms for these SSRs were mined from the BLAST server outputs, and several measures reflecting locus variation are reported.




