Volume 2   Issue 1   Year 2007
Chaley M.B., Nazipova N.N., Kutyrkin V.A.

Joint Use of Different Homogeneity Testing Criteria for Latent Periodicity Revelation in Biological Sequences

Mathematical Biology & Bioinformatics. 2007;2(1):20-35.

doi: 10.17537/2007.2.20.


  1. Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucl. Acids Res. 1999;27:573-580. doi: 10.1093/nar/27.2.573
  2. Benson G. A new distance measure for comparing sequence profiles based on path length along an entropy surface. Bioinformatics. 2002;18:S44-S53. doi: 10.1093/bioinformatics/18.suppl_2.S44
  3. Kolpakov R, Bana G, Kucherov G. mreps: efficient and flexible detection of tandem repeats in DNA. Nucl. Acids Res. 2003;31:3672-3678. doi: 10.1093/nar/gkg617
  4. Boeva V, Regnier M, Papatsenko D, Makeev V. Short fuzzy tandem repeats in genomic sequences, identification, and possible role in regulation of gene expression. Bioinformatics. 2006;22:676-684. doi: 10.1093/bioinformatics/btk032
  5. Krishnan A, Tang F. Exhaustive whole-genome tandem repeats search. Bioinformatics. 2004;20:2702-2710.
  6. Collins JR, Stephens RM, Gold B, Long B, Dean M, Burt SK. An exhaustive DNA micro-satellite map of the human genome using high performance computing. Genomics. 2003;82:10-19. doi: 10.1016/S0888-7543(03)00076-4
  7. Denoeud F, Vergnaud G. Identification of polymorphic tandem repeats by direct comparison of genome sequence from different bacterial strains: a web-based resource. BMC Bioinformatics. 2004;5:4. doi: 10.1186/1471-2105-5-4
  8. Le Fleche P, Hauck Y, Onteniente L, Prieur A, Denoeud F, Ramisse V, Sylvestre P, Benson G, Ramisse F, Vergnaud G. A tandem repeats database for bacterial genomes: application to the genotyping of Yersinia pestis and Bacillus anthracis. BMC Microbiol. 2001;1:2. doi: 10.1186/1471-2180-1-2
  9. Naslund K, Saetre P, von Salome J, Bergstrom TF, Jareborg N, Jazin E. Genome-wide prediction of human VNTRs. Genomics. 2005;85:24-35. doi: 10.1016/j.ygeno.2004.10.009
  10. Boby T, Patch AM, Aves SJ. TRbase: a database relating tandem repeats to disease genes for the human genome. Bioinformatics. 2005;21:811-816. doi: 10.1093/bioinformatics/bti059
  11. Missirlis PI, Mead CL, Butland SL, Ouellette BF, Devon RS, Leavitt BR, Holt RA. Satellog: a database for the identification and prioritization of satellite repeats in disease association studies. BMC Bioinformatics. 2005;6:145. doi: 10.1186/1471-2105-6-145
  12. Siwach P, Pophaly SD, Ganesh S. Genomic and evolutionary insights into genes encoding proteins with single amino acid repeats. Mol. Biol. Evol. 2006;23:1357-1369.
  13. Katti MV, Sami-Subbu R, Ranjekar PK, Gupta VS. Amino acid repeat patterns in protein sequences: their diversity and structural-functional implications. Protein Sci. 2000;9:1203-1209. doi: 10.1110/ps.9.6.1203
  14. Tompa P. Intrinsically unstructured proteins evolve by repeat expansion. Bioessays. 2003;25:847-855. doi: 10.1002/bies.10324
  15. Kalita MK, Ramasamy G, Duraisamy S, Chauhan VS, Gupta D. ProtRepeatsDB: a database of amino acid repeats in genomes. BMC Bioinformatics. 2006;7:336. doi: 10.1186/1471-2105-7-336
  16. Turutina VP, Laskin AA, Kudryashov NA, Skryabin KG, Korotkov EV. Identification of amino acid latent periodicity within 94 protein families. J. Comput. Biol. 2006;13:946-964.
  17. Silverman BD, Linsker R. A measure of DNA periodicity. J. Theor. Biol. 1986;118:295-300.
  18. Sharma D, Issac B, Raghava GP, Ramaswamy R. Spectral Repeat Finder (SRF): identification of repetitive sequences using Fourier transformation. Bioinformatics. 2004;20:1405-1412. doi: 10.1093/bioinformatics/bth103
  19. Marple SL. Digital Spectral Analysis with Applications. Baltimore: Prentice-Hall; 1987.
  20. Altaiski M, Mornev O, Polozov R. Wavelet analysis of DNA sequences. Genet. Anal. 1996;12:165-168.
  21. Dodin G, Vandergheynst P, Levoir P, Cordier C, Marcourt L. Fourier and wavelet transform analysis, a tool for visualizing regular patterns in DNA sequences. J. Theor. Biol. 2000;206:323-326. doi: 10.1006/jtbi.2000.2127
  22. Landau G, Schmidt J, Sokol D. An algorithm for approximate tandem repeats. J. Comput. Biol. 2001;8:1-18.
  23. Castelo AT, Martins W, Gao GR. TROLL – tandem repeat occurrence locator. Bioinformatics. 2002;18:634-636. doi: 10.1093/bioinformatics/18.4.634
  24. Hauth AM, Joseph DA. Beyond tandem repeats: complex pattern structures and distant regions of similarity. Bioinformatics. 2002;18:S31-S37. doi: 10.1093/bioinformatics/18.suppl_1.S31
  25. Shulman MJ, Steinberg CM, Westmoreland N. The coding function of nucleotide sequences can be discerned by statistical analysis. J. Theor. Biol. 1981;88:409-420.
  26. Korotkov EV, Korotkova MA, Kudryashov NA. Information decomposition method to analyze symbolical sequences. Phys. Lett. A. 2003;312:198-210.
  27. Korotkova MA, Korotkov EV, Rudenko VM. Latent periodicity in protein sequences. J. Mol. Model. 1999;5:103-115.
  28. Gatherer D, McEwan N. Analysis of sequence periodicity in E. coli proteins. J. Mol. Evol. 2003;57:149-158.
  29. Shelenkov A, Skryabin K, Korotkov E. Search and classification of potential minisatellite sequences from bacterial genomes. DNA Res. 2006;13:89-102. doi: 10.1093/dnares/dsl004
  30. Li W. The study of correlation structures of DNA sequences: a critical review. Computers Chem. 1997;21:257-271. doi: 10.1016/S0097-8485(97)00022-3
  31. Cramer H. Mathematical methods of statistics. Stockholm; 1946.
  32. Kullback S. Information theory and statistics. Dover Publications; 1968.
  33. Chaley MB, Korotkov EV, Skryabin KG. Method revealing latent periodicity of the nucleotide sequences modified for a case of small samples. DNA Res. 1999;6:153-163. doi: 10.1093/dnares/6.3.153
  34. Gribskov M, Lüthy R, Eisenberg D. Profile analysis. Meth. Enzymol. 1990;183:146-159. doi: 10.1016/0076-6879(90)83011-W
published in Russian


