Joint Use of Different Homogeneity Testing Criteria for Latent Periodicity Revelation in Biological Sequences
Chaley M.B., Nazipova N.N., Kutyrkin V.A.
Institute of Mathematical Problems of Biology of Russian Academy of Sciences, Pushchino, Moscow Region, 142290, Russia
Moscow State Technical University n.a. N.E. Bauman, Moscow, 107005, Russia
maramaria@yandex.ru
Abstract. In this work, a model of additional statistical experiments is used to reveal latent periodicity in biological sequences. This model, which generalizes the notion of fuzzy tandem repeats (FTRs), has allowed us to propose original statistical methods for estimating the periodicity pattern in approximate tandem repeats (ATRs). It is shown that when the percentage of indels in approximate tandem repeats is high, in a number of cases the alignment of copies based on the repeat pattern size estimated with this model is better than the alignment obtained by the well-known Tandem Repeats Finder (TRF) method. Compared with existing analogs, the proposed methods have greater power. Their main advantage is that they remain applicable under the practical conditions of an unrepresentative sample.
Keywords: latent periodicity, test-period, profile matrix, spectrum of relative amplitudes