Russian version English version
Volume 17   Issue 1   Year 2022
Coronavirus Genus Recognition Based on Prototype Virus Variants

Chaley M.B.1, Kutyrkin V.A.2

1Institute of Mathematical Problems of Biology RAS, Pushchino, Russia
2Moscow State Technical University n.a. N.E. Bauman, Moscow, Russia

Abstract. Method named as variant approach to recognizing genus of coronavirus that is based on frequency of codon distribution in viral ORF1ab and genes of structural proteins (S, M and N) was proposed in the work. This method uses modified statistics whose efficiency was demonstrated earlier for flavivirus species recognition. To recognize genus of coronavirus the variant approach considers both various combinations of several structural coronavirus genes and individual structural genes. Finally, coronavirus genus is determined in the result of analysis of all variants considered. The method proposed was developed with the help of learning sample from prototype viral variants of Alphacoronavirus, Betacoronavirus, Deltacoronavirus and Gammacoronavirus genus. Application of the variant approach to recognizing genus of coronavirus has demonstrated the approach high assurance at level of 95 %. Among all variants of joint analysis, the most reliability (98 %) in recognizing genus has been achieved if codon frequency of the ORF1ab was used. Variant approach has revealed a phenomenon of mosaic structure in coronavirus genomes, i.e., when the results of genus recognition for a few genes differ from final conclusion about coronavirus genus. It seems that such phenomenon reflects homologous recombinations of the genes between various species of the coronaviruses and plasticity of their genomes in evolutionary processes.

Key words: coronavirus genome, ORF1ab, S-gene, M-gene, N-gene, statistical analysis, variant approach to recognizing coronavirus genus.

 
Table of Contents Original Article
Math. Biol. Bioinf.
2022;17(1):10-27
doi: 10.17537/2022.17.10
published in Russian

Abstract (rus.)
Abstract (eng.)
Full text (rus., pdf)
References

 

  Copyright IMPB RAS © 2005-2024