519.25 Stochastic coding models and recognition of structural and statistical characteristics of coding sequences

Kutyrkin V. A. (-), Chaley M. B. (-)

PROFILE LINE OF A RANDOM STRING, PROFILE PERIODIC BEHAVIOUR, PERIODIC BEHAVIOUR PATTERN, STOCHASTIC CODON, MULTI-POLYNOMIAL MODEL


doi: 10.18698/2309-3684-2017-3-119138


The paper introduces stochastic models explaining real characteristic regularities of coding regions from genomes of various organisms. Due to the growing volume of data on sequenced genomes, there arises a problem of its computer-aided analysis. By using these models, we developed methods for recognizing the structural and statistical properties of genomic DNA sequences, which can be used to find algorithms and computer programs for the automated processing of large amounts of data. The properties of the proposed stochastic coding models are demonstrated in numerical experiments with binary recoded paragraphs of literary works in English and Italian.


[1] Aleksandrov A.A., Dimitrienko Yu.I. Matematicheskoe modelirovanie i chislennye metody — Mathematical Modeling and Computational Methods, 2014, no. 1 (1), pp. 3–4. DOI 10.18698/2309-3684-2014-1-None
[2] Zarubin V.S., Kuvyrkin G.N. Matematicheskoe modelirovanie i chislennye metody — Mathematical Modeling and Computational Methods, 2014, no. 1 (1), pp. 5–17. DOI 10.18698/2309-3684-2014-1-517
[3] Chaley M., Kutyrkin V. Journal of Theoretical Biology, 2016, vol. 390, pp. 106–116.
[4] Kutyrkin V.A., Chaley M.B. Matematicheskaya biologiya i bioinformatika — Mathematical Biology and Bioinformatics, 2016, vol. 11, no. 1, pp. 24 – 45. DOI 10.17537/2016.11.24
[5] Chaley M., Kutyrkin V. Spectral-statistical approach for revealing latent regular structures in DNA sequence. Data Mining Techniques for the Life Sciences. New York, Springer Science+Business Media, 2016, pp. 315–340.
[6] Chaley М., Kutyrkin V. Mathematical Biosciences, 2008, vol. 211, no. 1, pp. 186–204. DOI 10.1016/j.mbs.2007.10.008
[7] Chaley M.B., Kutyrkin V.A. Moscow University Biological Sciences Bulletin, 2010, vol. 65, no. 4, pp. 133–135.
[8] Chaley M., Kutyrkin V. DNA Research, 2011, vol. 18, no. 5, pp. 353–362. DOI 10.1093/dnares/dsr023
[9] Kutyrkin V.A., Chaley M.B. Matematicheskaya biologiya i bioinformatika — Mathematical Biology and Bioinformatics, 2014, vol. 9, no. 1, pp. 33–62. DOI 10.17537/2014.9.33
[10] Benson G. Nucleic Acids Research, 1999, vol. 27, pp. 573–580.
[11] Sánchez J. Bioinformation, 2011, vol. 6, pp. 327–329.
[12] Sokol D., Benson G., Tojeira J. Bioinformatics, 2007, vol. 23, pp. 30–35. DOI 10.1093/bioinformatics/btl309
[13] Marhon S.A., Kremer S.C. Journal Computational Biology, 2010, vol. 18, pp. 639–676. DOI 10.1089/cmb.2010.0184
[14] Issac B., Singh H., Kaur H., Raghava G.P.S. Bioinformatics, 2002, 18, pp. 196–197.
[15] Howe E.D., Song J.S. Nucleic Acids Research, 2013, vol. 41, pp. 1395–1405. DOI 10.1093/nar/gks1261
[16] Kutyrkin V.A., Chaley M.B. Inzhenerny zhurnal: nauka i innovatsii — Engineering Journal: Science and Innovation, 2012, no 2.
DOI 10.18698/2308-6033-2012-2-46
[17] KEGG. Kyoto encyclopedia of genes and genomes. Available at:
http://www.kegg.jp (accessed November 23, 2017).


Kutyrkin V.A. ,Chaley M.B.Stochastic coding models and recognition of structural and statistical characteristics of coding sequences.Маthematical Modeling and Computational Methods, 2017, №3 (15), pp. 119–138



Download article

Колличество скачиваний: 194