Español
Abstract
Within development of Bioinformatics, pattern mining has become
a critical point of attention. Patterns on Biological sequences,
specially repeated patterns, usually shows relevant structural
o functional features. As a result of several studies, Susumu
Ohno proposed a set of rules that most species should fulfill
in order to accomplisg their evolution aspects. The present work
validates such rules using frequent pattern mining and Markov
Chain Techniques assessing 32,074 DNA sequences from diverse
biological organisms from GenBank Biological database
as a part of National Center of Biotechnology Information from
National Health Institute of the United States. This could identify
organisms that possess particularities that diers on their
evolutionWithin development of Bioinformatics, pattern mining has become
a critical point of attention. Patterns on Biological sequences,
specially repeated patterns, usually shows relevant structural
o functional features. As a result of several studies, Susumu
Ohno proposed a set of rules that most species should fulfill
in order to accomplisg their evolution aspects. The present work
validates such rules using frequent pattern mining and Markov
Chain Techniques assessing 32,074 DNA sequences from diverse
biological organisms from GenBank Biological database
as a part of National Center of Biotechnology Information from
National Health Institute of the United States. This could identify
organisms that possess particularities that diers on their
evolution
Downloads
References
Aggarwal, C. C., Han, J., 2014. Frequent pattern mining. Springer.
Agrawal, R., Imielinski, T., Swami, A., 1993. Mining association rules between sets of items in large databases. In: Acm sigmod record. Vol. 22. ACM, pp. 207–216.
Calvo, A., 2015. Biología celular biomédica + StudentConsult en español. Elsevier, Barcelona.
Campbell, N., Urry, L. A., Cain, M. L., Wasserman, S. A., Minorsky, P. V., Jackson, R. B., Reece, J. B., 2014. Campbell biology. Pearson, Boston.
Crick, F., 1958. On protein synthesis. In: Symposium of the Society for Experimental Biology XII. New York: Academic Press.
Hall, B. K., 2003. Unlocking the black box between genotype and phenotype: Cell condensations as morphogenetic (modular) units. Biology and Philosophy 18 (2), 219–247.
Hossain, K. S. M. T., Patnaik, D., Laxman, S., Jain, P., Bailey-Kellogg, C., Ramakrishnan, N., Sept 2013. Improved multiple sequence alignments using coupled pattern mining. IEEE/ACM Transactions on Computational Biology and Bioinformatics 10 (5), 1098–1112. DOI: 10.1109/TCBB.2013.36
Junyan, Z., Chenhui, Y., 2015. Sequence pattern mining based on markov chain. In: Information Technology in Medicine and Education (ITME), 2015 7th International Conference on. IEEE, pp. 234–238.
Kumar, V., Sharma, A., Kaur, R., Thukral, A. K., Bhardwaj, R., Ahmad, P., 2017. Differential distribution of amino acids in plants. Amino Acids 49, 821–869.
Liu, X., Wu, J., Gong, H., Deng, S., He, Z., Sept 2014. Mining conditional phosphorylation motifs. IEEE/ACM Transactions on Computational Biology and Bioinformatics 11 (5), 915–927. DOI: 10.1109/TCBB.2014.2321400
Medeiros, I., Neves, N., Correia, M., March 2016. Detecting and removing web application vulnerabilities with static analysis and data mining. IEEE Transactions on Reliability 65 (1), 54–69. DOI: 10.1109/TR.2015.2457411
Mutakabbir, K. M., Mahin, S. S., Hasan, M. A., 2014. Mining frequent pattern within a genetic sequence using unique pattern indexing and mapping techniques. In: Informatics, Electronics & Vision (ICIEV), 2014 International Conference on. IEEE, pp. 1–5.
National Center of Biotechnology Information, 11 2017. Gen bank and wgs statistics. consultado el 15-04-2019 desde https://www.ncbi.nlm.nih.gov/genbank/.
Ohno, S., 1988. Universal rule for coding sequence construction: Ta/cg deficiency-tg/ct excess. Proceedings of the National Academy of Sciences 85 (24), 9630–9634.
Osawa, S., Honjo, T., 8 2015. Evolution of Life. Fossils, Molecules, and Culture. Springer-Verlag. DOI: 10.1046/j.1420-9101.1992.5040725.x
Tahir, M., Hayat, M., Kabir, M., 2017. Sequence based predictor for discrimination of enhancer and their types by applying general form of chou’s trinucleotide composition. Computer Methods and Programs in Biomedicine 146, 69 – 75. DOI: https://doi.org/10.1016/j.cmpb.2017.05.008