Español

  • Luis H. García-Islas Universidad Autónoma del Estado de Hidalgo
  • Anilu Franco-Arcega Universidad Autónoma del Estado de Hidalgo
  • Antonio Quiroz-Gutierrez Universidad Autónoma del Estado de Hidalgo
  • Kristell D. Franco-Sanchez Universidad Autónoma del Estado de Hidalgo
Keywords: Data mining, Frequent pattern, MArkov Chain, DNA, Bioinformatics

Abstract

Within development of Bioinformatics, pattern mining has become
a critical point of attention. Patterns on Biological sequences,
specially repeated patterns, usually shows relevant structural
o functional features. As a result of several studies, Susumu
Ohno proposed a set of rules that most species should fulfill
in order to accomplisg their evolution aspects. The present work
validates such rules using frequent pattern mining and Markov
Chain Techniques assessing 32,074 DNA sequences from diverse
biological organisms from GenBank Biological database
as a part of National Center of Biotechnology Information from
National Health Institute of the United States. This could identify
organisms that possess particularities that diers on their
evolutionWithin development of Bioinformatics, pattern mining has become
a critical point of attention. Patterns on Biological sequences,
specially repeated patterns, usually shows relevant structural
o functional features. As a result of several studies, Susumu
Ohno proposed a set of rules that most species should fulfill
in order to accomplisg their evolution aspects. The present work
validates such rules using frequent pattern mining and Markov
Chain Techniques assessing 32,074 DNA sequences from diverse
biological organisms from GenBank Biological database
as a part of National Center of Biotechnology Information from
National Health Institute of the United States. This could identify
organisms that possess particularities that diers on their
evolution

Downloads

Download data is not yet available.

References

Aggarwal, C. C., Han, J., 2014. Frequent pattern mining. Springer.

Agrawal, R., Imielinski, T., Swami, A., 1993. Mining association rules between sets of items in large databases. In: Acm sigmod record. Vol. 22. ACM, pp. 207–216.

Calvo, A., 2015. Biología celular biomédica + StudentConsult en español. Elsevier, Barcelona.

Campbell, N., Urry, L. A., Cain, M. L., Wasserman, S. A., Minorsky, P. V., Jackson, R. B., Reece, J. B., 2014. Campbell biology. Pearson, Boston.

Crick, F., 1958. On protein synthesis. In: Symposium of the Society for Experimental Biology XII. New York: Academic Press.

Hall, B. K., 2003. Unlocking the black box between genotype and phenotype: Cell condensations as morphogenetic (modular) units. Biology and Philosophy 18 (2), 219–247.

Hossain, K. S. M. T., Patnaik, D., Laxman, S., Jain, P., Bailey-Kellogg, C., Ramakrishnan, N., Sept 2013. Improved multiple sequence alignments using coupled pattern mining. IEEE/ACM Transactions on Computational Biology and Bioinformatics 10 (5), 1098–1112. DOI: 10.1109/TCBB.2013.36

Junyan, Z., Chenhui, Y., 2015. Sequence pattern mining based on markov chain. In: Information Technology in Medicine and Education (ITME), 2015 7th International Conference on. IEEE, pp. 234–238.

Kumar, V., Sharma, A., Kaur, R., Thukral, A. K., Bhardwaj, R., Ahmad, P., 2017. Differential distribution of amino acids in plants. Amino Acids 49, 821–869.

Liu, X., Wu, J., Gong, H., Deng, S., He, Z., Sept 2014. Mining conditional phosphorylation motifs. IEEE/ACM Transactions on Computational Biology and Bioinformatics 11 (5), 915–927. DOI: 10.1109/TCBB.2014.2321400

Medeiros, I., Neves, N., Correia, M., March 2016. Detecting and removing web application vulnerabilities with static analysis and data mining. IEEE Transactions on Reliability 65 (1), 54–69. DOI: 10.1109/TR.2015.2457411

Mutakabbir, K. M., Mahin, S. S., Hasan, M. A., 2014. Mining frequent pattern within a genetic sequence using unique pattern indexing and mapping techniques. In: Informatics, Electronics & Vision (ICIEV), 2014 International Conference on. IEEE, pp. 1–5.

National Center of Biotechnology Information, 11 2017. Gen bank and wgs statistics. consultado el 15-04-2019 desde https://www.ncbi.nlm.nih.gov/genbank/.

Ohno, S., 1988. Universal rule for coding sequence construction: Ta/cg deficiency-tg/ct excess. Proceedings of the National Academy of Sciences 85 (24), 9630–9634.

Osawa, S., Honjo, T., 8 2015. Evolution of Life. Fossils, Molecules, and Culture. Springer-Verlag. DOI: 10.1046/j.1420-9101.1992.5040725.x

Tahir, M., Hayat, M., Kabir, M., 2017. Sequence based predictor for discrimination of enhancer and their types by applying general form of chou’s trinucleotide composition. Computer Methods and Programs in Biomedicine 146, 69 – 75. DOI: https://doi.org/10.1016/j.cmpb.2017.05.008

Published
2019-07-05
How to Cite
García-Islas, L. H., Franco-Arcega, A., Quiroz-Gutierrez, A., & Franco-Sanchez, K. D. (2019). Español. Pädi Boletín Científico De Ciencias Básicas E Ingenierías Del ICBI, 7(13), 84-89. https://doi.org/10.29057/icbi.v7i13.4253

Most read articles by the same author(s)