Using Decision Trees to Predict Critical Reading Performance.
DOI:
https://doi.org/10.19053/01211129.v30.n58.2021.13792Keywords:
academic performance, critical reading, decision trees, J48 algorithm, Saber ProAbstract
In Colombia, all undergraduate students, regardless of the professional training program they take, must complete the general competencies sections of the Saber Pro exam that include Critical Reading, Quantitative Reasoning, Citizen Competencies, Written Communication, and English. This paper presents the application of the classification technique based on decision trees in the prediction of the performance in the Critical Reading section presented by the students of the Pontificia Universidad Javeriana Cali in the years 2017 and 2018. The CRISP methodology was used. From the socioeconomic, academic and institutional data stored in the ICFES databases, a data repository was built, cleaned and transformed. A mineable view composed of 2052 records and 17 attributes was obtained. The J48 algorithm of the Weka tool was used to build the decision tree. The score obtained in the Critical Reading section of the Saber Pro exam was taken as a class. According to the results obtained, the Philosophy, Applied Mathematics, and Medicine programs stood out for having the best performance in this test. Among the predictive variables associated with performance in the Critical Reading skill are the faculty, the age group and the student's transportation index, as three important variables related to the good or low academic performance of the students of the Universidad Javeriana Cali. The knowledge generated in this research is constituted in quality information to support the decision-making process of the university directives in order to improve the quality of the higher education offered in this institution.
Downloads
References
Icfes, Saber Pro: Módulos de Competencias Genéricas 2017. Instituto Colombiano para la Evaluación de la Educación Superior, Bogotá D.C., Colombia, 2017. https://www.icfes.gov.co/documents/20143/495161/Guia%20de%20orientacion%20modulos%20de%20competencias%20genericas-saber-pro-2017.pdf
Icfes, Guía de orientación Saber Pro: Módulos de competencias genéricas, Bogotá D.C., Colombia, 2018. https://www.icfes.gov.co/documents/20143/496194/Guia%20de%20orientacion%20modulos%20de%20competencias%20genericas-saber-pro-2018.pdf
R. Timarán, I. Hernández, J. Caicedo, A. Hidalgo, J. Alvarado, Descubrimiento de patrones de desempeño académico, Bogotá, Colombia: Ediciones Universidad Cooperativa de Colombia, 2016. DOI: https://doi.org/10.16925/9789587600490
Icfes, Informe nacional de resultados Saber Pro 2012-2015, Bogotá D.C., Colombia, 2016. https://www.icfes.gov.co/documents/20143/194324/Informe%20nacional%20de%20resultados%20saber%20pro%202012%20-%202015.pdf
L. Zapata, Factores académicos asociados al bajo rendimiento en inglés en las pruebas ECAES presentadas por los estudiantes de la Facultad de Educación en el año 2009, Grade Thesis, Fundación Universitaria Luis Amigó, Medellín, Colombia, 2011
UNAL, Análisis de los resultados obtenidos por la Universidad Nacional de Colombia sede Bogotá en las pruebas Saber Pro 2011–2, Bogotá D.C., Colombia, 2012. https://www.unal.edu.co/diracad/evaluacion/SaberPro_2012/analisis_de_resultados.pdf
R. Timarán, A. Calderón, J. Jiménez, Detección de Patrones de Deserción Estudiantil con Minería de Datos, San Juan de Pasto, Colombia: Editorial Universidad de Nariño, 2017
S. Valero, A. Vargas, M. Alonso, Minería de datos: predicción de la deserción escolar mediante el algoritmo de árboles de decisión y el algoritmo de los k vecinos más cercanos, 2005. http://fcaenlinea.unam.mx/anexos/1566/1566_u6_act1b.pdf.
H. Escobar, M. Alcívar, C. Márquez, C. Escobar, “Implementación de Minería de Datos en la Gestión Académica de las Instituciones de Educación Superior,” Didasc@lia: Didáctica y Educación, vol. 8, no. 3, pp. 203-212, 2017
A. Azevedo, M. Santos, “KDD, SEMMA and CRISP-DM: a parallel overview,” in Proceedings of IADIS European Conference on Data Mining, pp. 182-185, 2008
J. Villena, CRISP-DM: La metodología para poner orden en los proyectos de Data Science, 2016. https://data.sngular.team/es/art/25/crisp-dm-la-metodologia-para-poner-orden-en-los-proyectos-de-data-science
J. Hernández, M. Ramírez, C. Ferri, Introducción a la Minería de Datos. Madrid, España: Editorial Pearson Educación S.A., 2005
J. Han, M. Kamber, Data Mining: Concepts and Techniques. San Francisco, USA: Morgan Kaufmann Publishers, 2001
K. Sattler, O. Dunemann, “SQL Database Primitives for Decision Tree Classifiers,” in 10th ACM International Conference on Information and Knowledge Management, pp. 379-386, 2001
R. Timarán, J. Caicedo, A. Hidalgo, Aplicación de la minería de datos en la detección de patrones de desempeño académico en las pruebas Saber Pro, San Juan de Pasto, Colombia: Editorial Universidad de Nariño, 2021
E. Hernández, R. Lorente, Minería de datos aplicada a la detección de Cáncer de Mama. Universidad Carlos III, Madrid, Spain, 2009. http://www.it.uc3m.es/jvillena/irc/practicas/08-09/14.pdf
I. Witten, E. Frank, M. Hall, Data Mining: Practical Machine Learning Tools and Techniques (Third Edition). New York, USA: Morgan Kaufmann, 2011. DOI: https://doi.org/10.1016/C2009-0-19715-5
J. R. Quinlan, Programs for Machine Learning. San Francisco, USA: Morgan Kaufmann Publishers, 1993
M. García, A. Álvarez, Análisis de Datos en WEKA: Pruebas de Selectividad, 2010. http://www.it.uc3m.es/jvillena/irc/practicas/06-07/28.pdf
Published
How to Cite
Issue
Section
License
Copyright (c) 2021 Andrea Timaran-Buchely, Silvio-Ricardo Timarán-Pereira, Arsenio Hidalgo-Troya

This work is licensed under a Creative Commons Attribution 4.0 International License.
All articles included in the Revista Facultad de Ingeniería are published under the Creative Commons (BY) license.
Authors must complete, sign, and submit the Review and Publication Authorization Form of the manuscript provided by the Journal; this form should contain all the originality and copyright information of the manuscript.
The authors who publish in this Journal accept the following conditions:
a. The authors retain the copyright and transfer the right of the first publication to the journal, with the work registered under the Creative Commons attribution license, which allows third parties to use what is published as long as they mention the authorship of the work and the first publication in this Journal.
b. Authors can make other independent and additional contractual agreements for the non-exclusive distribution of the version of the article published in this journal (eg, include it in an institutional repository or publish it in a book) provided they clearly indicate that the work It was first published in this Journal.
c. Authors are allowed and recommended to publish their work on the Internet (for example on institutional or personal pages) before and during the process.
review and publication, as it can lead to productive exchanges and a greater and faster dissemination of published work.
d. The Journal authorizes the total or partial reproduction of the content of the publication, as long as the source is cited, that is, the name of the Journal, name of the author (s), year, volume, publication number and pages of the article.
e. The ideas and statements issued by the authors are their responsibility and in no case bind the Journal.