Skip to main content

7.4.- Classification, Regression, Clustering and Association Rules in Spark

In the following content, Francisco Javier García Castellano, Associate Professor in Computer Science and Artificial Intelligence at the University of Granada, will present us with different examples of classification algorithms, regression, clustering and association rules using Apache Spark.

The example has been designed using the Python programming language, and this document details the theoretical framework of the methods and/or instructions used, as well as the Python code that would be used to carry out each of the steps. of the analysis. The evaluation of the module will be carried out based on the information contained in it as well as the content of the videos of the module. Additionally, although it is not necessary to pass the module, for those who want to execute the commands and have previous knowledge with Jupyter Notebooks, we have made the Python Notebook available to you where you can execute the code. To do this, you must enter the following link with a gmail account. Notebook / Notebook (link to Google Collaboratory).
Last modified: Tuesday, 15 March 2022, 1:39 PM