7.4.- Classification, Regression, Clustering and Association Rules in Spark
In the following content, Francisco Javier García Castellano, Associate Professor in Computer Science and Artificial Intelligence at the University of Granada, presents examples of classification, regression, clustering and association rule algorithms using Apache Spark.
The example has been designed using the Python programming language. This document details the theoretical framework of the methods and instructions used, as well as the Python code used to carry out each step of the analysis. The module will be evaluated on the information contained in this document and on the content of the module's videos.
Additionally, although it is not required to pass the module, we have made the Python notebook available for those who want to run the commands and have previous experience with Jupyter Notebooks. To open it, follow the link below while signed in to a Gmail account: Notebook (link to Google Colaboratory).
Last modified: Tuesday, 15 March 2022, 1:39 PM