Advanced Spark for Developers
The Advanced Spark for Developers Course will help trainees get a proper understanding of the internal structure and functioning of Apache Spark – Spark Core (RDD), Spark SQL and Spark Streaming.
It’s a practical course in machine learning. This course covers the entire lifecycle of the solution – from initial data capture (“.csv file”) through building a model to explaining data and outcomes to the customer. The theory on classification, regression, predictions, and ensembles – is provided to the extent required for the correct understanding of discussed cases and building solutions for them.
To be determined
This course is built around some practical cases; datasets are included.
For each case, we go through the entire life cycle of a machine learning project:
A part of the course will be devoted to discussing practical tasks that trainees deal with, which can be solved by using reviewed methods.
Primary:
Secondary:
Ability to read and write code in Python, using numpy and sklearn libraries, and strong knowledge of statistics and calculus.
Total: theory 12h, practice 12h
Denys Zamyatin