Apache Spark Fundamentals
Czas trwania
26
hours
Lokalizacja
Online
Język
English
Kod
EAS-017
Szkolenie dla #uczestników# lub większej liczby osób?
Dostosuj treningi
dla Twoich konkretnych potrzeb
Opis
Po ukończeniu kursu na formularzu Luxoft Training
wydawany jest certyfikat
wydawany jest certyfikat
Cele
During the training participants will:
- Write a Spark pipeline via functional Python and RDDs;
- Write a Spark pipeline via Python, Spark DSL, Spark SQL and DataFrame;
- Draw architecture with different sources;
- Write a Spark pipeline with external systems (Kafka, Cassandra, Postgres) which works in parallel modes;
- Resolve problems with slow joins.
After the training, participants will be able to build a simple PySpark application and execute it on the cluster in parallel mode.
Grupa docelowa
- Software developers
- Software architects
Warunki wstępne
Basic Java, Python, Scala programming skills. Unix/Linux shell familiarity. Experience with databases is optional.
Plan działania
- Spark concepts and architecture
- Programming with RDDs: transformations and actions
- Using key/value pairs
- Loading and storing data
- Accumulators and broadcast variables
- Spark SQL, DataFrames, Datasets
- Spark Streaming
- Machine Learning using MLLib and Spark ML
- Graph analysis using GraphX