Machine Learning in Practice

It’s a practical course in machine learning. This course covers the entire lifecycle of the solution – from initial data capture (“.csv file”) through building a model to explaining data and outcomes to the customer. The theory on classification, regression, predictions, and ensembles – is provided to the extent required for the correct understanding of discussed cases and building solutions for them.

24 hours
English
Online

Description

This course is built around some practical cases; datasets are included.

For each case, we go through the entire life cycle of a machine learning project:

Exploring, cleaning, and preparing data;
Selecting a learning method to match the task (linear regression for regression, random forest for classification, K-average and DBSCAN for clustering);
Learning with the use of the selected method;
Outcome assessment;
Model optimization;

A part of the course will be devoted to discussing practical tasks that trainees deal with, which can be solved by using reviewed methods.

Objectives

Understand what tasks can be solved with the help of machine learning (and find out that Big Data is just a subsection, not a mandatory requirement).
Learn how to utilize basic methods of machine learning and how to use fast prototyping tools to answer the question, “Can you evaluate an actual return from possible implementation?”
Highlight data that should be collected and what can be required from it in the near future. Why “we want to store petabytes” – it’s not always just a whim.
Get prepared for more complex themes, particularly to complete solutions to real complex business problems.
See how exactly machine learning fits with classical analytics. In particular, make sure that it’s unnecessary (or even harmful) to dismiss all existing analysts for concept implementation.

Roadmap

Task overview (theory - 1 hour)
Tasks that are better solved by machine learning. What will happen if instead of a Data Scientist you hire a non-specialist in a given domain (just a developer/analyst/manager), expecting that they will learn everything in the process.
Preparing, cleaning, and exploring data (theory - 1 hour; practice – 1 hour)
How to gain insight into data provided by business (and find whatever order in it at all). Processing steps. What can and should be done by domain analysts, and what should better be done by a Data Scientist. Priorities in solving a specific task.
Classifiers and Regressors (theory - 2 hours; practice – 2 hours)
Practice – well formalized tasks with prepared data. Differences between tasks (binary/nonbinary/probabilistic classification, regression), redistribution of tasks across classes. Examples of practical tasks classification.
Clustering (theory - 1 hour; practice – 2 hours)
Where and how to do clustering: exploring data, task setting check, and validation of results. Which cases can be reduced to clustering.
Model evaluation (theory - 1 hour; practice – 1 hour)
Business metrics and technical metrics. Metrics for tasks of classification and regression, error matrix. Internal and external metrics of clustering quality. Cross validation. Overfitting.
Optimization (theory - 2 hours; practice – 2 hours)
What makes one model better than another: parameters, traits, and ensembles. Parameter management. Traits selection practice. Overview of tools for searching best parameters/traits/methods.
Neural networks (theory 2 hours; practice – 2 hours). Gradient descent, backpropagation, generative neural networks, convolutions, recurrency, batch normalization, dropout, activation functions, batching
Graphs, reports, dealing with real-life tasks (theory - 1 hour; practice – 2 hours).
How to visualize and present results. Semi-automated tests, process control points. From real-life tasks to complete R&D process (“R&D in practice”) – reviewing and analyzing tasks from the audience.
Data science interview questions and answers (theory – 1 hour)

Total: theory 12h, practice 12h

Related courses

Unlock the power of big data analytics with "BigData SQL Hive." This course dives deep into Apache Hive, covering everything from architecture and data types to complex queries, transactions, and performance tuning. Perfect for data professionals looking to enhance their SQL skills in a big data environment.

Dive deep into the world of Reinforcement Learning (RL) with our "Reinforcement Learning - From Fundamentals to Deep RL" course. Learn the mathematical foundations, explore key RL algorithms, and master advanced techniques in deep reinforcement learning. Perfect for aspiring data scientists and AI researchers aiming to leverage RL in real-world applications.

Master the fundamentals of data warehousing with our "Data Warehouse Fundamentals" course. Explore key concepts, architectures, and methodologies from Inmon, Kimball, and DataVault. Understand how data governance and design methods shape modern data warehouses. Ideal for those looking to build robust, scalable data systems.

Machine Learning in Practice

Description

Objectives

Target Audience

Prerequisites

Roadmap

Related courses

BigData SQL Hive

Reinforcement Learning - from Fundamentals to Deep RL

Data Warehouse Fundamentals

BigData SQL Hive

Reinforcement Learning - from Fundamentals to Deep RL

Data Warehouse Fundamentals

You may also be interested in

Discover more about professional growth and skills development