
Course for industry, students
Course duration: 7 weeks (28 hours)
Online teaching
Language: English
Course held between 18 May – 30 June 2022
About this course
Specific Skills Acquired: The course focuses on gaining practical skills in working with machine learning tools and libraries in the Python ecosystem. Participants will gain practical experience in building ML models, starting from the analysis of raw data and ending with the evaluation of the model and the hyperparameter tuning, using both traditional algorithms and neural networks. Among the tools mastered by the participants are the libraries of pandas, matplotlib, sckit-learn, keras, etc.
General Objectives: This course is an introductory course in machine learning. The main goal of the course is to provide participants with a general understanding of the problems solved by machine learning methods and basic algorithms for solving them. The course has a practical orientation, so all stages of working with this data are accompanied by practical examples.
Course content
General formulation of the machine learning task. Classification of ML tasks.
- Data types and control flow
- Containers
- Functions
- Files I/O
- Modules and packages
- Object-oriented programming
Basic syntactic constructionsand tools of the Python language, which are necessary for solving machine learning problems
- numpy
- pandas
- matplotlib, seaborn
Practical introduction to the main Python libraries used for data processing
- scklearn library
- Nearest neighbors classifiers
- Decision trees
- Bayesian classifiers
- Linear classifiers, SVM
- Regression algorithms
- Clustering: KMeans, DBSCAN
- Neural networks (Keras library)
- Convolutional neural networks (Keras library)
Application of basic ML algorithms for solving classification, regression and clustering task using libraries scikit-learn and keras
- Hadoop architecture
- Hadoop ecosystem: Pig, Hive
- NoSQL databases: MongoDB,Redis
- Spark
Traditional and modern big data tools with application examples