Posted on 1 October 2021
in lectures
The “API in data mining” course familiarizes students with the tools, libraries and cloud solutions used in data mining. The module introduces the basic terms and concepts of modern data processing pipeline architecture. During the course, students learn to integrate libraries and data mining tools and to make them communicate with each other. After completing the course, participants will have the basic skills needed to work with advanced data tools and techniques and to design and implement complete cloud-based applications for processing and collecting data.
Topics:
Definition of API and data mining concepts; application of APIs in data mining
Cloud computing challenges, ETL and ELT
Organizational specifics and roles related to working with data
The structure of applications that use machine learning models and big data services
Integration with big data and machine learning services within AWS / Google Cloud AI
Designing a data processing pipeline that uses web services to process and collect data
Data import and preparation, data storage and structuring
Creating a data lake, creating a data warehouse
Big Data processing
Scaling, containerization and microservices architecture in modern development and production environments
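
To give a feel for one of the topics above, here is a minimal sketch of the difference between ETL and ELT. It is only an illustration, not course material: the sample records, the table names and the helper functions `etl` and `elt` are all hypothetical, and an in-memory SQLite database stands in for a real cloud data store.

```python
import sqlite3

# Hypothetical raw records, as they might arrive from an API response.
RAW_ROWS = [
    {"city": " Warsaw ", "temp_c": "21.5"},
    {"city": "krakow", "temp_c": "19.0"},
]


def etl(conn, rows):
    """ETL: transform in application code, then load the cleaned rows."""
    conn.execute("CREATE TABLE clean_etl (city TEXT, temp_c REAL)")
    cleaned = [(r["city"].strip().title(), float(r["temp_c"])) for r in rows]
    conn.executemany("INSERT INTO clean_etl VALUES (?, ?)", cleaned)


def elt(conn, rows):
    """ELT: load the raw rows unchanged, then transform inside the store with SQL."""
    conn.execute("CREATE TABLE raw (city TEXT, temp_c TEXT)")
    conn.executemany("INSERT INTO raw VALUES (:city, :temp_c)", rows)
    conn.execute(
        "CREATE TABLE clean_elt AS "
        "SELECT TRIM(city) AS city, CAST(temp_c AS REAL) AS temp_c FROM raw"
    )


# SQLite in memory is an assumption made to keep the sketch self-contained.
conn = sqlite3.connect(":memory:")
etl(conn, RAW_ROWS)
elt(conn, RAW_ROWS)
```

In the ETL variant only cleaned data ever reaches the store, while in the ELT variant the raw table is kept and can be transformed again later with different SQL.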
AWS, Data engineering, Python