Who Is This Course For
Our comprehensive data engineering training program is designed for individuals at different stages of their data career journey. Whether you're just starting out or looking to elevate your existing skills, our course caters to a variety of backgrounds and experiences:
● Definition and scope
● Importance in modern data-driven businesses
Azure Overview
● Introduction to Azure services relevant to data engineering
● Setting up an Azure account and resources
● Key skills and knowledge to be gained
● Overview of the bootcamp structure and learning outcomes
Learning Path
● Modules, projects, and assessments
● Expectations for hands-on labs and practical assignments
● Identifying stakeholders and their end goals
● Translating business needs into technical requirements
Case Studies
● Real-world examples of data engineering solutions
● Definition and significance
● Common data wrangling tasks
● Overview of Azure Storage services
Introduction to Data Lakes
● Azure Data Lake Storage (ADLS)
● Setting up and configuring ADLS
● Best practices for data storage and management
● DBMS, RDBMS and SQL world
Data Modeling Concepts
● Types of data models (relational, dimensional, NoSQL)
● Importance of a well-designed data model
Designing Data Models
● Tools and techniques for data modeling
● Practical examples and case studies
● Definition and importance
● Key components of orchestration
Azure Data Factory
● Overview and capabilities
● Creating and managing data pipelinesEnhance your workforce's expertise in data engineering to improve project outcomes and drive business success.
● Key concepts and benefits
Introduction to big data tools in Azure (Spark & Databricks)
● Security in Big Data
● Ensuring data security and compliance
● Best practices and tools for data protection
● Understanding data sources and ingestion methods
● Real-time vs. batch ingestion
Azure Data Factory
● Configuring data ingestion pipelines
● Hands-on examples
● Importance and types of data transformation
● ETL vs ELT
Tools for Data Transformation
● Using Azure Data Factory and Databricks for data transformation
● Hands-on labs and exercises
● Importance of reporting in data engineering
● Types of reports and dashboards
Self-Service BI Tools
● Power BI and Azure Analysis Services
● Creating and sharing reports
● Definition and key features
● Benefits of lakehouse architecture
Implementing a Lakehouse in Azure
● Tools and best practices
● Case studies and examples
● Definition and components
● Benefits and use cases
Designing Medallion Architecture in Azure
● Practical steps and tools
● Hands-on exercises
● Overview of Databricks platform and community
● Key features and benefits
Engaging with the Databricks Community
● Resources, forums, and support
● Collaborative projects and contributions
● Designing complex data pipelines
● Monitoring and managing pipelines
Hands-on Labs
● Building and deploying pipelines in Azure Data Factory
● Comparison of Azure Data Factory, Dagster, and others
● When to use which tool
Practical Examples
● Implementing orchestration with different tools
● Key features and benefits
● Spark in Azure (Databricks)
End-to-End Workflow
● Data ingestion, processing, and analysis with Spark
● Hands-on project
● Key features and architecture
End-to-End Workflow
● Building a complete data analytics solution in Databricks
● Hands-on project
● End-to-End Workflow
● Implementing data quality in your ELT/ETL processes
● Key concepts and importance
● Tools and practices in Azure
Implementing CI/CD for Data Pipelines
● Practical steps and tools
● Hands-on examples
● Machine learning and AI in data engineering
● Emerging technologies and future trends
● Spark/Databricks troubleshooting and optimisation
Group Project
● End-to-end project encompassing all learned skills in a group
● Presentation and feedback Sessions