Big Data Analytics with PySpark + Power BI + MongoDB
Introduction
Project Files
Python Installation
Installing Apache Spark
Installing Java (Optional)
Testing Apache Spark Installation
Installing MongoDB
Installing NoSQLBooster for MongoDB
Integrating PySpark with Jupyter Notebook
Data Extraction
Data Transformation
Loading Data into MongoDB
Data Pre-processing
Building the Predictive Model
Creating the Prediction Dataset
Installing Visual Studio Code IDE
Creating the PySpark ETL Script
Creating the Machine Learning Script
Installing Power BI Desktop
Installing MongoDB ODBC Drivers
Creating a System DSN for MongoDB
Loading the Data Sources
Creating a Geo Map
Creating a Table Plot
Creating an Area Chart
Creating a Bar Chart
Creating a Doughnut Chart
Source Code and Notebook