Big Data Analytics with PySpark + Tableau Desktop + MongoDB
Introduction
Source Code and Notebook
Python Installation
Installing Java (Optional)
Installing Apache Spark
Testing Apache Spark Installation
Installing MongoDB
Installing NoSQL Booster for MongoDB
Integrating PySpark with Jupyter Notebook
Data Extraction
Data Transformation
Loading Data into MongoDB
Data Pre-processing
Building the Predictive Model
Creating the Prediction Dataset
Installing Visual Studio Code
Creating the PySpark ETL Script
Creating the Machine Learning Script
Installing Tableau
Installing MongoDB ODBC Drivers
Creating a System DSN for MongoDB
Loading the Data Sources
Creating a Geo Map
Creating a Bar Chart
Creating a Magnitude Chart
Creating a Table Plot
Creating a Dashboard
Source Code and Notebook