https://jmtirado.net/pyspark_analytics_tutorial/