https://camera-optical-zoom.blogspot.com/2024/02/pyspark-2-kmeans-input-data-is-not.html
PySpark 2 KMeans: The Input Data Is Not Directly Cached