Clustering with Spark and SageMaker on EMR