Hands-On Big Data Analytics with PySpark

Loading data onto Spark RDDs

In this section, we look at how to load data onto Spark RDDs. We will cover the following topics:

  • The UCI machine learning data repository
  • Getting data from the repository to Python
  • Getting data into Spark

Let's start with an overview of the UCI machine learning data repository.