You need the following to smoothly work through the chapters: Apache Spark (downloadable from http://spark.apache.org/downloads.html) Python