Getting ready

To execute this recipe, you need to have a working Spark environment. Also, we will be working off of the no_outliers DataFrame we created in the Handling outliers recipe so we assume you have followed the steps to handle duplicates, missing observations, and outliers.

No other prerequisites are required.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset