How to do it...

Calculating the descriptive statistics for your data is extremely easy in PySpark. Here's how:

descriptive_stats = no_outliers.describe(features)

That's it!

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset