Calculating the descriptive statistics for your data is extremely easy in PySpark. Here's how:
descriptive_stats = no_outliers.describe(features)
That's it!
Calculating the descriptive statistics for your data is extremely easy in PySpark. Here's how:
descriptive_stats = no_outliers.describe(features)
That's it!