In this chapter, we looked at Spark Notebooks for quick iterations. We then used sampling or filtering to pick out relevant data points. We also learned how to split datasets and create new combinations with set operations.
In the next chapter, we will cover aggregating and summarizing data into useful reports.