In this chapter, we established the framework for the different data processing units that will be introduced in this book. There is a very good reason why the topics of model validation and overfitting are explored early in this book. There is no point in building models and selecting algorithms if we do not have a methodology to evaluate their relative merits.
In this chapter, you were introduced to the following:
The next chapter will address the problem of overfitting by identifying outliers and reducing noise in data.