In this section, we will first explore the MSD (Million Song Dataset) that will be used for the regression analysis. Then we will show how to use PCA to reduce the dimensions of the dataset. Finally, we will evaluate the linear regression model for the regression quality.