- Can you also tweak other hyperparameters, such as the max_df and min_df parameters in CountVectorizer? What are their optimal values?
- Practice makes perfect—another great project to deepen your understanding could be sentiment (positive/negative) classification for movie review data, which can be downloaded directly at http://www.cs.cornell.edu/people/pabo/movie-review-data/review_polarity.tar.gz, or from the page at http://www.cs.cornell.edu/people/pabo/movie-review-data/.