Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Machine learning isn't perfect

There are many caveats of machine learning. Many are specific to different models being implemented, but there are some assumptions that are universal for any machine learning model, as follows:

The data used is, for the most part, is preprocessed and cleaned using the methods outlined in the earlier chapters.
Almost no machine learning model will tolerate dirty data with missing values or categorical values. Use dummy variables and filling/dropping techniques to handle these discrepancies.
Each row of a cleaned dataset represents a single observation of the environment we are trying to model.
If our goal is to find relationships between variables, then there is an assumption that there is some kind of relationship between these variables.
This assumption is particularly important. Many machine learning models take this assumption very seriously. These models are not able to communicate that there might not be a relationship.
Machine learning models are generally considered semiautomatic, which means that intelligent decisions by humans are still needed.
The machine is very smart but has a hard time putting things into context. The output of most models is a series of numbers and metrics attempting to quantify how well the model did. It is up to a human to put these metrics into perspective and communicate the results to an audience
Most machine learning models are sensitive to noisy data.
This means that the models get confused when you include data that doesn't make sense. For example, if you are attempting to find relationships between economic data around the world and one of your columns is puppy adoption rates in the capital city, that information is likely not to be relevant and will confuse the model.

These assumptions will come up again and again when dealing with machine learning. They are all too important and often ignored by novice data scientists.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Machine learning isn't perfect

Create new playlist

Sign In

Sign Up

Machine learning isn't perfect

Table of Contents for
Machine learning isn't perfect