The following are the features of the historical dataset data that we have:
Name | Type | Description |
NAME | Category | Identifies a particular vehicle |
CYLINDERS | Continuous | The number of cylinders (between 4 and 8) |
DISPLACEMENT | Continuous | The displacement of the engine in cubic.inches |
HORSEPOWER | Continuous | The horsepower of the engine |
ACCELERATION | Continuous | The time it takes to accelerate from 0 to 60 mph (in seconds) |
The target variable for this problem is a continuous variable, MPG, that specifies the mpg for each of the vehicles.
Let's first design the data processing pipeline for this problem.