The diabetes dataset concerns 442 individual diabetes patients and the progression of the disease one year after a baseline measurement. The dataset consists of 10 features, which are the patient's age, sex, body mass index (bmi), average blood pressure (bp), and six measurements of their blood serum. The dataset target is the progression of the disease one year after the baseline measurement. This is a regression dataset, as the target is a number.
In this book, the dataset features are mean-centered and scaled such that the dataset sum of squares for each feature equals one. The following table depicts a sample of the diabetes dataset:
age |
sex |
bmi |
bp |
s1 |
s2 |
s3 |
s4 |
s5 |
s6 |
target |
0.04 |
0.05 |
0.06 |
0.02 |
-0.04 |
-0.03 |
-0.04 |
0.00 |
0.02 |
-0.02 |
151 |
0.00 |
-0.04 |
-0.05 |
-0.03 |
-0.01 |
-0.02 |
0.07 |
-0.04 |
-0.07 |
-0.09 |
75 |
0.09 |
0.05 |
0.04 |
-0.01 |
-0.05 |
-0.03 |
-0.03 |
0.00 |
0.00 |
-0.03 |
141 |
-0.09 |
-0.04 |
-0.01 |
-0.04 |
0.01 |
0.02 |
-0.04 |
0.03 |
0.02 |
-0.01 |
206 |