Using IBM SPSS

The generally accepted process for preparing or tuning a data file for analysis in Watson is shown in the following figure:

[Figure: Using IBM SPSS]

The process for preparing a data file in Watson is described in the following steps:

  1. The data file is loaded into Watson.
  2. You can use Watson to review the quality of the data in the file.
  3. If the quality of the data is unacceptable, you can clean and/or reformat the data file and then reload the new version of the file into Watson.
  4. If the quality of the data is acceptable, you can begin your analysis.
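As a rough illustration of this load, review, clean, and reload loop, the steps can be sketched in plain Python. This is not Watson's API; Watson performs these steps through its own interface. The sample records, the helper names (load_rows, missing_rate, clean), the "Unknown" fill value, and the 1% acceptance threshold are all assumptions made for the example.

```python
import csv
import io

# Hypothetical sample data: the Payment Method column name comes from the
# text, but the records themselves are made up for illustration.
SAMPLE_CSV = """Order ID,Payment Method,Amount
1,Credit Card,20.00
2,,15.50
3,Cash,7.25
4,Credit Card,12.00
"""

def load_rows(text):
    """Step 1: load the data file into a list of dict records."""
    return list(csv.DictReader(io.StringIO(text)))

def missing_rate(rows, column):
    """Step 2: share of records whose value in `column` is empty or white space."""
    if not rows:
        return 0.0
    missing = sum(1 for row in rows if not (row.get(column) or "").strip())
    return missing / len(rows)

def clean(rows, column, fill_value="Unknown"):
    """Step 3: return a cleaned copy with missing values replaced."""
    return [
        {**row, column: (row.get(column) or "").strip() or fill_value}
        for row in rows
    ]

MAX_MISSING_RATE = 0.01  # assumed acceptance threshold, not a Watson setting

rows = load_rows(SAMPLE_CSV)
rate_before = missing_rate(rows, "Payment Method")
if rate_before > MAX_MISSING_RATE:
    # Quality is unacceptable: clean the data and "reload" the new version.
    rows = clean(rows, "Payment Method")
rate_after = missing_rate(rows, "Payment Method")
# Step 4: once the rate is at or below the threshold, analysis can begin.
```

Replacing missing values with a placeholder is only one possible cleaning choice; dropping the affected records or correcting them at the source may be more appropriate, depending on the analysis.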

Steps 2 and 3 can be performed entirely within Watson. However, you may want to extend the process with a tool that offers dedicated data preparation features, such as IBM SPSS Modeler.

For example, earlier in this section, we saw that the Payment Method column contained missing values in 3% of the records in the file. As it turns out, IBM SPSS Modeler is built to recognize several types of missing values:

  • Null or system-missing values
  • Empty strings and white space
  • Blank or user-defined missing values
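To make the distinction between these types concrete, here is a minimal Python sketch that sorts raw values into similar categories. It does not call IBM SPSS Modeler, and the rules Modeler actually applies may differ. The classify_missing function and the sentinel values in USER_DEFINED_MISSING are hypothetical names chosen for the example.

```python
from collections import Counter

# Assumed sentinel values that a user might declare as "missing".
USER_DEFINED_MISSING = {"N/A", "-99"}

def classify_missing(value, user_defined=USER_DEFINED_MISSING):
    """Return the kind of missing value, or None if the value is present."""
    if value is None:
        return "null"
    if isinstance(value, str):
        if value == "":
            return "empty string"
        if value.strip() == "":
            return "white space"
        if value in user_defined:
            return "user-defined"
    return None

# Made-up values for a column such as Payment Method.
values = ["Credit Card", None, "", "   ", "N/A", "Cash"]

# Count how many values fall into each missing-value category.
counts = Counter(kind for kind in map(classify_missing, values) if kind)
```

Keeping the categories separate matters because each one may call for a different fix: a null might be filled in, while a user-defined code such as "N/A" might carry meaning worth preserving.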